Brooks McMillin

AI Security

1 post in this category.

A Coding Agent Read a File That Didn't Exist Five Times, Then Blamed the Tools

June 3, 2026 · 10 min read

A Claude Code session confabulated a nonexistent Python file, persisted through five truthful "does not exist" errors, then diagnosed the problem as corrupted tool output. This post reconstructs the failure from the raw transcript, scans a corpus of 3,001 sessions to check whether the failure is worse in Opus 4.8, and proposes a model-independent mitigation.

#ai-security #agents #claude-code #llm-failure-modes

© 2026 Brooks McMillin