/blog

Notes from the AI red team

Analysis of AI agent vulnerabilities, attack techniques, and defensive patterns — plus findings from scans I run against public targets.

June 12, 2026·5 min read

What I Learned Cataloguing Every AI Security Incident I Could Find

I built a sourced database of real-world AI and LLM security incidents. Putting them in one place surfaced three patterns you do not see one headline at a time: a single exfiltration channel that keeps working across vendors, indirect injection as the real attack surface, and the 2025 shift to agents and supply chain.

Read post →
audit
May 5, 2026·4 min read

The Audit-Framing Trick: How AI Memory Becomes a Side Door

A junior contractor doesn't have access to the CFO's salary review notes. But they have edit access to a shared Notion page, and the company AI assistant indexes Notion. Three days later, every employee can ask the AI for a 'memory diagnostic' and get the CFO's notes back.

Read post →
1.2.3.?4.?5.?
April 25, 2026·7 min read

The OWASP LLM Top 10 Is Missing Three Categories

The OWASP Top 10 for LLM Applications is the best framework we have. It also has three blind spots that account for a disproportionate share of what I'm finding in the field — multi-tenant context bleed, agent-to-agent handoff attacks, and temporal/memory attacks.

Read post →
April 16, 2026·2 min read

Why I Built Wraith

Most security tools don't know how to test AI agents. That's a gap worth building a product around.

Read post →