How do you verify what an AI system is actually doing, from training through deployment?
That question brought researchers and engineers together on May 17 for the Assurance & Verification of AI Development (AViD) Workshop, co-hosted by FAR.AI and the Center for AI Safety.
The technical foundations for AI assurance are still being built. As AI capabilities advance, the infrastructure for monitoring, auditing, and verifying these systems has to advance with them. AViD focused on the concrete use cases driving this work: domestic auditing, international treaty verification, and internal oversight for AI developers. Sources of trust spanned existing hardware (TEEs, hardware root of trust), novel hardware, and cryptographic methods.
Will Hodgkins (Center for AI Safety) opened and closed the day. Lightning talks featured:
▸ Shahin Tajik (Worcester Polytechnic Institute), physical verification against nation-state adversaries
▸ Koen van der Veen (OpenMined), PySyft v2 for auditing private models and user logs
▸ Bing-Jyue Chen (UIUC), efficient zero-knowledge proofs for AI inference
▸ Roy Rinberg (Harvard University), inference verification in a TEE
▸ Adam Chlipala (Massachusetts Institute of Technology), end-to-end formal verification of computing infrastructure
▸ Amean Asad (Confidential AI), scalable private inference for frontier AI
▸ quintus K. (Flashbots), trust in silicon
▸ Ari Juels (Cornell University), security tools from crypto for AI
The open questions surfaced throughout the day point to where this field needs to go next: making inference replay production-ready, operationalizing memory challenges to prevent unauthorized inference, scaling zero-knowledge proofs to frontier LLMs, verifying training at frontier scale, and establishing trust in verification infrastructure itself.
Rapid progress toward models capable of causing catastrophic harm makes responsible development more urgent than ever. Independent verification gives society the information it needs to respond and adapt. FAR.AI is committed to advancing the technical work that makes AI safety real.
Thank you to all the speakers and attendees moving this forward. Recordings to come in the coming weeks; follow FAR.AI to catch them.
What verification challenge feels most pressing to you? Let us know in the comments.