AI incidents, audits, and the limits of benchmarks

43 min

•Feb 13, 20265 months ago

Summary

Sean McGregor, founder of the AI Incident Database and co-founder of the AI Verification and Evaluation Research Institute, discusses AI safety through incident documentation, third-party auditing, and evaluation of AI systems. The episode covers how AI incidents are tracked, the limitations of current benchmarks, and the need for systematic approaches to AI safety verification.

Insights

AI safety requires systematic documentation and analysis of incidents, similar to aviation crash reporting, to prevent recurring failures
Current AI benchmarks are primarily designed for research purposes rather than practical deployment decisions, creating gaps in real-world safety assessment
Third-party auditing of AI systems is becoming essential as models become more general-purpose and harder to evaluate in specific contexts
The interface between multiple AI systems (like guard models and foundation models) often represents the weakest security point
Statistical rigor is necessary when evaluating AI vulnerabilities, as anecdotal exploits don't represent systematic security flaws

Trends

Shift from voluntary to mandatory AI incident reporting, particularly in EU regulationsGrowing need for specialized AI auditing and evaluation services as third-party industryEvolution of AI safety from context-specific to general-purpose system evaluation challengesIntegration of security and safety practices from other industries into AI developmentDevelopment of standardized flaw reporting systems for AI similar to traditional software bug bountiesIncreasing collaboration between security researchers and AI safety communitiesMeta-evaluation becoming critical as benchmark reliability comes under scrutiny

Topics

AI Incident Documentation Third-Party AI Auditing AI Safety Benchmarks Meta-Evaluation of AI Systems AI Security Vulnerabilities Mandatory Incident Reporting AI System Integration Risks Generative AI Red Teaming AI Verification Standards Statistical AI Security Assessment AI Governance and Compliance Foundation Model Safety AI Guard Model Limitations Systematic AI Risk Management

Companies

Prediction Guard

Daniel Whitenack's company, mentioned as CEO and show partner providing operational support

Lockheed Martin

Chris Benson's employer where he works as principal AI research engineer

AI Verification and Evaluation Research Institute

Sean McGregor's organization focused on third-party AI auditing and safety verification

Cintient

Company where McGregor worked on energy efficient neural network processors for consumer devices

Underwriters Laboratories

Safety organization that acquired McGregor's previous company's assets for AI safety work

OpenAI

Mentioned as example of frontier AI model company requiring third-party safety auditing

Anthropic

Cited as frontier AI model company alongside OpenAI and Google for safety evaluation needs

Google

Referenced for Gemini model as example of general-purpose AI system requiring safety verification

Allen Institute for AI

Provided the 7 billion parameter language model used in DEFCON red teaming exercise

People

Sean McGregor

Main guest, founder of AI Incident Database and co-founder of AI Verification Research Institute

Daniel Whitenack

Podcast host and CEO of Prediction Guard, leads discussion on AI safety and incidents

Chris Benson

Co-host and principal AI research engineer at Lockheed Martin, asks about safety definitions

Aishwarya

Colleague mentioned in connection to bench risk meta-evaluation work that connected hosts to guest

Quotes

"You don't want a bad thing to happen and that you don't want that bad thing to produce a harm. You don't want someone to say like, I've been impacted or some organization been impacted."

Sean McGregor

"The world is hard. The real world is real hard."

Sean McGregor

"You manage what you measure. And so let's measure these risks, and then we can separate the actors that are investing in safety and stronger AI systems, safer AI systems, from the ones that aren't doing that."

Sean McGregor

"Anecdote does not equal data in this instance. And we need you to show that it's systematically like pushing towards arson, that it's always going to burn things down."

Sean McGregor

Full Transcript

4 Speakers

Speaker A

Welcome to the Practical AI Podcast where we break down the real world applications of artificial intelligence and how it's shaping the way we live, work and create. Our goal is to help make AI technology practical, productive and accessible to everyone. Whether you're a developer, business leader or just curious about the tech behind the buzz, you're in the right place. Be sure to connect with us on LinkedIn X or Bluesky to stay up to date with episode drops, behind the scenes and AI insights. You can learn more at PracticalAI FM. Now onto the show.