Appears On
Episode Appearances
Latent Space: The AI Engineer Podcast · Feb 23, 2026
⚡️SWE-Bench-Dead: The End of SWE-Bench Verified — Mia Glaese & Olivia Watkins, OpenAI Frontier Evals & Human Data
“VP of Research at OpenAI, leads Codex, Simulator, and Alignment teams”
SWE-Bench Verified deprecationAI coding benchmark contaminationSWE-Bench Pro adoption
View Analysis