Appears On
Episode Appearances
Latent Space: The AI Engineer Podcast · Feb 23, 2026
⚡️SWE-Bench-Dead: The End of SWE-Bench Verified — Mia Glaese & Olivia Watkins, OpenAI Frontier Evals & Human Data
“OpenAI Frontier Evals team member, co-creator of SWE-Bench Verified”
SWE-Bench Verified deprecationAI coding benchmark contaminationSWE-Bench Pro adoption
View Analysis