Frontier AI model evaluation

Discussed in 1 analyzed podcast episode across 1 show

Episodes

Latent Space: The AI Engineer Podcast · Feb 23, 2026

⚡️SWE-Bench-Dead: The End of SWE-Bench Verified — Mia Glaese & Olivia Watkins, OpenAI Frontier Evals & Human Data
