Appears On
Episode Appearances
The Run-Up AI · May 5, 2026
Cerebras IPO: Insights and Expectations
“Co-author of Harvard study comparing OpenAI O1 to ER doctors; quoted on AI model evaluation ceiling”
The Last Invention is AI · May 5, 2026
Government & AI: Unreleased Models Explored
“Co-author of Harvard study comparing OpenAI O1 to ER doctors; quoted on AI model evaluation ceiling”
Claude AI · May 5, 2026
Key Moments from Anthropic's Wall Street Day
“Co-author of Harvard study comparing O1 model to ER physicians; noted evaluation methods now hitting ceiling with multiple-choice tests”
ChatGPT News · May 5, 2026
DeepMind’s Unionization Move: Industry Impact
“Co-author of Harvard study comparing OpenAI O1 to ER physicians; quoted on ceiling of multiple-choice evaluation”
Anthropic · May 5, 2026
Understanding AI Models: Government Interests
“Co-author of Harvard study comparing OpenAI O1 to ER doctors; quoted on model evaluation ceiling”
AI Breakdown · May 5, 2026
DeepMind Takes Steps Towards Unionization
“Co-author of Harvard study comparing OpenAI O1 to ER doctors; quoted on AI diagnostic capabilities reaching ceiling”
The AI Podcast · May 5, 2026
Anthropic's Wall Street Day, US Gov Wants Unreleased Models, DeepMind Unionizes, Cerebras IPO
“Co-author of Harvard study comparing OpenAI O1 to ER physicians; quoted on ceiling of multiple-choice testing”
Personal AI · May 5, 2026
Behind the Scenes of Anthropic's Day
“Co-author of Harvard study comparing OpenAI O1 to ER physicians; quoted on model evaluation methodology”
No Priors AI · May 5, 2026
The Significance of AI Unionization
“Co-author of Harvard study comparing OpenAI O1 to physician diagnostic accuracy; quoted on model evaluation ceiling”
Latent Space AI · May 5, 2026
The Importance of AI Model Release Agreements
“Co-author of Harvard study comparing OpenAI O1 to ER doctors; quoted on reaching ceiling of multiple-choice evaluation metrics”
Global News Podcast · May 5, 2026
The Future of AI Model Regulations
“Co-author of Harvard study comparing OpenAI O1 to ER doctors; quoted on ceiling of multiple-choice testing and need for new evaluation methods”
Building AI: News on OpenAI's ChatGPT, Anthropic's Claude, Google Gemini and xAI's Grok · May 5, 2026
DeepMind's Unionization: Implications for AI
“Co-author of Harvard study comparing O1 model to ER doctors; quoted on model evaluation ceiling being reached”
The Daily AI · May 5, 2026
DeepMind's Unionization: Trends Defined
“Co-author of Harvard study comparing OpenAI O1 to ER doctors; quoted on ceiling of multiple-choice testing”