Gemini 3.1 Pro, Claude Sonnet 4.6 & The OpenClaw Hire That Killed the Chatbot Era - EP99.35
The hosts discuss two new AI models: Google's Gemini 3.1 Pro and Anthropic's Claude Sonnet 4.6, analyzing their performance in agentic workflows and tool calling capabilities. They also cover OpenAI's acquisition of OpenClaw, criticizing OpenAI's lack of focus compared to Anthropic's consistent model improvements.
- Smaller, cheaper AI models like Haiku are performing nearly as well as frontier models in agentic workflows when given proper context
- The future of AI applications will likely use model mixing - different models for different tasks based on cost, speed, and accuracy requirements
- OpenAI's acquisition of OpenClaw suggests they're struggling to innovate internally and are buying distribution rather than building
- Tool calling accuracy and reduced hallucination are becoming more important than raw intelligence for practical AI applications
- Current AI model pricing may be unsustainable for mass enterprise deployment, favoring optimization toward smaller models
"I plan on running these things like mad. I plan on looping them for 80 iterations, 100 iterations. I can't afford to run any of the top line models at their current prices"
"When the models get it wrong, they get it wrong on such a scale. You're like, how do I unpick all the damage it's done?"
"If there's a bubble in AI, it's pricing a million tokens at $25 and beyond"
"No one can afford to, like, roll out the Anthropic top-tier models right now on a mass scale inside an enterprise"
Chris, this week we have not one but two new models: Gemini 3.1 Pro, released just a couple of hours ago, and also a new Claude Sonnet 4.6. But before we get into talking about the new models, a few little reminders. First of all, I heard you loud and clear: people wanted the song from last week, "Is This the End". A little reminder here.
0:02
AGI is coming and there's nothing we can do.
0:33
So it's now on Spotify.
0:38
So hopeful.
0:39
Yeah, it's very, very depressing, but it is on Spotify, so you can check it out by This Day in AI, "Is This the End", wherever you get your streaming music. The other quick reminder, and I promise I'll shut up next week about this because there are enough registrations now that we have a fair idea, but if you do want to hang out with us on our tour, fill in the form below and let us know where you're based in the world and what you want to hear from us. We are going to go on tour later this year and it should be a lot of fun. So fill in the form below if you're interested in that. Now we'll start with the latest drop from Google: Gemini 3.1 Pro. This is a supposed tune on what was Gemini 3 Pro, and it is allegedly better, performing at two times the level of Gemini 3 Pro on the ARC-AGI-2 benchmark. Although a lot of people are saying it was a little bit of benchmaxing, which is where they optimized the model for the benchmarks, and of course no one would do that. The key features of the model: it's still the million-token context window, and they're talking about how good it is at vibe coding, like SVGs. Again, I don't understand why this is a benchmark. But there is something new about the model, which is this thinking control. Do you want to speak towards the thinking control?
0:39
Also, when they released Gemini 3, they moved from having a thinking budget, which some of the other models do, which is a certain amount of tokens the model has to think before it gets to the real work, to a model where there's just low, medium and high. But Gemini 3 preview only had low and high. Now, in my experience, nobody wants low. No one's like, hey, I'm going to pay the max cost and use its least abilities. So we didn't even bother with low, so you really only had high as an option. But now they've introduced medium, which seems better to me, because medium is sort of their auto-switching mode. We see this with Anthropic, where you essentially go auto mode: the model decides if it needs to think for longer or for less based on the task, which is ideal. And then if you want high, you can switch to high. So that's really a big benefit. I mean, I'm yet to test it properly to see that it adheres to what it's saying, but in theory it's a better system.
2:10
The low option to me is kind of weird, because if your app needs low latency, you're just naturally going to go to Gemini Flash, right? Like, that's the brand of that product. And so it is a bit weird. For me personally, when working with these models, I do appreciate that it's just going to decide how much thinking to use at the time. And I must admit, when I use the Anthropic models, that is the setting I use them on. I don't always try to do the max, because then you're compromising speed, which I think is becoming increasingly important, especially as you start working on multiple agentic tasks. Like, I want a faster experience, I want to be able to iterate through things a lot quicker.
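The low/medium/high trade-off described here can be sketched as a simple request builder. To be clear, this is a hypothetical sketch, not the real Gemini API: the `thinking_level` field name, its values, and the model string are assumptions for illustration only.

```python
# Hypothetical sketch of per-request thinking control.
# The "thinking_level" parameter and its values are assumptions
# based on the discussion, not a verified API surface.

def build_request(prompt: str, latency_sensitive: bool, hard_task: bool) -> dict:
    if hard_task:
        level = "high"      # force maximum reasoning for serious problems
    elif latency_sensitive:
        level = "low"       # in practice you'd likely reach for a Flash-class model instead
    else:
        level = "medium"    # the auto mode: let the model decide how long to think
    return {
        "model": "gemini-3.1-pro",   # illustrative model name
        "prompt": prompt,
        "thinking_level": level,
    }
```

The point of "medium" as a default is exactly what's described above: you only pay for long thinking when the task warrants it, instead of always forcing max and compromising speed.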
3:14
Now I agree with you strongly on that. I used Opus 4.6 Max for the first half of this week pretty intensely, because I had some serious stuff I needed to solve and I just didn't want to muck around. And, I mean, I know we're not talking about that right now, but the model is just brilliant. It's absolutely amazing what it is capable of when you have it in that mode. The problem is, and I think we mentioned this last week, if you're working on a single project, or even single kinds of tasks across multiple projects, because the system is now actually making changes itself, it can't really go and do another task in the same project, because there's too much chance that it's going to cross over. Right? So you're really in this almost sequential way of working, and you need to work on totally disparate tasks in order to do it. And therefore the speed of the model on Max, I mean, you're waiting probably fully two minutes at least to get a full response. It's just so long. It really slows you down in terms of your interactivity with performing work. So I would argue, yeah, you're right: a model that's going to gauge how much time is needed is a lot better than just forcing it to max.
3:56
Yeah. And I think there are ways of working async, in the sense of doing all the fancy stuff like git worktrees and so on, but at the end of the day, and a lot of people out there are commenting on this as well, it's just not how humans work. As developers especially, people have spent so long talking about context switching, and now they proudly are like, oh, I switch, including me on this podcast, between, like, 70 tabs at once. Aren't I special? But the reality is, in the first few weeks of the show, being back for the year, I was honestly finding myself continuously anxious because I was multitasking all the time, losing track of things, and I felt I had lost control. And losing control makes you anxious. So this week I've actually slowed it all down, and I have a rule: max two tabs open, max two things at once. That definitely reduces the anxiety, but it also reduces the surface area I have to keep in, like, my RAM, and I find I'm way more effective as a result. So I agree with you in the sense that the next improvement there is just: get me faster at those two tabs, rather than having to have some complex workflow.
5:10
Yeah, I agree with you there. The main word in my head is "solid". I want solid, where there's no backtracking. Once this update is done, I don't need to then go back and look at all the other things that it broke or affected. And I find that so far, working agentically with basically every model I use, I'm able to do that. I'm able to make these solid updates where I'm moving forward and not having to go back and be like, oh God, I broke so much other stuff, and now I've got to get distracted and fix it. Because that's the real time sink: getting distracted by things that have been broken on a large scale. The thing is, when the models get it wrong, they get it wrong on such a scale that you're like, how do I unpick all the damage it's done? And this applies not just to coding, but to documents, images you're creating, diagrams, presentations. You can't be in a position where it makes mistakes. I think that's an argument for letting it go longer with the bigger models. But then again, for a lot of tasks the smaller ones are able to do it too. So there's a trade-off that's hard to define. It's hard to know whether a smaller model would have done this better or not. In a way, you're just guessing.
6:33
Yeah, what a time to be alive, in the sense that we're now so spoiled for choice about models. And I said to you before we started recording the show today: when Gemini 2.5 Pro came out, I was just blown away by that model at the time, because it was really the first model where you had the million-token context window. It could take in so much context and do a great job with it, and it also had, I think, the biggest increase at the time in output tokens, so it could spit out just truckloads of stuff: whole documents, code, whatever you desired. It felt like a sort of Claude Sonnet 3.5 moment to me, where they had really gone after this context window and solved it well. I liked the tune of that model. I still like that model; occasionally I'll play around with it. But it feels like with Gemini 3 Pro, to bring it back to 3 and now 3.1 Pro, they really lost a lot of that essence of what was good about the model. It also entered into a time where we got a lot of great options, especially Opus 4.6. That came out not that long after Gemini 3 Pro and kind of stole all the tailwind Google had recreated with Gemini 3 Pro. So the question coming back today, for me, to 3.1 Pro is: well, okay, this has got to be really compelling at this point for me to come back to it. And I don't know if it's that compelling. It just doesn't seem to have kept up with the times, which is the models moving to agentic loops for most tasks now. In my initial impression, and it's very early and I will test it much longer, it just doesn't seem like they have made great strides in tool calling compared to the Sonnet tunes and the Opus tunes. I was the same
7:45
as you: Gemini 2.5, for a long time, and you can verify this on our podcasts, was my model. I used it for everything. It was the core of everything I did, and it was that long context that was just so amazing. What absolutely killed the Google models in my mind was when Gemini 3 came out and it would just simply forget the point of what you were doing in a task. I immediately assumed it was our fault, because I'm using this through SIM Theory, right? It's our fault, we've got a bug; there's something about the long context where we're not giving it all the information. And I checked it manually. I would print out the prompts and check that they were correct, to make sure we were giving it all the information. And it was simply forgetting within the one prompt. It wasn't forgetting over a multi-turn conversation; it would be a single shot, and within that it would forget what it was doing. So for me that model was just dead, and I basically never use it now. Every now and then it's good for some long-context thing, but in my mind it was just gone. A model that was once my best one went to nothing. So, as you said, when the new version comes out, my initial thought is: well, I hope they've fixed that, because it could be back to the best. But right now we're drowning in brilliant models. And honestly, my model thinking is not between which frontier model I'm going to use; it's more like, how can I get the most out of the reasonably priced models? Because I plan on running these things like mad. I plan on looping them for 80 iterations, 100 iterations. I can't afford to run any of the top-line models at their current prices to do the kind of things I'm doing. Yes, I'm doing it now because I'm experimenting and developing the system, but ultimately, if I'm going to use this on a regular basis, I need a model that's actually affordable and can do the same job.
So really for me, I'm not even comparing amongst the frontier models anymore. I'm saying this is what a frontier can do and this is what a smaller, better price model can do. And that's my comparison.
9:53
I think Google really came out with a bang late last year. And I remember my initial impression of Gemini 3 Pro, in its preview form, was: wow, this is really good. But then quickly, after daily driving it, as you said, it had that path issue, and I know a lot of our listeners who used it quite extensively felt this and agreed, so it's a pretty common opinion of it. You would ask it to do a task and it became obsessed with that initial request and would just be very repetitive, unless, and this is bizarre, you yelled at it, called it an idiot and abused the hell out of it, and then it would somehow self-heal and go in different directions. But it became abundantly clear to me that it was the kind of model that was a one-, two-shot thing, and in a continuous loop it was going to deteriorate. And also the hallucination of those models, that lineage of the 3 series, is horrific. They hallucinate like mad. And so, yeah,
12:00
when you're doing tool calls that take real-life actions, as in modifying documents and code and things like that, hallucinations are your worst enemy, because that's where you can absolutely destroy your time, realizing it's made these massive mistakes. So it really is the most important factor. You can't have that level of hallucination in a frontier model.
13:12
Yeah, and I think the reactions are really mixed on it right now. I was just going through X and Reddit and, you know, all the fun places on the Internet, and it is interesting that there seem to be a lot of Google fanboys out there in those communities saying all the typical things, like it's insane, I used it with OpenCode and it's fast and it follows instructions, yada yada. But then you've got a lot of people saying that it tries to do huge regex replaces in files, which is essentially like cutting and pasting in a file, and it's causing issues, and it's pretty horrific as an agent. Interestingly, when we would first put these models into SIM Theory, I always found we'd have to do quite a bit of tuning to get them to feel right. And maybe because we were so obsessed with Gemini 2.5 Pro back in the day, probably six or seven months ago, we had tuned it quite well for these models. So I just did a side-by-side test, and we'll get to Claude Sonnet 4.6 in a moment, but if I put them side by side here, you've got the exact same prompt in these two windows: make a Geoffrey Hinton doom center that helps Geoffrey monitor the situation. First research what he might be monitoring for, then use HTML, CSS and JavaScript in a single code file to create the doom center. So I just wanted the most basic test to see: can it go do some research, can it then create our doom center?
13:35
It does sound like a standard benchmark to me.
15:15
Yeah, of course. And so you can just see the difference side by side. You've got Gemini 3.1 Pro: it does a single search and then comes back so confident and definitive with what it found. And maybe that's okay; it's not necessarily terrible, it did the task. And then you've got Claude 4.6. Oh, I actually used 4.6 Opus for this, so, whatever. But it goes and calls three very different searches, like Geoffrey Hinton's AI warnings, Geoffrey Hinton's AI existential risk concerns. It does it async: it calls three tools and figures it out. But then, for whatever reason, it calls the new document editor instead of, I guess, putting the code in a code file. So its prompt adherence was actually not that great. It called the wrong tool, and I had to say, no, don't do that, do it in code. Right. And so then you just compare the doom centers. I've got the Geoffrey Hinton one here, you know, whatever. It's the typical sort of vibe-code joke thing at this point; everyone's seen them. It's styled fine. It looks like the original Claude, probably 3.7 Opus, or Sonnet rather. And then you've got the Opus Claude one, which has clearly advanced. It's fully themed; it looks like some sort of control panel. Stylistically the tune is just so much better. And again, I don't love these things, because I don't think they represent real-world usage and it's not that helpful. But just as an initial side-by-side comparison of how both models handle the problem, it's interesting to me. The speed at which the new Gemini 3.1 Pro operated was fascinating. Its ability to output tokens and just go bam with the code is fascinating. But I think it's also that bam thing that becomes a problem, where it goes off, it hallucinates a bit, it makes all these changes, and then you find yourself in this position where you're like, oh no, how do I undo this? What has it done?
Like, it's gone nuts.
15:18
And your example about getting the greps and the search-and-replace, the regex stuff, wrong is actually really important now, because it seems to me like everybody is moving to this Claude Code slash OpenClaw slash whatever kind of agentic workflow, where we're realizing that working with files on a computer is actually better than a lot of the alternatives. So, for example, we're seeing a lot more work where we say, okay, let's dump all the context that we're working with to a drive somewhere and let's use code to manipulate it, to get the actual pieces of context that matter, and even in some cases writing code to make decisions and then using those decisions in the model. This is as opposed to taking all of that context and throwing it into a 1-million-token Gemini window. So a model like Claude, which only has 200k context without, you know, the beta flag, is still able to produce powerful insights on data by understanding the problem, manipulating the context to the point where it extracts what it wants, and then actually only putting that piece into the prompt itself. So it's cheaper: even though the model itself costs more, it's using fewer tokens overall, and it's actually more accurate. And a model not being able to do those things, like you're describing with Gemini, where it's inaccurate in the way it manipulates files, is a much bigger deficiency than it might seem, because this is the way we're seeing the best results right now. The thing that's actually leading to the better results is not something it's strong at. And I was actually saying this to you earlier today, because I was really concerned about context windows, thinking, okay, well, the Claude ones only have 200k, so as you move along with a task you're going to run out.
But with the rise of sub agents and with the sort of workflows where you're working through maybe 80 tasks in a row, you actually don't want that big of a context window because you can't do it. You can't do 80 iterations of something with a million tokens in each one. Even at Gemini's pricing, that's $160 to get through a task. Whereas if you're using the techniques I just described, where you're getting all the context in one place, writing small amounts of code that run with decision processes in there, output that information and put just that in the context, maybe 2,000 tokens, 4,000 tokens, something like that, and all the decisions are made on that basis. You can get through that entire 80 step process only having used, say 200,000 tokens and get the same or a better result than having this massive context window where you're just leaning on the AI as a crutch to be able to handle that context window. So I would argue larger context window and not accurate and hallucinating the tool calls. That is far worse than a tiny context window and the ability to accurately do what you're told.
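The extract-then-prompt pattern and the cost arithmetic just described can be sketched roughly as follows. Only the $2-per-million input price implied by the $160 figure and the 80-iteration, 1M-token workload come from the discussion; the `extract_relevant` helper, the keyword-matching approach, and the ~2,500-token extract size are illustrative assumptions.

```python
# Sketch: instead of stuffing ~1M tokens of raw context into every
# prompt, write small code that extracts only the relevant slice,
# then prompt with that slice.

def extract_relevant(corpus: str, keywords: list[str]) -> str:
    """The 'code that manipulates context' step, done outside the
    model: keep only lines mentioning any keyword."""
    keep = [line for line in corpus.splitlines()
            if any(k.lower() in line.lower() for k in keywords)]
    return "\n".join(keep)

def est_cost_usd(tokens_per_call: int, calls: int, usd_per_million: float) -> float:
    """Input-token cost for a multi-step agentic run."""
    return tokens_per_call * calls * usd_per_million / 1_000_000

# The $160 figure from the discussion: 80 iterations x 1M tokens
# at roughly $2 per million input tokens.
naive = est_cost_usd(1_000_000, 80, 2.0)   # 160.0
# vs. ~2,500 tokens of extracted context per step:
lean = est_cost_usd(2_500, 80, 2.0)        # 0.4
```

The two orders of magnitude between `naive` and `lean` is the whole argument: accurate file manipulation plus a small context beats a huge context window used as a crutch.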
17:39
Yeah, and I think a big part of the consideration, as you say, is cost and efficiency as well. Do you actually need the context window to be as big when you're able to use tools on a computer and extract, or loop through, or fetch what you need at the time you need it with a sub-agent? And for those still thinking, what the hell is a sub-agent, because I know there are a lot of you out there: it's really where you break down a problem and say, I'll delegate this. Imagine a workplace: I'm going to delegate this to a specialist agent, or just an agent with a very specific prompt, for some data I need. It's going to go run that with a full, fresh context window, fetch that data, and then decide what to hand back. And so, yeah, that can add up really quickly, because if you spin up 20 sub-agents for a given task to speed up the overall problem solving, you can just absolutely burn tokens. And I think at the moment, because a lot of these applications are fighting for user interest, there's a lot of token subsidization happening. It feels like they're probably making a loss on some of this stuff, right? And so I think that's where you kind of need to learn optimization techniques at this very moment in time. But then I also think, and I mentioned to you that DHH, and I'll bring it up now.
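The delegation pattern just described can be sketched minimally like this. The `call_model` function is a placeholder standing in for any real LLM client; nothing here references an actual API, and the 2,000-character summary cap is an illustrative assumption.

```python
# Minimal sketch of the sub-agent idea: the parent delegates a focused
# question to a worker that runs with its own fresh context window and
# hands back only a compact answer.

def call_model(prompt: str, context: str) -> str:
    # Placeholder for a real model call; returns a stub describing
    # how much context the worker actually saw.
    return f"summary({len(context)} chars of context)"

def run_subagent(task: str, full_context: str, max_summary_chars: int = 2000) -> str:
    """Worker: sees the full context, returns only a short answer."""
    answer = call_model(f"Answer only this: {task}", full_context)
    return answer[:max_summary_chars]

def parent_agent(main_task: str, big_corpus: str) -> str:
    # The parent never loads big_corpus into its own context;
    # it only ever sees the compact summaries handed back.
    facts = run_subagent("What data is relevant?", big_corpus)
    return call_model(f"{main_task}\nRelevant facts:\n{facts}", context="")
```

The token-burn caveat from the discussion shows up here too: each `run_subagent` call pays for its own full context window, so 20 parallel workers multiply cost 20x even though the parent stays cheap.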
20:49
Say, say his full name.
22:21
No, I'm not going to. David Hand Miser Hanson. Criticize me in the comments below for my pronunciations. "Kimi, Kimi K2.5 on OpenCode Zen is hilariously cheap. I bought $20 worth of tokens two weeks ago and I still have $10.89 left after 3 million tokens." Man, he mustn't use it.
22:22
Billionaire.
22:44
Yeah. 3 million tokens. If there's a.
22:44
He's sort of like coding Jesus, isn't he? He's sort of all that is virtuous and good. He moved off Amazon and did it on private hosting just to save money. He never took investor dollars; he mocked the people who do. He's sort of a purist in all senses of the word.
22:49
Yeah, but then he's saying, if there's a bubble in AI, it's pricing a million tokens at $25 and beyond. I do agree with him. I think a lot of people who have gone off and run things like OpenClaw did get a bit of a reality check, especially as providers shut down different ways of sort of hacking the usage, where they're like, oh, I have a personal assistant on Telegram, it's great, but I spent $300 in a single day and I'm being hacked.
23:08
And what's crazy is that the model providers, as you say, probably spent even more on that dumb Telegram assistant. They're probably actually losing money on deals like that. And I reckon the pricing, when they put it that high, is just pure fear-based. They're like, oh my God, if this gets used too much, we're dead. The billions in the bank are not going to be enough.
23:35
Yeah, well, it must be, because otherwise you would just win market share by not charging. So, just coming back to rounding out the Gemini 3.1 Pro discussion: I'm sure if you use Gemini in Google's products, you probably won't notice that big of a difference, if anything. And I don't know if it still suffers from that tunnel-vision pathway problem we've spoken about, so I just need to test it more to figure that out. But I don't think day-to-day users of the Gemini stuff will really notice. I do think, given Apple's decided to use Gemini in Siri, this could be a serious problem, right, in their strategy, because it's not great at tool calling and tool looping and this, call it, OpenClaw-ism. So to me, if I was at Google now, I'd be saying, and I think all model companies should do this now: we need two variants of the model. We need an agentic variant, so Gemini 3 Agent, and call it what it is: it's Gemini 3 Agent. If you want to do agentic workflows and tasks, you use Gemini 3 Agent, and you get a team and you optimize the tool calling and agentic looping with that specific model; you specialize. Then you have Gemini 3 Chat, and that's for turn-by-turn conversation, though they probably need to improve tooling with that as well. But I think this is what they all should be doing. And I have noticed a lot of people saying about the new Claude Sonnet 4.6 that one of the problems is it's lost the creativity it used to have and that they used to love. I don't know how they know that immediately, but that's what a lot of people are saying. And I think that is the trade-off, right, of optimizing for these computer-loop agents: you take away the creativity and the art student in them and you go towards the engineer.
And I think this is kind of the problem right now with these models: everyone is going so hardcore into the agentic loop stuff, and the models are getting optimized for it. Instead of releasing variants where it's like, oh, this is slightly cheaper and a quantized version, cough, Claude Sonnet 4.6, they should have, like, Claude Opus Agent, Claude Opus Chat or whatever. I don't know. And then maybe it's like Claude Opus Cowork or whatever, and that's optimized for knowledge work outside of code. I can kind of see it maybe going there. I just don't think we're going to be in a world anytime soon where it's one model to rule them all.
23:55
Yeah, I agree. I think the variants are good. And actually, interestingly, I haven't had a lot of time to test the new Gemini model, but in the testing I have done, it seems like when it gets in that agentic loop, it's all business. There are no tokens outside of tool calls. It'll just go: what's the task? Tool call, tool call, tool call, tool call until done. No chit-chat. And it's actually interesting; it might be better, because I've noticed with Sonnet 4.6, for example, it's very chatty. It's like, I'm doing this, I'm doing this, and oh, just let me think. It's constantly questioning itself and rethinking things, and maybe that leads to a better result in the end. But I'm like, just shut up. Like, just.
26:40
I mean, Codex. That's the thing about Codex, why I occasionally still switch to it: it just gets on with it. And it's mean. It's not your friend. Like, I'm not
27:19
going to read this. I'm not going to read 16 pages of text about all of the different decisions you've made about which lines of the file you're going to change. Just change them.
27:31
But I think it's by design. It's just by design; they think out loud. Their models think out loud. You know that "but wait, but wait" thing it does, where it's like, actually, I've got a better way. But wait. Actually. Yeah, it drives me mad.
27:40
Yeah. Although I do like when it says: but actually, the changes are already complete. This is done. You're all good.
27:54
My prediction, and remember, at the end of last year we did our predictions: I said I think OpenAI will have the best agentic coding model at the end of this year. I still maintain it. I think they're gonna make it.
28:01
Get on Polymarket, make the bet. Although it's disappointing, because Polymarket's banned in Australia now and I haven't taken the time to get a VPN to get around it. So I don't know what they're predicting at the moment.
28:14
Yeah, it's not blocked for me, though. I don't know if it's just you who's been banned. Like, have you got some sort of gambling block on your computer because you're an addict? I don't know. I feel like you're just making this up.
28:25
Maybe that's what it is. I don't know. But I can't get to it. If I try to go to the website, it's just a DNS error.
28:36
All right, we better, we better get on to the next model. 30 minutes in.
28:42
Oops.
28:48
So this one is Claude Sonnet 4.6. And again, to my earlier point on all this: do you even get excited about new models anymore? Because there's so much I'm still trying to work my way through. Trying out, like, the GLMs, which in the agentic loop, as we've been saying, and this is not definitive, are performing on par with, if not faster than, say, Claude Opus for pretty day-to-day things. Right. And they're.
28:49
Well, yeah. To add to my point about the new Sonnet 4.6 being so chatty: if you try the same prompt with Haiku, for example, it sort of just does the work. It's the right balance for me. It'll tell you what it's doing, but it won't go into elaborate detail, and it tends to just get through it and get it done. It's faster, it's cheaper, and so far I haven't seen it making wild mistakes or hallucinating things or breaking things. It just sort of works. So for me that's been my go-to model for anything agentic, really, because it's the right balance of things.
29:21
Yeah, so this is the thing too. Let me bring up the exact pricing. You've got Claude Opus 4.6 positioned at $5 per million input tokens and $25 per million output. And then you've got the new Claude Sonnet 4.6 at a $2 discount on the base input tokens: $3 per million input and $15 per million output. So it is truly, you know, maybe the sweet spot in the middle. But as you said, I find it is a little bit chattier, a little bit dumber, and it does feel like some slightly quantized version of Opus 4.6. The feeling I get from that model is: how can we run Opus 4.6 cheaper? And I'm sure that's really what it's about: getting that level of agentic model far cheaper and being able to deploy it more broadly. But at that price point, you could argue maybe that is a good thing, because you can use it as your primary, and then in your sub-agents that go off and read files and do stuff, you could use something like Haiku, which is only a dollar per million, and then things get a lot more affordable. And if you can get, like, 25 cents per million tokens out of GLM or a Kimi K2.5 and run that as your sub-agent, then, yeah, it really does become an equation on price for a lot of this stuff. If you're building agentic use cases off these models at this point, it's just weighing up: what level of intelligence do I need, and how do I drive down the price? Because at the end of the day, for a lot of the new agentic use cases that a lot of SaaS startups are going to head towards, and a lot of new businesses are going to be built on, this stuff's really important; it's all about your gross margin as you start to build this stuff out. And at least initially, you can go wild and use your VC dollarydoos to try a bunch of stuff out.
But at some point, and I think this is going to happen with Anthropic, and I think it's going to happen with OpenAI, they've got to make money, and they've got to pay for all this stuff they've spent all the dollary dues, or billies, in the bank on. And so they're going to need to charge maybe a lot more, or maybe do a lot of optimization. So it definitely is becoming something that we need to weigh up. But I also think at this point we're all kind of delusional if we think that we can tell the difference between Sonnet 4.5 and 4.6. Because I am at a point now where I can barely tell the difference between running GLM, Sonnet or Opus outside of very specific niche tasks, where I'm like, oh, I want a more creative flair, or I want this. But just in terms of that CRUD operation in an agentic loop, all of these models are starting to feel very similar.
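To put rough numbers on the price equation being described here, a quick back-of-the-envelope sketch. The per-million-token prices are the ones quoted in the episode (Haiku's output price is an assumption, only its $1 input price was mentioned), and the token counts are invented purely for illustration:

```python
# Rough cost comparison for an agentic run, using the per-million-token
# prices quoted in the episode. Token counts are invented for illustration.
PRICES = {  # model -> (input $/M tokens, output $/M tokens)
    "opus-4.6":   (5.00, 25.00),
    "sonnet-4.6": (3.00, 15.00),
    "haiku":      (1.00,  5.00),  # output price is an assumption, not from the episode
}

def run_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Dollar cost of one run, given token counts for a model."""
    in_price, out_price = PRICES[model]
    return input_tokens / 1e6 * in_price + output_tokens / 1e6 * out_price

# A hypothetical long agentic loop: 10M input tokens, 1M output tokens total.
all_opus = run_cost("opus-4.6", 10_000_000, 1_000_000)

# Mixed stack: Sonnet as primary (2M in / 0.5M out), with Haiku sub agents
# doing the bulk of the file reading (8M in / 0.5M out).
mixed = (run_cost("sonnet-4.6", 2_000_000, 500_000)
         + run_cost("haiku", 8_000_000, 500_000))

print(f"all Opus: ${all_opus:.2f}, mixed stack: ${mixed:.2f}")
# → all Opus: $75.00, mixed stack: $24.00
```

Even with made-up token counts, the shape of the equation is the point: pushing the high-volume reading work down to a cheap sub-agent model is where most of the saving comes from.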
29:58
Yeah, I think good tool definitions and skills help with that significantly. And I think that's probably why we're seeing the smaller models almost improve in terms of how they compare, because as the techniques for using them get better, they become more empowered. I mean, we spoke about this for so long. You give me the old GPT-3.5, the original ChatGPT, and with modern techniques you could probably get a lot of what we're getting out of the agentic loops anyway. So I think part of it is just the way you use it. And to answer your question, I'm just pleased I can have almost like a model mix for the kind of things I'm doing. And I actually think this is probably where we'll get to, where you're saying, well, when I work on this kind of task for my business, here's the model mix I use. You know, I use Sonnet 4.6 as my primary, I use Haiku as my sub agent, and then I use GLM5 as my, you know, shell executor, the one that actually goes off and runs the hardcore commands or something. Some sort of stack where you actually have the models playing to their strengths, where you're getting the right combination of cost, speed and accuracy and getting that mix right. And I think that is probably the best way to work. You just can't use the same model for everything, because some models will be overkill, some won't be able to do it, some will hallucinate, some won't have enough context, or whatever it happens to be, or speed. And I think a mix is going to be a great way to work, at least in the medium term.
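The model-mix stack being described could be sketched as a simple routing table. The role names and the table itself are hypothetical, just a minimal illustration of routing each agent role to the model whose cost/speed/accuracy trade-off fits, with a cheap model as the fallback:

```python
# A sketch of the "model mix" idea: each agent role gets routed to the
# model whose trade-offs fit it. The roles and assignments are hypothetical,
# loosely following the stack described in the conversation.
MODEL_MIX = {
    "primary":        "sonnet-4.6",  # plans, reviews, synthesizes the work
    "sub_agent":      "haiku",       # reads files and summarizes: cheap, fast
    "shell_executor": "glm-5",       # runs the hardcore shell commands
}

def pick_model(role: str, default: str = "haiku") -> str:
    """Return the configured model for a role, falling back to a cheap default."""
    return MODEL_MIX.get(role, default)

print(pick_model("primary"))        # sonnet-4.6
print(pick_model("code_review"))    # haiku (unconfigured role falls back)
```

The design point is that the routing lives in one place, so when a provider "comes out with a good deal", as discussed later, you swap one table entry instead of rewriting the agent.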
33:06
Yeah. And we just constantly. There's that meme that goes around where it's like, introducing the best model, and it's Anthropic, and then it's OpenAI introducing the best model, and it sort of goes in this loop. I would say at the moment it's pretty clear Anthropic is well and truly on top. Although a lot of developers are starting to favor the new Codex 5.3, and I think once that's pushed out in the API, it will become increasingly popular. Personally, trying it out through the week a bit more, unless you use it on like full max mode, I just find it a bit dumb, is the honest truth. But it's really fast. But again, you're not willing to trade intelligence for speed, so that becomes a problem.
34:40
It's like it's really fast but like the damage it does is kind of crazy.
35:27
Yeah. But I also still find myself, for day-to-day interaction stuff, like where I'm just going back and checking emails and things, just using Gemini Flash. Even as low as 2.5 Flash, it's fine. It does the job, it's fast, and it's got a huge context window. I'm sure there are downsides, but unless it's super important work, again, it's the mix of speed and intelligence. I'm a huge fan of Haiku as well; I use that a lot. So anyway, back to Sonnet 4.6. They've really given no details on this model. Here's how it benchmarks: basically, you're paying $2 less per million tokens and you're losing, what, five, six percentage points roughly, or about five and a half percentage points on the agentic benchmarks, if you're worried about that. But everywhere else, like computer use, you're losing 0.2 of a percent, and you're saving, according to these benchmarks, which I don't really believe, a couple of dollars there. So yeah, I don't know. It's just a hard balance, and a lot of these balances are made at the moment by a Claude subscriber going into the app, right, having limits on the different types of models, hitting the limit, and then forcibly having to switch to what in their view is a lesser model. But it's not so much the case anymore.
35:32
Yeah, I guess it's sort of. I was thinking it's sort of like. Well, it's like alcohol, right? Like you can drink this hundred dollar bottle of wine or you can drink this $20 bottle of wine. They taste a bit different, but they both get you the same amount of drunk and probably after a little while you can't tell the difference anyway. Right?
37:02
Yeah. Yeah.
37:19
So it's a bit like that. But. Hang on. If someone offers you a glass of the nice wine, what are you going to do? You're not going to be like, give me the garbage wine.
37:19
But you know, one thing I noticed about garbage wines is you get a really bad hangover. The hangover is far worse. And maybe that's the problem with Gemini 3.1 Pro, because it's really cheap, right? $2 per million tokens, $12 for output. But the hangover, when it destroys a bunch of your work, would be bad. I feel like I'm dissing this way too hard given that I've barely used it, so I do want to apologize. Next week I reserve the right to come back gushing over it, saying it's the best agentic model ever and I was completely wrong, and we can do one of those YouTube thumbnails where it's like, we were wrong, and do the clickbait.
37:33
I've never been this wrong in my life.
38:14
Yeah, this changes everything.
38:16
It's finished.
38:17
Yeah, I guess because Opus 4.6 is such a good model, and the new Codex is really impressive too, that sets a high bar for new model releases. So unless you're beating that bar at this point, I do become a lot less interested. If Sonnet had come out at a dollar per million input, the same price as Haiku, I would have been far more excited about that release.
38:20
Yeah, I think that's the thing though. They couldn't, they couldn't support it because they would have like a server meltdown from people just hammering every request. Everyone can switch so quickly.
38:47
So let's switch now to the OpenClaw Anthropic OpenAI saga that has unfolded. As a reminder, OpenClaw, which was called a bunch of other names prior to this, is an open source AI agent that people are yolo-installing on Mac Minis and various things. It was originally called Clawdbot, and this guy Peter Steinberger created it. It got a whole lot of stars on GitHub, and that's like a measure of success in the coding world. Trying to explain this stuff out loud is totally ridiculous. Anyway, people were using it because they were hacking their Claude Max subscriptions, or token, and using it through this OpenClaw, which is essentially like Claude Code, but it, you know, has better memory, uses a bunch of markdown files for memory, uses some CLI tools, which I guess Claude Code can do, and then tunnels through your own computer. So it runs as an agent on your own computer, but you can access it on, say, Telegram. Now, the terms of service of Anthropic say you can't do this, and I don't know if they enforce it. There's a lot of debate: some people were saying they're enforcing it, others not. So a lot of people started pivoting to other models, like GLM and Kimi K2.5, to get a good price per token in their agents. But obviously, originally being called Clawdbot, it was optimized to use Anthropic's models and a hacked Claude Max subscription to run, so there was obviously a trademark problem with that. But then more recently, as the branding and hype and distribution grew, everyone's trying it out, and there's all this media hype about them having their own social network, even though people set up cron jobs for it to post, with a skill that directs them to post pretty weird stuff. All of a sudden everyone was talking about how someone should acquire OpenClaw. And so OpenAI did just that, only five days ago.
Now Peter Steinberger joined OpenAI, saying that they wanted to bring agents to everyone, even though it was already open source and accessible anyway, and, you know, they offered him a lot of dollary dues, and
38:58
he wanted to bring money to his bank account.
41:22
Yeah. So they've opened a foundation for OpenClaw, so it can keep purring away.
41:24
Yeah, companies have got a good history with the whole open foundation thing.
41:33
OpenAI, you know, it's in the name. Open AI, Open Claw. Makes a lot of sense.
41:36
It does make sense.
41:41
Steinberger has been a really big advocate for the Codex models. He loves them; he vibe-coded OpenClaw with Codex models. So he's a huge fan of them, and I think that also probably led them to do it. But let's call this what it is. None of the things in OpenClaw are that hard to do. If anything, pretty much anyone could build these things, and OpenAI are very capable of just recreating it, maybe copying the code base exactly, or just doing a lot of the stuff it did, right? So you sort of say, well, okay, why did they acquire it? And I think it's partially just to get OpenAI back in the zeitgeist, we're talking about it, and also, I think, largely just the distribution. It's become a brand overnight. Everyone's like, oh, how do I create an OpenClaw agent? So I think they're partially buying Steinberger and also the distribution. But what kind of troubles me about the whole thing is that they've got all the talent, well, at least 40, 50 percent of the talent. They've got so many billies in the bank, they're about to raise like a hundred billies. It's like, guys, where are the updates? Where are these things? Why didn't you build this first? It's almost humiliating that OpenClaw came out and sort of took this personal AI agent brand. And now, I don't know, the whole thing just feels yuck to me. Like, why didn't you do this? What are you doing?
41:42
Partly it comes back to a point you made a long time ago, which is, I wonder how much the higher-ups at OpenAI actually use it. I wonder how much they're daily-driving AI models and using them all the time. Because I think a lot of people have come to the same conclusions. They're like, you know what's going to work better, this kind of model. And then that's why we've got this whole simultaneous invention thing: Claude Code's out there, OpenClaw came out, we're working on similar things. It sort of is a logical progression; we're all learning at the same time the best way to work with the models. And I just wonder if they're not doing that, because some of the things that they concern themselves with and have announced are just totally opposed to the evolution of the technology, and the models aren't that suited to it either. They're probably better, I would say, than Gemini at this point, especially Codex. But I don't know, I just wonder if they're really as into it as the other people are.
43:18
Yeah, it seems like they sort of. I mean, it seems laughable to say it out loud, but remember the Sora app buzz for a few weeks? And they seemed to think, oh, this is crushing it, having all these disparate projects, like Google. And we kept saying they need to just focus on the model. The model is all you need. And Anthropic, the whole time, have been focusing on the model and the use cases people actually use this stuff for. And I think they hit the nail on the head in particular with Claude Code, because they built it, and as the models improved, it got better. Then they were all using it internally, and if you use the product you build every day, the product becomes way better. That's always the secret sauce of a great product. And with OpenAI, yeah, I agree with you. They're clearly not. I don't think any of them were using Codex; they had to send a memo around saying, you've got to use it.
44:17
It's like how people at Amazon have to use Chime for their calls.
45:10
You've mentioned that for the 600th time on this podcast.
45:16
But they won't admit it. Oh, no, I love it. Makes me happy.
45:18
Yeah. So anyway, it'll be interesting to see now whether OpenAI come out in ChatGPT with some sort of commercialized OpenClaw app and stuff. And, you know, my guess would be, when Apple announces some new Siri, OpenAI will then drop it, and maybe eventually we'll get some device, because they've got Jony Ive. Remember that love affair video they recorded, and that beautiful picture they had together? If you go back through all the ridiculous things they've done, it's truly mental, and it's just like bubble economics playing out in front of our eyes. Whereas again, you've got Anthropic, and I would also say probably equally Google, going, oh, how do we make these a business? How do we.
45:24
Yeah, how do we deliver some things people want. Especially Anthropic, it's sort of like a relentless pursuit towards a unified goal. They've been pretty consistent with what they've announced and what they've done, and
46:10
not wasting time on video models and audio models and all this other stuff, just going, we're just going to do this one thing. Yeah, it's really starting to become a lesson in focus, I think. And it'll be interesting to see if OpenAI are focused right now. But I don't get it. You just see no updates, and any update you do see, you're like, why would you even do that?
46:23
They haven't even rolled out the nerds in a while. Like, we haven't seen a. There's been no gaggle stream announcement.
46:44
We don't get gaggles anymore. I think, you know, all the automated AI agents have probably replaced them. I think the gaggles are just gone. They're not being fired, but they're being sort of hidden in a back room at this point. So anyway, with all this talk about models and agentic loops and all this stuff, having used all these in agentic loops now, like GLM 4.6, Kimi K2.5, Sonnet, Opus, do you feel like you've used them enough now that these things can and will become commoditized? Or are we always going to live in this sort of closed-lab frontier model world where you've got to pay your power bill? I mean, I guess you've still got to pay it for the.
46:50
I just think when it comes to the real world, like when it comes to mass rollouts and using the technology on a larger scale, the smaller models will win every time, because no one can afford, and I don't care what people say, no one can afford to roll out the Anthropic top-tier models right now on a mass scale inside an enterprise. You simply can't do it. If you work in these agentic loops, right, no matter what you do, if you're doing a complex enough task, you're going to use millions of tokens, and millions of tokens cost $5 per million just for input. It really, really adds up. And yes, I believe there are tasks that are worth that. There are definitely tasks that are absolutely worth the money and you should do it. But that's on a small scale; that's your sort of elite people in an organization who can justify their usage of that stuff. I would argue when you want to get it out to large groups of people, where they're all going to benefit from the technology, even if it is, you know, profitable overall, it's going to be hard to prove, and I just can't see people allocating the kinds of budgets they'd need to do that. I think what they need to do is look at what trade-offs they're willing to make and what they get for that. And when it comes to the agentic looping, the trade-offs are really, really simple. Some of these models are just brilliant at the agentic loops, Haiku, for example. I would argue a lot of the time you would really struggle to tell the difference. It's probably only that Opus and, say, Sonnet are just more verbose; they just say more stuff, but the result is the same. And so, having used it a lot, I actually see less difference between the models. And yes, there are some quirks and things like that, but when you just look at, I gave it a task and in the end it got the task done, then the models are quite similar in that respect.
Like, it's kind of rare that you come up with a task that one of them.
47:41
This is when it's in a loop, when it's being fed the cherry-picked context. So maybe why Haiku is performing quite well there is that it doesn't hallucinate much. So if you give it the right context, it can therefore give you the best output.
49:45
That's precisely what's going on. And this idea of a sub agent, where you're giving it a very specific prompt, all of the data it needs, and a set of tools that allow it to accomplish that task. We've spoken about this for a long time. We're at the stage of ultimate context building, where you can build very, very specific tasks, and then you've got your overarching process that's synthesizing all that together. And my argument would be that the best combination is to have your best frontier model running the overall process, creating the plan, evaluating the plan, seeing where we're at with regard to the plan, but then the work is done by the smaller models. That works really, really well. And the other thing is, working in this mode, you're not trying to single-shot everything. You're not going to try to have this brilliant model that just solves everything day one. The little models constantly make mistakes. They're like, oh, I called this tool and it failed. And instead of giving up, it's just like, oh, I know what I did, I got this parameter wrong. It tries again, and it works the next time. And sometimes that takes a ton of loops, but the idea is that it gets there in the end, and it's doing it in a much more efficient way in terms of both time, because it's fast, and money. And I would say that the days of having a model that has to single-shot everything and get it perfect are over. It just isn't needed anymore, and the models aren't even optimized for that anyway.
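The retry-until-done loop being described here can be sketched in a few lines. This is a minimal, hypothetical illustration, not any particular framework's implementation: `call_model` and `run_tool` are stand-ins for a real model API and tool runtime, and the message shapes are invented for the sketch:

```python
# Minimal sketch of the agentic retry loop described above: a small model
# proposes a tool call, and when the tool fails, the error is fed back into
# the conversation so the model can fix its parameters and try again,
# rather than needing to single-shot the task perfectly.
# `call_model` and `run_tool` are hypothetical stand-ins.
def agent_loop(task, call_model, run_tool, max_iterations=100):
    history = [{"role": "user", "content": task}]
    for _ in range(max_iterations):
        action = call_model(history)          # model proposes the next step
        if action["type"] == "done":
            return action["result"]
        try:
            output = run_tool(action["tool"], action["args"])
            history.append({"role": "tool", "content": output})
        except Exception as err:
            # Don't give up: show the model what went wrong and loop again.
            history.append({"role": "tool", "content": f"error: {err}"})
    raise RuntimeError("task not completed within iteration budget")
```

The design point is that correctness comes from the loop, not from any single model call, which is exactly why a cheap, fast model that occasionally fumbles a tool parameter can still finish the task.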
50:02
Yeah, it's definitely changing the way we use the models, and therefore how you think about the pros and cons of each model. But I think that's the thing we've been calling out for a while with at least the GPT-5 variants, not Codex, and it's why all the focus now is on Codex, not those original models: the nature of how we use models today has changed. But I would also argue that right now the VC dollars are there, so people can afford to do these things. These subscriptions are at a point where, if you're willing to pay enough, it will feel near unlimited. But the challenge is going to be, over time, building out true businesses or specialist apps on top of these things. At some point, all of these valuations have to be realized in the market. So it just feels like for a lot of these providers the only way is up; they have to do the Uber-style switcheroo at some point, like, this now costs a lot.
51:27
I do think that the top models are going to end up costing more, which is why I think we should all be looking to optimize for the smaller models now, because I think that really is the future. And also, it's not as big of a trade-off as it seems. Like DHH said, you're not giving up that much by using these smaller models. You can still probably get most of it done, and it's only when you want to write diss tracks or, you know, do extremely creative things that you might need to switch to a frontier model. But I think the important thing is switching. It isn't just banking everything on one model. Having the flexibility to change when someone comes out with a good deal, or something that just suits your workflow better, is a position you should be in. Locking into a single vendor and a single model is obviously not the right way to go.
52:26
Well, obviously out there at the moment you've got the big AI conferences on, and you've got our man Dario, head of the safety sex cult, also on my necklace, my Dario necklace, out there saying, like, all white-collar jobs are going to go, I don't know what to do, hold me back, guys, the world's changing. But then, to get a bit grounded, you just have to go to the Anthropic jobs page, which someone did. So AI is going to kill software, Salesforce is ruined, yet Salesforce is so complicated and shit to administer that they can't get Claude Code or any of these things to do it. They've got an ad out for a Salesforce administrator in San Francisco. So if you're listening and you're a Salesforce administrator, your job's safe, you're fine. Because clearly, if they're hiring,
53:18
I still remember paying someone over a hundred thousand dollars a year to operate a piece of software. Remember that?
54:15
Yeah.
54:22
Like, it's just wild that they can get away with that and make so much money doing it.
54:22
Yeah, it is. It's. Now for another sort of lol of the week, just to play us out here, this video. So I think it was on yesterday, this India AI Impact Summit, and they tried to stage this photo shoot where they all held hands and put their hands together in the air. And you've got Sam Altman next to Dario, obviously CEO of Anthropic, and they famously split up early on because Dario claimed that Sam was unsafe. They're all holding their hands up in the air, but there's this awkward moment where they go to hold hands, and Sam Altman pulls his hand up and away, and Dario's kind of looking like, should I hold his hand or not? And then they end up just putting their hands, like, kind of like fists together near.
54:27
Still participating in the celebration, just not touching one another.
55:20
I just think it's so funny. It shows the tension. But what's even more comical about it is it feels like a scene out of Silicon Valley. It's like the purest form of comedy. It's almost like a writer wrote this scene of these two guys being so awkward standing next to each other. It's worth a watch.
55:26
I mean, look, it'd be pretty awkward if I had to hold hands with other men as well.
55:48
Holding your hands? Yeah.
55:52
Like, what are they actually celebrating?
55:55
I don't know. I didn't watch the summit. I just don't listen to their marketing anymore. It's exhausting. Speaking of exhausting, our podcast. Thanks for making it to the end. All right, any final thoughts? Gemini 3.1, Anthropic Sonnet 4.6, Chris?
55:57
I'm definitely going to give 3.1 a really, really good shot this week. I haven't had enough time to play with it, and I look forward to reporting on it next week. I think probably the thing I don't like is it's always this preview thing. They've always got this out: oh, well, it's just a preview model, so what do you expect? I don't like that. One thing you've got to give Anthropic is, if they release something, they really release it. Whereas OpenAI announces Codex and you can't really use it unless you're in their ecosystem. Gemini announces it, but it comes with a caveat: oh, is it real, is it not, it's changing every other day, that kind of thing. I don't like that. If you're going to put something out there, you should put it out there, especially if you're charging money for it. I understand if it was free, that's fine. But if you're charging for it, you need to have something you're willing to stand behind.
56:19
It is strange too, because OpenAI used to be amazing at that. It was like, same-day API drop. And now they're holding it back, literally claiming they'll release it when they can make it safe. That's the claim in their blog post, which is just utterly ridiculous.
57:06
Yeah, yeah. It's kind of that catch-all word, right? It's like, oh, well, you know, think of the children. Okay, yeah, true, children are important, we'd better not.
57:20
So don't forget, on Spotify or wherever you get your music: Is This the End. Let's drive this song right up there. We still have, I think, over a hundred monthly listeners to these tracks. It did peak higher than that, but I think everyone got over it. Don't blame you. But yeah, it's on there if you want to listen to it. All right, we'll see you next week. Bye.
57:31