The a16z Show

How Foundation Models Evolved: A PhD Journey Through AI's Breakthrough Era

57 min
Jan 16, 2026
Summary

MIT professor Omar Khattab discusses his framework DSPy and argues that the path to AI progress isn't through AGI but through 'artificial programmable intelligence' - building structured systems around language models rather than relying on raw model scaling. He advocates for formal abstractions that allow developers to declare intent without drowning in implementation details.

Insights
  • The industry has moved away from believing that scaling model parameters and pre-training data alone will solve AI problems, now focusing on post-training pipelines, retrieval, and tool use
  • Natural language prompting is too ambiguous for building reliable AI systems, while traditional programming is too rigid - a new abstraction layer is needed that combines both
  • AI systems should be built as programmable, modular systems rather than monolithic models, similar to how software engineering evolved from assembly to higher-level languages
  • The real challenge isn't model capabilities but specification - helping humans articulate what they actually want from AI systems in a structured way
  • DSPy represents a paradigm shift from imperative to declarative programming for LLMs, allowing developers to specify intent while the system handles optimization
Trends
  • Shift from pure model scaling to systems-based approaches in AI development
  • Growing emphasis on post-training optimization and human feedback integration
  • Movement toward declarative programming paradigms for AI systems
  • Increased focus on AI system composability and modularity
  • Evolution from prompt engineering to formal AI programming frameworks
  • Integration of reinforcement learning techniques with language model optimization
  • Development of context-scaling techniques for handling longer inputs
  • Emergence of 'artificial programmable intelligence' as an alternative to the AGI pursuit
Quotes
"Nobody wants intelligence, period. I want something else, right? And that something else is always specific, or at least more specific."
Omar Khattab
"It's not a problem of capabilities. It's a problem of: actually, we don't necessarily just need models, we want systems."
Omar Khattab
"I'm interested in API or artificial programmable intelligence. And the reason I say this is why are we building AI? I think fundamentally it's in my opinion a way of improving and expanding the set of software systems we can build."
Omar Khattab
"That idea that scaling model parameters and scaling just pre-training data is all you need exists nowhere anymore. Nobody thinks that; actually, people deny they ever thought that at this point."
Omar Khattab
"The question sounds to me like: don't you have chairs at home? Don't you wish that they all looked like tables? I need both."
Omar Khattab
Full Transcript
3 Speakers
Speaker A

Nobody wants intelligence, period. I want something else, right? And that something else is always specific, or at least more specific. There is this kind of observed phenomenon where if you over-engineer intelligence, you regret it, because somebody figures out a more general and maybe potentially simpler method that scales better, and a lot of the hard-coded decisions you made are things you end up regretting. So I think it's fair to assume that models will get better and algorithms will get better and a lot of that stuff will improve. Then the question we really ask is: intelligence is great, but what problems are you actually trying to solve? That idea that scaling model parameters and scaling just pre-training data is all you need exists nowhere anymore. Nobody thinks that; actually, people deny they ever thought that at this point. Now you see these massively human-designed and very carefully constructed pipelines for post-training, where we really encode a lot of the things we want to do. You see massive emphasis on retrieval and web search and tool use and agent training. There is clearly a sense in which the labs have already recognized that the old playbook doesn't work. The question is, is that actually sufficient for making the best use, and the most use, of these language models? It's not a problem of capabilities. It's a problem of: actually, we don't necessarily just need models, we want systems.

0:00

Speaker B

The conventional wisdom says we're racing toward AGI by making language models bigger and bigger. But what if the entire framing is wrong? On today's episode, you'll hear from a16z general partner Martin Casado and guest Omar Khattab, assistant professor at MIT and creator of DSPy. Omar doesn't think we need artificial general intelligence. He thinks we need artificial programmable intelligence. And the difference matters more than you think. Here's the paradox. Khattab has built one of the most widely used frameworks for working with LLMs, DSPy. But he's skeptical that raw model capabilities will solve our problems. While others obsess over scaling laws and parameter counts, he's asking a more fundamental question: even if models become infinitely scalable, infinitely capable, how do humans actually specify what they want? Natural language is too ambiguous. Code is too rigid. We need something in between, a new abstraction layer that lets us declare intent without drowning in implementation details. Think of it as the jump from assembly to C, but for AI systems. The stakes are higher than prompt engineering. This is about whether AI becomes a programmable tool we can reason about and compose, or just an inscrutable oracle we prompt and pray to. We get into the three irreducible pieces of an AI system, why the model god is a dead end, and what it actually means to build software when intelligence is cheap but specification is hard.

1:14

Speaker C

Well, listen, Omar, it's great to have you and congratulations on everything. Just so for everybody that's listening. Omar is doing some, in my opinion, of kind of the more interesting technical work in building frameworks around LLMs and models. And, you know, a lot of this has consequences on things like, you know, AGI and capabilities and everything else. And a lot of your comments on social media to me have been kind of some of the most insightful. So I've been really looking forward to having you on the podcast.

2:37

Speaker A

Thank you for hosting me, Martin. Great to meet you guys and chat as well.

3:04

Speaker C

Awesome. So listen, maybe let's just start with your background, you know, since we have some shared roots and then we'll go from there to a general conversation.

3:06

Speaker A

So, I mean, I'm now an assistant professor at MIT. I started a few months ago in electrical engineering and computer science, and I'm part of CSAIL. I did my PhD at Stanford, where I think the timing was really interesting. I started in 2019 and I graduated about a year ago. That timing was really great because foundation models as a concept didn't even necessarily have that name; we hadn't coined it at Stanford yet. The idea was just starting to take shape. BERT had been around for about a year at the time, but people sort of hadn't really figured out how to make these models work. But, I would say as importantly, how to make use of them to build different types of systems and applications, which is basically what I did throughout my whole PhD.

3:16

Speaker C

So, I mean, you're, I presume, the primary person behind DSPy, is that correct?

3:55

Speaker A

You could say that, yeah.

4:00

Speaker C

Yeah, yeah. So for those of you who don't know, DSPy is widely used. We're going to be talking about it. It's one of the most widely used, I would say, open source projects around prompt optimization for LLMs. So maybe let's just go ahead and start. You have tweeted about whether LLMs will get to AGI or not. I know it's a kind of very fluffy, high-level place to start, but I would love your thoughts: are we headed towards AGI in the near term? Is this an apt goal? Where do you land? And it's particularly timely right now given the conversation that Andrej Karpathy just had on the Dwarkesh podcast, where he was like, well, maybe 10 years, if you're optimistic. Where do you weigh in on this debate?

4:01

Speaker A

So, I mean, I think honestly it's a surprising position, because I feel like, I'm not sure, but I'm less, sort of say, bearish than Karpathy, necessarily, you know.

4:44

Speaker C

You are less bearish than Karpathy on AGI.

4:52

Speaker A

Right. Which is very strange to me. But let me tell you what I think. So back when I started my PhD, basically you could look at a lot of the work that we've done, with my advisors and collaborators and others over the past six years or so, as pushing back on this perspective that scaling model size, and maybe doing a little bit more pre-training (and especially at the time it really was about model size, just doing more uniform scaling of that nature), is just going to solve all of your problems. And the pushback has two sides. One side is that this is an incredibly inefficient way to build capabilities that you care about, if you know what you want. Waiting for everything to emerge is just incredibly inefficient, and the diminishing returns speak for themselves. The other problem is really a problem of specification, or of abstractions. Scaling language models makes this, I think, unrealistic bet that anything people want to build with these models is just a few words away, and that people know how to actually think of what those words should be. I think it's an incredibly limiting abstraction. But the reason I'm less bearish than maybe Karpathy sounded, although again I'm not really sure, is that I think we're seeing very rapid improvement in the perspective that we see out of the frontier labs. That idea that scaling model parameters and scaling just pre-training data is all you need exists nowhere anymore. Nobody thinks that; actually, people deny they ever thought that at this point. And now you see these massively human-designed and very carefully constructed pipelines for post-training, where we really encode a lot of the things we want to do. You see massive emphasis on retrieval and web search and tool use and agent training.
And you see all of this emphasis on, you know, OpenAI at their latest event was building this agent builder, and they have products like Codex and others. So there is clearly a sense in which the labs have already recognized that the old playbook doesn't work, or at least that it's not complete. And so if by AGI we just mean this thing where, for a very large set of problems, you can ask it those problems and, as long as you give it enough context, it's able to handle them, well, the models are increasingly powerful and reliable. The question is, is that actually sufficient for making the best use, and the most use, of these language models? And I think that's where my fundamental pushback doesn't go anywhere. Because it's not a problem of capabilities, it's a problem of: actually, we don't necessarily just need models, we want systems. And I can speak a lot more about that.

4:58

Speaker C

Yeah. So I just wanted to dig into that a little bit. So there is a view of the world that some variant of the transformer architecture is going to get us there. And then the end-to-end argument kind of suggests that you put all the data into one model, and you have one model that will just become so good, because scaling laws hold, that it solves all of reasoning. Right. That's kind of this absolutist end-to-end argument.

7:38

Speaker A

I think nobody believes that anymore anyway.

8:07

Speaker C

I think people do in video. Maybe not in LLMs, but in video, I think a lot of people are like, listen, there will be one video model that you put everything in. It does everything: it does 3D, it does physics, it does whatever. So maybe in LLMs people don't believe that anymore, because they've been at it for long enough to suggest it's not true. There's another view, which is that LLMs are totally a dead end. What did Karpathy call them? Ghosts, which I thought was so beautiful, which is, you know, they can kind of do some sort of linear interpolation of stuff that they've heard in the past, but they can't do planning. And so you need an entirely new architecture. And you're saying that you're not in that camp of needing an entirely new architecture?

8:10

Speaker A

It depends, because I've been arguing for a different architecture for years. But that different architecture is built around having these models.

8:47

Speaker C

No, no, a hundred percent. Yeah, that was the third one I was going to say. So the first one is, like, one model rules them all. The second one is, this is the wrong path and there is no kind of system you could build with these models; you've got to do something totally different. Right. I would say, like, Yann LeCun would say that with JEPA or whatever. Like, you need to do something fundamentally different. And then you are in this third spot, which is: you can build some sort of system with these models, and you can get to, I mean, AGI is such a loose word, but you can actually get to what we're trying to achieve, which is pretty generalized intelligence to tackle any sort of problems. Is that a fair characterization?

8:54

Speaker A

I think so. I mean, I think AGI is fairly irrelevant. Like, it's not the thing I'm interested in. I joke sometimes that I'm interested in API, or artificial programmable intelligence. And the reason I say this is: why are we building AI? Why are we seeking to build AGI? You can take a step back and ask, well, maybe it's a scientific question, or maybe it's just a dream people have, but I think fundamentally it's, in my opinion, a way of improving and expanding the set of software systems we can build, or just systems we can build in the world. And if you think about why people build systems, software systems as an example, but really any engineering endeavor, it's not really that we lack general intelligences. There are billions of general intelligences out there: there are 8 billion people. We build the systems because we want them to be processes that are reliable, interpretable, easy to iterate on, modular, that we can study, that are scalable and efficient. There is a reason we care about systems, and it's not that we lack intelligence. So the question that I think is most important is: how do we build programmable intelligences? And I think the alignment folks get some of this right. You could have a very powerful model that doesn't listen to what you say, and a lot of pre-trained models could be perceived that way. They have a lot of latent capabilities, presumably. And the question is, could you make it do what you want?
But I think what alignment fails to do, at least as a general way of thinking, is that it omits to think about, well, what is actually the shape of the intent that people want to communicate to these models? How can I get people to actually express what it is that they want to happen? And with that bottleneck being as narrow and tight as it is, it's not a question of whether the models are capable enough or not. So that's why I'm saying I might be even less bearish than Karpathy about whether the models will get so good that, given all the right context and the right instructions and the right tools, they become powerful. Yeah, maybe. I think this is very aligned.

9:27

Speaker C

Again, not to refer to another discussion that's not on here, but just in general: you take issue with the definition of AGI as being the same thing as an animal or a human, which is not actually particularly interesting given there's a bunch of animals and humans. But we actually want smarter software systems, and you think a systems-based approach to models is the right way to get there. Is that fair enough? So it's not going to be one model, it's going to be a set. Then can you maybe roughly sketch out what you think is the right way to build a system to do this? What are the components that are meaningful in this?

11:33

Speaker A

So I would say the first inspiring concept here, or the starting point for this conversation, is: look, to be honest, I have no idea what the core capabilities of the models will be today, tomorrow, in a year, in 10 years. I just don't really know. And I'm invested in getting a sense of how that will happen and progress. And it's easy to model different paths based on how you think the progress that's been happening has been happening. But in any case, there is kind of a bitter lesson to keep in mind. And I don't necessarily mean Rich Sutton's own interpretation of his great essay. I just mean it is true that there is this kind of observed phenomenon where, if you over-engineer intelligence in AI, you regret it, because somebody figures out a more general and maybe potentially simpler method that scales better, and a lot of the hard-coded decisions you made are things you end up regretting.

12:14

Speaker C

Yeah.

13:08

Speaker A

So I think it's fair to assume that models will get better and algorithms will get better and a lot of that stuff will improve. And then the question we really ask is, well, intelligence is great, but what problems are you actually trying to solve? What is the application that you want to improve? Are you trying to, I don't know, help doctors do medicine? Are you trying to improve certain types of research, cure cancer maybe? Are you trying to build the next Codex or Cursor, or one of these types of coding applications? So the question is, what are you actually trying to solve? And I would argue that intelligence is this really amazing, powerful concept precisely because it's a foundation for a lot of applications. And the analogy I like to draw here is improvements in chip manufacturing and increasing numbers of transistors in CPUs. Nobody thinks that more powerful general-purpose computers make software obsolete or make us forget about systems. The way to think about it is that they make software possible, but you still need to have a stack. So back to your question: what should the stack look like? I think the first thing we need to agree on is, what is the language, so to speak? What is the medium of encoding of intent and of structure with which we can specify our systems to that computational substrate?

13:09

Speaker C

So, yeah, could we approach exactly this question? I have a line of inquiry on exactly this question, which is: what is the right language to specify with? So I'd love you to tell me why this is the wrong approach. Let's assume I'm an advocate for the God model, right? Models just keep getting better. There's one model that keeps getting better. Let's say that my task is software, and I want to build the game Core War.

14:29

Speaker A

What is it called?

14:55

Speaker C

Core War. Okay, this is a very old hacker game from, like, the 1970s, where you would write programs that would try to kill each other. So let's say I want to build an online multiplayer version of Core War. What is wrong with the following approach? I have a prompt that says: I want to build a multiplayer version of Core War that's online. And that's my prompt. And then I just sit and wait for models to get better. Why is that not the right approach?

14:56

Speaker A

So actually, something about what you said is great. You just said it: you expressed the thing you want, and you were so lucky that the thing you wanted was easy enough to express. You're assuming that the speaker in this abstract hypothetical scenario is being honest; what they want is to build that particular software you mentioned, they fully specified it in a single sentence, and they're not doing anything else. They're waiting for the models to get better. The only issue I have with this, by the way, is that, well, I don't know how long you're going to wait, but if you're comfortable with it, that's actually endorsed by me. The problem is, as you're probably trying to hint at: well, for most things people want, especially most things that don't exist yet, there is no five-word statement that even the best intelligence in the world is going to do for you. And there is really a nontrivial amount of alignment-ish... like, that's such a loaded...

15:26

Speaker C

That is such an important statement for this discussion, which is: there is no, say, simple way to describe what you want. There are multiple ways to interpret why that is. One of them is: I don't know what I want. Another is: my wants are complex, so I want to use a lot of words. And another one is: there are actually fundamental trade-offs, so my wants would be ambiguous. Are you talking about all three of those?

16:18

Speaker A

Yeah, I'm talking about all three. When it comes to actually getting people to express what they want from a system, I mean, the premise I start from is: people want systems. Nobody wants intelligence, period.

16:44

Speaker C

This is such a great point. I actually really like how you said that. I hadn't thought about it that way.

16:56

Speaker A

But I don't want better GPUs, right? I want maybe a neural network. I want something else, right? And that something else is always specific, or at least more specific. And so the question is, what is the number of things people can want? So if the vision of AGI... and by the way, the reason I said I'm not necessarily super pessimistic in practice is that the frontier AI labs kind of tackle these things one at a time, but they've had enough of a track record for me to reasonably expect that when they reach a bottleneck, they go to the next thing and unblock themselves, right? So that's great. But at some point, there's a view of AGI which is GPT-3, the original GPT-3, but scaled 1 million times or 1 billion times, and you get a GPT-10. And it's that GPT-10 that you go to in order to not build a system, right? In order to just treat it as the end-user-facing system. Every time you go, you juggle your context and you juggle your prompting, which, maybe because the model is so good, might not be that hard. And you ask it from scratch every time, or some ridiculous thing like that. And I think in the grand scheme of things, people are slowly realizing that's obviously not what you want. And so this is the argument for systems: all of this decision-making that happens in making a concrete application or product is a thing that encodes taste and knowledge about the world, and also knowledge about human preferences, or some substrate of a complete story that you want. And it systematizes it, it encodes it, it makes it maintainable and portable and modular, because that's all the stuff we like to have in building systems. And the moment you start thinking that way, you don't want that to be a blurb, like a string blurb.

17:00

Speaker C

Yeah. So, I mean, I don't want to get too philosophical, but for me this always begs this very interesting question. Let's just take what you're saying at face value, which is: I have a lot of complex wants, and those shift over time, and so a string will never encapsulate them. And so I'll want to say a whole bunch of stuff and maybe pull some context in. But it could be the case that these models are so powerful that I just start to abdicate want. Do you ever think about that? I'll just want less. I'll be like, I want whatever the model gives me. Do you think that there's any direction in the future where we just are less picky about our actual wants and we converge to these high-level things? Or are you really convicted that...

18:42

Speaker A

No, this is totally possible. I mean, recommendation algorithms versus like search algorithms. Recommendation algorithms are like, give me what I want.

19:24

Speaker C

Yeah. So literally, my universal prompt, the one I could take to the beach and, every time there's a new model, just go use on that new model rather than building a complex system, would be: give me what I want right now. Right, right.

19:30

Speaker A

And over time that model can train you, like a recommendation feed, right? Like, you just open the For You tab and accept exactly what it gives you. But I mean, I hope it doesn't, you know. But that requires such a fundamental... that's a choice we can make. And a different choice we can make is: well, actually, no, we do care about building systems and encoding knowledge into them. One thing that's been growing on me for a while, to make this slightly less philosophical, although maybe not much, is the idea in machine learning that there is no free lunch. It's kind of a fundamental and old and known idea, and there are a lot of interpretations of the same theory. The theory is true, it's a mathematical statement, and it basically just says: if you assume nothing about the world, all learning algorithms you can build, and all learners you build, are equally bad, pretty much. And once you understand the mathematical version of that, it's almost a really simple statement. And I think it's something that comes up time and time again: something fundamental about intelligence, as we call it, is actually about knowing our world and knowing, because we're humans, what humans are like and what humans are interested in. And you can't kind of scale your way into that. Now, if humans themselves change their preferences to be simpler, yeah, that's a future that's possible.

19:43
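[Editor's note: the no-free-lunch statement above can be checked directly on a toy example. The sketch below, with invented names, enumerates every possible boolean "world" on a 2-bit input domain and shows that two opposite learners achieve identical average accuracy on unseen inputs, exactly as the theorem predicts when no assumption about the world is made.]

```python
from itertools import product

def average_accuracy(learner):
    """Average off-training-set accuracy of `learner`, taken over every
    possible boolean target function on a 2-bit input domain."""
    domain = [(0, 0), (0, 1), (1, 0), (1, 1)]
    train_x = (0, 0)          # the single point the learner gets to observe
    test_points = domain[1:]  # the three unseen points
    total, count = 0, 0
    # Enumerate all 2^4 = 16 target functions (one label per domain point).
    for labels in product([0, 1], repeat=4):
        target = dict(zip(domain, labels))
        for x in test_points:
            if learner(train_x, target[train_x], x) == target[x]:
                total += 1
            count += 1
    return total / count

# Two maximally different learners:
memorize = lambda tx, ty, x: ty        # predict the training label everywhere
contrarian = lambda tx, ty, x: 1 - ty  # predict its opposite everywhere

# Averaged over ALL possible worlds, both score exactly 0.5 on unseen points.
```

The moment you restrict attention to structured worlds (say, smooth or compressible targets), the tie breaks, which is the speaker's point: useful intelligence comes from encoding knowledge about which worlds actually occur.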

Speaker C

I actually agree. I think sometimes we want real solutions to problems; there are fundamental trade-offs, and we have to articulate those trade-offs. Right? There is no simplified version of the answer, given what you want to accomplish. So let's assume we're in that world. I can't go to the beach with my one prompt; instead I have to actually describe things. So you've done work on DSPy, which I think is, in my opinion, just the most systematic approach to making the prompt more powerful. So maybe you can describe DSPy, how it works, and how it addresses this problem. Yeah, sure. Very specifically, the problem is: we've decided that my one prompt, and just waiting for the model to get better, is not going to be sufficient for whatever reason. So now I need a better way to think about prompting. Yeah.

21:02

Speaker A

So actually, think back to your example. Suppose that what you wanted to build was a bit more complex, so there was more specification involved. But suppose also that you were in more of a rush, because, again, applications don't want to wait. I'm building a system; I want to use the best intelligence, so to speak, that exists now, but I do need to proceed. And so the question is, what are you going to do? One of the hardest things that makes communicating the DSPy stuff difficult is that we've been doing some version or another of this for something like six years, and DSPy itself is three years old. A lot of this was codified before a lot of the changes in the field, which makes some of the conversations slightly trickier. But what people did for the longest time, in 2022 when people were tinkering with early models, and in 2023, and with only some slight change starting in 2024, and fundamentally to this day, is that the biggest hurdle in using a model is prompt engineering. Which is, at least in my understanding, and really I think the most canonical understanding of it, changing the way in which you express what you want such that it evokes the model's capabilities in the right ways. And so this is less about things that are much more timeless and important; it's really about the belief that there is a slightly different wording of what you ask that could get the model to behave a lot better. And the problem is that this is actually true. This is true for the latest models. This is why OpenAI and Anthropic and others release prompting guides, even for the latest models. And they say, well, you're not holding it right. And they're correct.
But for the most part, the argument that early DSPy was making is that...

21:46

Speaker C

How do you pronounce it? DSPy?

23:24

Speaker A

Yeah, DSPy, like NumPy.

23:26

Speaker C

Oh, I love that. Oh, I almost called it dspy.

23:27

Speaker A

The argument that we were making was: the models keep getting better, but in any case, they keep changing, and the thing you want to build changes a lot more slowly. I'm not saying it doesn't change, but there is actually a conceptual separation between what you are trying to build and LLMs and vision language models; that space is basically separate. So what if we could try to capture your intent in some kind of purer form? And that intent has to go through language. The reason you're trying to use AI is that there's some inherent underspecification and fuzziness; you're trying to defer some decision-making. "I don't know how this function should exactly behave in every edge case, but please be reasonable" is what you're trying to communicate, right, with these types of programs you're building. So DSPy says, basically, there is a number of ideas that you need, and you need them together, which is the thing that I think is a little trickier for a lot of people. There are five bets that DSPy makes, and you need them together, and they need to be seamlessly composable. And actually, in order to get all five, you don't need five concepts; you fundamentally need one concept. So the idea is: we have Python, we have programming languages. These programming languages encode a lot of things that are highly nontrivial. First of all, they have control flow. And control flow means that I can get modular pieces really easily, because I can define separate functions and modules. The nice thing about them is that they really give you a bunch of stuff. They create a notion of separation of concerns, where the contracts of different functions can be described without you knowing or caring about everything inside the function. If you trust that a function was built properly, you can just invoke it and it does its job.
And then you can reason about how you compose these things, but you can also compose algorithms over functions. I can have a more general processor or function that takes these functions and applies things on top of them that are higher-level concerns. I can refer to variables and objects and mutate them or pass them around. When I say if this, then that, I really mean it. I don't have to go back to the model to reassure it, to promise I'll tip it a thousand dollars so that it actually listens to that if statement. Now, one objection is that this is a really limiting paradigm. Conventional programming is a really limiting paradigm; why would we want to go back to it? And I think the answer is all of the things I just mentioned, all these symbolic benefits from a specification standpoint. This is not about capabilities. These benefits are really hard to encode in natural language. You can reinvent them: you can tell the model, if you see this, then do that. And the model might reasonably say, well, he didn't actually mean it 100% of the time; I think the reasonable thing this time is an exception. Right?

23:31

Speaker C

Well, you actually can't do that with natural languages without implicitly creating a formal language. I mean, the most obvious version of this is ambiguity. Right? So: the dog brought me the ball and I kicked it. That's fundamentally ambiguous. You don't know if I kicked the dog or if I kicked the ball. And both are totally reasonable, depending on the person. Right. And so at some level, English doesn't do the job.

26:08

Speaker A

Right, I agree. But programming languages are also really fundamentally limited in that you have to over-specify what you want. You kind of have to go above and beyond what you actually want, because no ambiguity is allowed. And that forces you to think through things you maybe don't even know how to do. Like, how do you write a function that generates good search queries, or that plays a game well? It's very difficult to do.

26:34

Speaker C

Yeah, I don't want to get too wonky, because I know where you're going. And I just have to say this because it helps frame this conversation. By the way, what you said on X, which we're getting to, really kind of changed my brain. So for imperative programming, that's absolutely the case. Right? You need to know everything that possibly happens.

27:01

Speaker A

Or if you don't know, the language is going to make a very fixed assumption for you.

27:18

Speaker C

Yeah, it's going to make some basic assumption. Right. So it's almost like if you're managing a state machine, you've got to know every state machine transition. That's an imperative language. Declarative languages are quite different. Right. In declarative languages, you actually specify what you want formally, and then the system figures out how to get to that end state. Right. But the problem is you have to be able to specify every aspect of that end state perfectly, which again, for some problems, is very complex. So that's also limited. And you just have to know the end state.
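The contrast being described can be sketched in a few lines of plain Python (an invented toy example, not from the conversation): the imperative version spells out every transition, while the declarative version states only the end state and pays for it with unbounded search cost.

```python
from itertools import permutations

# Imperative: spell out every state transition yourself.
def imperative_sort(xs):
    xs = list(xs)
    for i in range(len(xs)):
        for j in range(len(xs) - 1 - i):
            if xs[j] > xs[j + 1]:
                xs[j], xs[j + 1] = xs[j + 1], xs[j]
    return xs

# Declarative: state only WHAT the end state is; a generic engine finds it.
def is_sorted(xs):
    return all(a <= b for a, b in zip(xs, xs[1:]))

def declarative_sort(xs):
    # Deliberately naive solver: search until the specification holds. The
    # cost is unbounded up front, which is the trade-off mentioned above.
    return next(list(p) for p in permutations(xs) if is_sorted(p))

print(imperative_sort([3, 1, 2]))
print(declarative_sort([3, 1, 2]))
```

Both return the same sorted list; only the division of labor between programmer and system differs.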

27:24

Speaker A

Yeah.

27:55

Speaker C

So now you're working on DSPy, and I would love to hear, you know, you talked about how using LLMs with a bit more formalism pushes it to yet another level.

27:56

Speaker A

Right. So the only new abstraction in DSPy, and it's incredibly simple, is this notion of signatures; the word is just borrowed from function signatures. Our most fundamental idea, which is just so basic and simple, is that interactions with language models, in order to build AI software, should isolate ambiguity into functions. And how do you declaratively specify a function? I think the most fundamental thing is that it takes a bunch of objects. They had better be typed, and they had better have interesting and meaningful names. It does a transformation to them, and you get back some structured object, potentially carrying multiple pieces. And when you do this, it's your job, and this is not easy, but it is your job, to describe exactly what you want without thinking particularly about the specific model or compositions you have in mind. And this is actually a lot harder than it sounds to most people. So, for example, there is a class of problems for which some people actually write prompts that are almost signatures. These are cases where you only have one input, and your output is just a response. You basically take a chatbot, because the APIs, or the models, are structured such that this is a very natural use case. And these people try to prompt minimally, right? They don't say, you know, "think step by step" or "you're an agent that's supposed to do this"; they just say what they want. So there's a class of people that almost implicitly write signatures, but there's something wrong with the fundamental shape of the API that usually exists.
And so signatures are just saying: here is a better shape. And we made every decision here slightly more carefully. Now, once you have signatures, every other part of DSPy, from an abstraction standpoint, falls out of it. There's really nothing else. Once you have signatures, note that all you have is a function declaration. It's just declaring a function; it doesn't do anything. One of the hardest things about people wrapping their heads around DSPy signatures is that a signature does absolutely nothing. And it's entirely their job to build it. Right? We actually can't help them at all build the signatures. A lot of the time people are like, well, couldn't you generate the signature from this or that? The signature is encoding your intent. I know nothing about your intent up front. That's the whole point.
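A minimal sketch of the idea in plain Python (a toy stand-in with invented names, not the real DSPy API): a signature is a typed, named declaration of inputs and outputs plus a fuzzy English description, and by itself it performs no computation.

```python
from dataclasses import dataclass

# Toy stand-in for a signature (NOT the real DSPy API).
@dataclass(frozen=True)
class ToySignature:
    instructions: str   # the English, intentionally fuzzy part of the spec
    inputs: tuple       # (name, type) pairs; names carry semantic meaning
    outputs: tuple      # (name, type) pairs

# Declaring a signature only encodes intent; it does absolutely nothing
# on its own, exactly as described in the conversation.
summarize = ToySignature(
    instructions="Given a list of documents, produce a faithful one-paragraph summary.",
    inputs=(("documents", list),),
    outputs=(("summary", str),),
)

print(summarize.outputs[0][0])
```

A runtime (a module, in DSPy's terms) would later take such a declaration and decide how to actually invoke a model against it.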

28:07

Speaker C

So, to be very clear, what are the signatures written in?

30:18

Speaker A

I mean, fundamentally, it could be a drag and drop thing. It could be whatever. But the point is, it is a Python class. Usually this is Python.

30:21

Speaker C

It's formal. It's formal. It's not English, right?

30:30

Speaker A

Well, it's a formal structure in which almost every piece is a fuzzy, English-based description. So you could say something like: I want a signature that takes a list of documents, and the list of documents is the typed object. You could actually say list of document, and you have to define what the typed document means. And the fact that this type is document, maybe the name, matters. A list of documents is not necessarily the same as a list of images, right? They're different things, and they're semantically and fuzzily different. And basically it says, in English: given these inputs, and you have several of them, I want to get these outputs, and you have several of them, and maybe the order matters. So I argue it's really just what a prompt is supposed to be, or what a prompt wants to be when it grows up. It really is just a cleaner prompt. Now, if you grant me that, which I argue is a really small, very simple contribution, there is really not a lot of richness to this. But that's the point. You get everything else that makes programming great while being able to build really powerful AI systems, because you can now isolate your ambiguity at the right joints. You have a notion of where you want the joints to be, and the rest of your programs can be very modular. You can have multiple signatures, so now you get what people call multi-agent systems. Multi-agent systems are just AI programs in which you have multiple functions. It's really not a complicated idea. Once you take this, you get things like inference-time strategies: people say chain of thought, you have to write your prompt in this way, or we have to train the model in a certain way, or ReAct agents, or program of thought. We recently released this thing called recursive language models.
The thing is, when you're solving a task, none of these inference strategies should be your concern unless you want them to be. This is just a thing that should be compositional, and signatures have the right shape that we can use programmatic constructs to compose over them.
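The composability being described can be sketched as follows (invented names, a toy illustration rather than DSPy internals): an inference strategy like chain of thought is just a wrapper composed over a task function, so the task declaration itself never mentions the strategy.

```python
# Stand-in for a language model call (invented; no real API assumed).
def predict(prompt: str) -> str:
    return f"answer({prompt})"

# A strategy wraps ANY predictor: elicit reasoning first, then answer.
def chain_of_thought(fn):
    def wrapped(question: str) -> str:
        reasoning = fn(f"Think step by step: {question}")
        return fn(f"Using '{reasoning}', answer: {question}")
    return wrapped

plain = predict                       # the task, with no strategy attached
with_cot = chain_of_thought(predict)  # same task, strategy composed on top

print(plain("2+2"))
print(with_cot("2+2"))
```

Swapping strategies (ReAct, program of thought, and so on) would mean swapping the wrapper, never editing the task declaration.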

30:32

Speaker C

When you think of DSPy, when you were originally creating it and now, do you think about it as something that will fundamentally only be consumed by humans, or built for humans?

32:16

Speaker A

Not at all, no. I can definitely imagine cases where you bridge that gap.

32:27

Speaker C

And the reason I ask is there's this obvious question: if the interface to LLMs is going to be all automated anyway, do we need to enforce these restrictions, which are primarily there to keep natural language speakers within certain boundaries? If it's an agent calling it, we may not need to do that.

32:34

Speaker A

So I think the argument in DSPy is that intent should be expressed in its most natural form. That's the declarative part. And the second part is that, unfortunately or fortunately, in the general case that cannot be reduced below three forms. Some things are really best expressed as code, and no amount of automation can remove that. There's no amount of automation that can remove the fact that I actually want to think about three pieces because they're separate to me and I want to maintain them separately. No amount of automation is going to remove the natural language piece; nobody wants to write Python to describe a really complicated AI system from scratch. And no amount of automation is going to remove the fact that for some classes of problems you really need a more RL-like standpoint, where you have a distribution of initial states or inputs and you have a way of judging them, or metadata about what correctness looks like. Because that really captures the wonky, exceptional long tail of problems that actually vary by implementation or by model.
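A toy sketch of how the three forms might sit together in one program (an invented example, not real DSPy code): English instructions for the fuzzy part, plain code for control flow, and data plus a metric for the long tail.

```python
# 1. Natural language: the fuzzy part of the specification.
INSTRUCTIONS = "Classify the sentiment of the text as 'pos' or 'neg'."

# 2. Code: control flow you literally mean. The classifier is a stand-in
#    for a language model call that would be guided by INSTRUCTIONS.
def classify(text: str) -> str:
    return "pos" if "good" in text else "neg"

def program(texts):
    return [classify(t) for t in texts]   # composition is plain Python

# 3. Data + metric: a distribution of inputs and a way of judging outputs,
#    capturing the long tail that prose and code do not fully pin down.
examples = [("good movie", "pos"), ("bad movie", "neg")]

def score(fn):
    return sum(fn(text) == label for text, label in examples) / len(examples)

print(score(classify))
```

An optimizer would be free to change how the model is prompted or trained, but each of the three pieces remains the authoritative record of its slice of intent.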

32:53

Speaker C

Yeah, but you may also just want diversity: give me something that may solve this problem. Right? It may just be that there is no formal specification. Yeah, totally.

33:52

Speaker A

Right. So people associate DSPy most with the piece that is most different from what they usually see, which is optimization. A lot of new users, and a lot of people that look at the paradigm and try to critique it conceptually, miss the fact that you have to have these three pieces, or at least that in the general case you can't get away without any of them. Now, by the way, there are a lot of applications where you do not need all three. If you're building yet another RAG app and the model has been post-trained to death to take a context and answer a question about it, you don't really need a lot of that to express your intent, because it's close to what the model is good at anyway. A lot of people associate DSPy with the third one, which is data-based optimization. And a lot of well-intentioned users would write overly simplistic and general programs and try to distill their intent through data, or through this process of trial and error. And that's really a misuse of the power of the models and the power of the paradigm. Because if you know what you want, nothing can express it better than you just saying what you want. Data-based optimization is there to smooth the rough edges; it's there so you don't have to maintain laundry lists of exceptions. I'll wrap this up quickly, Martin. The other part of DSPy is the reason we built all of these abstractions and haven't been changing them. These abstractions are basically three years old, and they've basically not been changing. What we spend a lot of our research time on is building algorithms. And the thing about those algorithms is that I'm not wedded to any of them.
We usually get excited about one for a month or something, but I rarely go out and get particularly excited about getting anyone to pick one of them over the other. We recently released an amazing genetic optimizer for prompts called GEPA. Before that we had another one called SIMBA, which was this reflective method. And we had MIPRO before that. We have a lot of these algorithms, and they're really clever and cool. But what I'm interested in is that we build these algorithms to expire. As models get better, we can come up with better algorithms for turning the abstractions into higher-quality systems. And what we want to happen over time is that our algorithms expire and we build better ones, but the abstractions that we promised, and the systems that people express in those abstractions, remain as unchanged as possible. So that's something that's kind of unusual to a lot of folks in the space.

34:01

Speaker C

It may also help to pencil out where this sits in the software development life cycle, right? There are two places you could put it. You could just be like, I am writing my software, I want to know what's the best prompt to use, and then you could use it there. Or you could be like, actually, the best prompt is determined at runtime, and so maybe you could invoke it then. So is there a standard use? Do you do this basically before the software is deployed, or are there actual runtime uses where you're trying to find the right prompt?

36:19

Speaker A

So there are two concepts that exist in DSPy for this, and I don't know how technical we want this to be. We have the notion of modules, which is borrowed directly from neural network layers, PyTorch modules. A module is just saying: once I have the shape of the input and the shape of the output, which is a signature, I can actually build a learnable piece that has some inherent structure, what a machine learning person would call an inductive bias. I want it to take that shape and implement it for me, but carry some parameters internally about what it could learn. So that's a module. And a module is entirely an inference-time object, in the sense that it modifies behavior when it's being invoked. So things like agents, different types of agents, code-based or tool-based agents, or chain-of-thought reasoning, all of these are inference-time strategies that are modules. And the key aspect in DSPy here is that these must be decoupled from your signature. Your signature should know nothing about the inference-time techniques that you're using. The other concept in DSPy is optimizers, which are again just functions, like modules are just functions, but they're functions that take your whole program, an actual complete piece of software that has potentially many pieces. And they think holistically: how do I use language models to get this thing to perform its intended goal, which might be maximizing a score on a test set, but in principle could just be doing what the model understands from the instructions it should be doing. And people do this at inference time sometimes, in the sense that it happens while the user of a system waits, so to speak. But it's a fundamentally different contract, because an optimizer sees extra information. I see the whole system when I'm an optimizer. I don't just see an isolated module; I can see all of the pieces.
I can see a data distribution, I can see a notion of reward. And so I have a much richer space, because there's strictly more information, which no inference technique, no LLM, is able to capture, just from an information flow standpoint.
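The optimizer contract being described can be sketched roughly like this (a toy with invented names, not one of DSPy's actual optimizers): an optimizer is a function over the whole program plus data and a metric, returning a better-configured program, while an individual module only ever sees a single invocation.

```python
# A family of candidate programs, parameterized by a configuration value.
def make_program(threshold: float):
    def program(x: float) -> str:
        return "high" if x >= threshold else "low"
    return program

# The optimizer sees the WHOLE picture: program family, data, and metric.
def optimize(make, candidates, data, metric):
    scored = [(metric(make(c), data), c) for c in candidates]
    _, best = max(scored)   # holistic search against the metric
    return make(best)

data = [(0.2, "low"), (0.9, "high"), (0.6, "high")]

def accuracy(program, data):
    return sum(program(x) == y for x, y in data) / len(data)

tuned = optimize(make_program, [0.3, 0.5, 0.8], data, accuracy)
print(tuned(0.6))
```

Here the "configuration" is a single number; in the real setting it could be prompts, demonstrations, or model weights, but the contract, whole program in, improved program out, is the same.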

36:52

Speaker C

You know, it's interesting, because a lot of people think of DSPy as basically a prompt optimizer: here's my prompt template, tell me what prompt would be the best. But to hear you describe it, it's almost like this kind of declarative runtime type of thing. Do you know what the standard use is? I don't even know if you have visibility into this stuff, because it's an open source project. But is it the naive use case, which is largely prompt optimization, or are people actually using this in more sophisticated ways?

38:50

Speaker A

Okay, so I'm very loud about the abstractions. I talk about them all the time. I give talks about them.

39:22

Speaker C

You scolded me on X about this. Sorry. No, it was fantastic. It was great. You really corrected me. Listen, I was one of the few people that really thought about it as a prompt optimizer. I really thought: I'm going to write my prompt, I'm going to do some templating magic, I'm going to give it to DSPy, and then it's going to give me the best thing for what I want to accomplish, and then I'm just going to stick that in my program. That's the way that I thought about it, until you made the point that it's actually more of a set of abstractions that will evolve with your program.

39:30

Speaker A

So I tried to learn from what happened historically in computer science. You had these machines, you got general-purpose chips, and people were programming those directly in whatever language they spoke, right? Machine code. And maybe you could abstract it slightly with assembly. But then there was this amazing time where a lot of languages, culminating maybe most popularly in C, but various others before it, got this idea that there's actually a general-purpose programming model. You could build a model of a computer without thinking about any specific computer. And actually that's a bit of an illusion, because every specific computer is a lot more complicated. But you could create this illusion that is much more portable and much closer to how humans think. And I know it's funny to describe C as close to how humans think, but it really was a fundamental jump. Once you have C, it's important to ask: why do people use C instead of writing assembly? And it would be really weird to me if someone said they use C because it's faster than assembly, like the code runs faster. So to me, when someone says they use DSPy because the quality of the system is higher, which by the way is very often the case, it's not really the right answer, because you're jumping, in my opinion, to a higher-level abstraction, such that I would actually be willing to give up some speed in order to have the portability, the maintainability, the closeness to how I think about the system and how I want to manage its pieces. There is a trade-off I'm willing to accept. Now, the reason people have actually universalized C and don't regret it is this amazing compiler ecosystem, where people build all of these optimization algorithms and passes and all sorts of infrastructure. You inline functions, so you break the modularity. Right? People are writing modular code, but you're actually breaking a lot of that modularity when it's being turned into an executable artifact.
You eliminate dead code. You have all these heuristics, sometimes different heuristics for different machines. And so my vision here is: if AI software is a thing, and AI engineering is a thing that needs to exist irrespective of model capability, because we want to have this diverse space of systems, what is the abstraction to capture it? And if natural language is too ambiguous to be the only complete specification of these systems, and it's too mushy, and we kind of want more structure, well, what would that structure look like? And if we know what that structure looks like, well, if we do it naively, you would actually lose a lot. If we build DSPy poorly, you might have a really elegant program that sucks. Right? Like, when you run it with an LM, it sucks. So the reason I build optimizers, or we build optimizers as a team, is not so much that I think people can't write prompts and I want to write better prompts for them. What a boring reason. I don't care about that. People can write prompts, people can iterate on prompts; that's not an issue. The thing I'm trying to say is: I want them to express their intent in a way that is less model-specific, and not worry that they're leaving a lot on the table.

39:56

Speaker C

And honestly, this is where you changed the way that I think about this whole thing. And so I'm going to try something I alluded to previously in our talk, but I want to try it again here, because this is kind of how it changed my thinking. You can tell me where I'm right or wrong. So you said assembly and C, but we've actually had a lot of paradigm shifts since then. C, let's just say, is kind of an imperative language, where for every event that happens, you have to know how to handle it. Right? And so traditionally in distributed systems, imperative approaches have not worked well, because you could have some event show up at a node and you don't know what state the node is in, and the state space is so huge. And so you actually had a big abstraction shift with declarative languages, where declarative languages would be like: okay, listen, we're going to tell you what the end state of the system is, and then the system will figure out all of the state transitions to get there, right? This was a higher level of abstraction, so people didn't have to worry about every event that comes in. And you can actually declare, in something like Datalog: here are all of the conditions that exist, and I just want to make sure that the system is always in that state. And the thing you give up in that case is that you can't bound the amount of computation needed. You don't know how long it's going to take to get there, but it will always get you to that state. So it's easier for a programmer; you can actually build programs more easily for certain classes of programs. Now, when I look at DSPy, I feel like it's the same type of leap between imperative and declarative, but for LLMs.
Like, you can't write a declarative program that's going to solve the same problems that an LLM can, because there's no fuzzy this-and-that, and you can't really integrate them. Right? And so you want the same type of shift, because you've got a new problem domain. And DSPy kind of gives you that with LLMs. You can formally specify it in a way that's natural but also safe, and then it decouples you from the actual implementation below it. So is that a fair way to think about it, or is that just a Martin-ism?

42:38

Speaker A

No, I think that's a fair way to think about it. And one funny thing, and I think you would probably agree with this: I don't know that declarative is better or imperative is better per se. It's more about the problem domain.

44:42

Speaker C

Literally, declarative is better for cases where you've got a very complex system with a lot of asynchronous events, because you don't need to maintain a state machine.

44:53

Speaker A

Yeah.

45:01

Speaker C

You know, all of these things have trade-offs. All of them do. Right? Like, you wouldn't use an LLM to add two numbers. Right?

45:01

Speaker A

And so I think a really good shape for this is that you want an imperative shell. DSPy here contrasts with a lot of folks that create graph abstractions or whatever, things that are fairly declarative.

45:07

Speaker C

I'm going to go on the record saying graph abstractions are generally a bad idea, in my opinion, for basically anything in computer science, but go ahead.

45:20

Speaker A

Right, exactly. And I think it's because humans, when we think top down, actually think imperatively. And DSPy is just Python, which, I mean, is a complicated language, but it's fairly imperative: you do this, you do this, you do this. But at the leaves, where you were going to have a potentially fuzzy task, what were you going to do? I think you were going to write a prompt. And I think the issue with prompts is fundamentally that they're so declarative, they are too declarative, that you're forced to break the contract of declarativeness, because you're like: well, if I just say what I want, the model is never going to fit in my bigger program. One reason, by the way, and people forget this, is that if you work with a chatbot that is tuned for human responses, you're doing most of the work that DSPy has to do in a program. In a program, if I have a function and I want to give it inputs and get back outputs, those have to actually go into variables. The output has to parse in a certain way, so to speak, and I have to funnel things through. If you're a human who's just asking the model questions, no matter what form it gives you, you're smart enough to bridge the fuzziness in the shape of the model's output.

45:27

Speaker C

It's almost like the imperative is: I know every step to the solution, so do every step. Right? And declarative is: I actually don't know all of the steps, but I know the solution, so give me the solution. And DSPy is almost: I kind of know how to frame the solution, do the rest of the work. Right? It's this kind of fuzzy middle.

46:39

Speaker A

Right, right.

46:55

Speaker C

And listen, there are trade-offs to each, right? I mean, with DSPy, you have the overhead of a SOTA model, which took a billion dollars to train and is expensive at inference. So these are just different points in the declarative space, the performance space, the cost space, et cetera. So I'm totally bought in. I actually think this is such a nice way, even independent of DSPy: just the core of this work is how you should think about interacting with LLMs formally. I really think that you've nailed that abstraction. So let's just take that as a given. What are the hard problems now, or the next set of problems, to make that more pragmatic, like the optimizations under the covers, or whatever else you need to do?

46:56

Speaker A

So, everything we talked about today, I do almost nothing on anymore, because this is work we did three years ago, and I'm just out there telling people about it. We're not changing these abstractions. What we actually do is ask the following set of questions. All right, someone wrote a program, and we assume they did a reasonable job describing what they want. Maybe that means they wrote the control flow, they have the signatures, and they have some data; these are the three pieces you might want to have, or they have some, not all, of these. How do we actually do a good job at optimizing this? And it's actually, I think, an interesting progression to see how we went from the very early optimizers in 2022 to the latest ones. The very early ones had to work with models that basically didn't work, right, that had essentially no instruction-following capability and were hit and miss for their tasks. So we did what the reinforcement learning people do on LLMs: you take the program and you bootstrap examples, which is just another way of saying you sample, you run the program a lot of times, maybe with high temperature, you see which things actually work, and you keep traces of all of these over time. Then those traces, which are generated by the model, can become few-shot examples. And if you just do that, sometimes it improves a lot, sometimes it becomes a lot worse. So you do some kind of discrete search on top to find which ones actually improve on average. That was when models were really bad. As models have been getting better, we've moved basically all the way to reflective prompt optimization methods, where you actually go to the model and you're like: here is my program. Here are all the pieces. Here's what this language means.
Here are the initial instructions I came up with, from just the declarative signatures, by the way. Here are some rollouts generated from this program, and here is how well they perform. Let's debug this; let's iterate on the system. And obviously there's a lot of scaffolding to make sure that search is actually a formal algorithm that is going to lead to improvement. But increasingly, more and more of it is carried out by the models. One thing we also do a lot of is ask: all right, conventional policy gradient reinforcement learning methods like GRPO, there's nothing about them that can't be applied to a DSPy program, because the DSPy program says nothing about how the optimization should happen. So actually, for a very long time, from February of 2023, you could run offline RL, and since May of 2025, you can run online RL, or GRPO, on any DSPy program that you write. People think that it's limited to prompt optimization, but I think the only notion of prompt that is fundamental in DSPy is that natural language is an irreducible part of your program, and that prompt is human-facing. It's how you say what you want. How it gets turned into an artifact may well use reinforcement learning with gradients, or natural-language learning. So we spend a lot of time on optimization. We also spend a lot of time on inference techniques. Say you just declared that you want your signature, which processes lists of books. Well, guess what? No model has long enough context to work with lists of books. So last week, my PhD student Alex and I released this idea called recursive language models, which takes any model that is good enough and figures out a structure in which it can scale to essentially unbounded lengths of context. And we were able to push it to 10 million tokens and see essentially no degradation.
And the reason we build these types of algorithms is we really want to back your signatures by whatever it takes to bridge the gap between whatever the current capability limit of the model is and the intent you specified. The last thing we think a lot about is this: we've made the argument conceptually, and tried to demonstrate it empirically, that you need these three irreducible pieces, signatures in natural language, structured control flow, and data, to fully specify your intent, at least ergonomically enough. The question, though, is that this is a very large space of programming, where you need to figure out: okay, I have a concrete problem, how do I map it into these pieces, knowing that maybe I need all of them? And so we spend a lot of time, and this is why it's a big open source project, seeing what people actually build and learning from that: what are the AI software engineering practices that we should encourage and support? These are the types of questions we think about. And one reason this has to have the structure of a large open source project, since it's this large fuzzy space, is that I don't want to be the only group, or a small number of teams, working on any of these pieces. It's a space where the more academics and researchers and people work on optimizers, all programs benefit. The more people work on modules, all programs benefit. The more people build better models, especially programmable models, whatever that might mean in the future, models that understand that they're going to get used in this structure, everyone benefits. And it reminds me of the way in which deep learning really took off: some people iterated on the architectures, some people iterated on the optimizers, you got things like Adam and other methods.
And I think that is what we're trying to really push a community towards.
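The three pieces just described, a natural-language signature, structured control flow, and data, can be illustrated with a toy sketch. This is not DSPy's actual implementation, just a minimal illustration of what a declarative signature might compile down to; all names here are hypothetical:

```python
from dataclasses import dataclass


# A "signature" declares intent: what goes in, what comes out, and a
# natural-language instruction. It says nothing about prompting strategy
# or about how optimization should happen.
@dataclass
class Signature:
    instruction: str   # natural language: the irreducible piece
    inputs: list[str]  # field names that structured control flow binds
    outputs: list[str]


def compile_prompt(sig: Signature, values: dict[str, str]) -> str:
    """One possible 'artifact' a signature can be turned into: a prompt.
    An optimizer is free to rewrite this text, or to skip prompts entirely
    and tune weights with RL, because the signature doesn't constrain it."""
    lines = [sig.instruction]
    lines += [f"{name}: {values[name]}" for name in sig.inputs]
    lines += [f"{name}:" for name in sig.outputs]
    return "\n".join(lines)


sig = Signature(
    instruction="Summarize the key themes of the given books.",
    inputs=["books"],
    outputs=["themes"],
)
prompt = compile_prompt(sig, {"books": "Book A; Book B"})
print(prompt)
```

The point of the separation is that the declaration (`Signature`) is stable human-facing intent, while the artifact (`compile_prompt`, or an RL-trained policy) is an implementation detail the optimizer can swap out.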

47:38

Speaker C

All right, so one last thing just to finish off. This is getting a little more philosophical. What you're addressing is, again, the ability to declare intent for these models in a way that hits the right abstraction. If you could guess prophetically: in the future, are these models going to have independent agency, like agents, or is it going to be humans guiding them? Do you have an opinion on the direction this goes? I asked this question a bit earlier, but I want to ask it more directly. Do you think the need for a human to declare things formally is going to go away, and over time we treat these like grad students and this all just becomes the inner workings of an agent? Or do you think that these are formal software systems, that this is a language like any other language, and we should expect DSPy, or something like it, to be the interface that's exposed to humans for the foreseeable future?

52:37

Speaker A

I think you need some amount of grounding in the world when you build these systems. People in AI talk a lot about AGI, this kind of ethereal intelligence that is just so smart. But the problem is that the intelligence we care about, as far as I can tell, is really about the things you might want to ask, or the way things are in the world. It's very world-oriented; it's not this very abstract thing. So as models get smarter and smarter, I imagine that a lot of the problems people write programs for today could get a lot simpler, because that use case gets absorbed. It's kind of like RISC versus CISC architectures in CPUs: if you believe in complex instruction sets, it's possible that you had to jump through all these hoops to do a fast square root before, but then somebody just gives you an instruction for that. Models can keep absorbing more use cases, as keywords in their language. But philosophically, as you say, the human condition is that we will just want more complex things. And once you want these complex things in a repeatable way, you've got to build a system. And if you want to build a system, I don't really see that not having structure. I don't see it having the structure of LLM APIs today; I see it maybe as some nicer facade on top of DSPy.

53:43

Speaker C

Maybe I can ask you a pointed question. So you have grad students, right?

55:12

Speaker A

Yeah.

55:14

Speaker C

Do you ever wish that you had a DSPy interface to them, in the limit? It's a very structured way to make asks, right? And if not, wouldn't that argue that you wouldn't want that for an LLM in the limit either? It sounds like a glib question, but I actually mean it seriously. Is it that humans just aren't capable of doing this stuff, and that's the reason we don't have formalism when talking to them? Or is it just totally different?

55:15

Speaker A

I promise this is an actual answer to your question about grad students. To me, the question sounds like: don't you have chairs at home? Don't you wish they all looked like tables? I need both. I really want to have...

55:40

Speaker C

There's a software system, there's a grad student. They're totally different.

55:54

Speaker A

And there's nothing saying that AI operates only as a chatbot, or as an agent, or as an employee. I think we need all of it.

55:58

Speaker C

I love it. That's wonderful. So sometimes you want to specify something to a machine.

56:07

Speaker A

Yes.

56:12

Speaker C

That has an LLM. And sometimes you just want to talk to something. Those are two different solutions to two different problems. I love that. This is a great answer, and a great way to end this. Thank you so much for your time. This has been fantastic, sir.

56:12

Speaker A

Thank you, Martin.

56:23

Speaker B

Thanks for listening to the a16z podcast. If you enjoyed the episode, let us know by leaving a review at ratethispodcast.com/a16z. We've got more great conversations coming your way. See you next time. As a reminder, the content here is for informational purposes only, should not be taken as legal, business, tax or investment advice, or be used to evaluate any investment or security, and is not directed at any investors or potential investors in any a16z fund. Please note that a16z and its affiliates may also maintain investments in the companies discussed in this podcast. For more details, including a link to our investments, please see a16z.com/disclosures.

56:26