Latent Space: The AI Engineer Podcast

🔬 Automating Science: World Models, Scientific Taste, Agent Loops — Andrew White

74 min
Jan 28, 2026
Summary

Andrew White, co-founder of Future House and Edison Scientific, discusses his journey from academia to automating scientific discovery using AI agents. The conversation covers his work on Cosmos, a world model for scientific research, and explores how AI can automate the cognitive processes of hypothesis generation, experiment design, and data analysis in scientific workflows.

Insights
  • Scientific automation is more about automating cognitive processes (hypothesis generation, experiment design, analysis) than about modeling specific systems
  • The bottleneck in AI-driven science is often mundane logistics (reagent availability, lead times) rather than model intelligence
  • Scientific taste - knowing what constitutes interesting vs boring results - remains a key frontier for AI systems
  • Enumeration and filtering strategies can be more effective than trying to be smarter, allowing AI to try more ideas faster than humans
  • The transition from first-principles simulation (MD, DFT) to machine learning on experimental data represents a fundamental shift in computational science
Trends
  • Shift from foundation models for specific domains to general scientific reasoning agents
  • Integration of literature search, data analysis, and experimental design in unified AI workflows
  • Movement from academic research to venture-backed startups for AI science applications
  • Increasing automation of wet lab experiments through cloud labs and CROs
  • Focus on verifiable rewards and human feedback for scientific AI training
  • Growing importance of provenance and citation tracking in AI-generated scientific content
  • Emergence of world models as memory systems for scientific discovery
  • Transition from simulation-heavy approaches to experimental data-driven methods
Quotes
"When AlphaFold came out and it's like you can do it in Google Colab, you know, or on a GPU or desktop, it was mind blowing. I forget that protein folding was solved. I always thought that was inevitable. But the fact that it was solved and you can do it on your desktop just completely floored me. It changed everything."
Andrew White
"We're trying to automate the cognitive process of scientific discovery. Making hypotheses, choosing experiments to do, analyzing the results from experiments and using it to update your hypotheses or your confidence in those hypotheses, and then leading to a world model of, like, okay, this is how I understand this process to be."
Andrew White
"I think molecular dynamics is overrated. Coming from someone in the thumbnail, you know. And DFT is overrated. In fact, DFT may be even more overrated than molecular dynamics."
Andrew White
"I think the easy way to succeed in AI over humans is you can try more ideas faster."
Andrew White
"I think there's an unlimited amount of scientific discoveries to be made. So there's no scarcity such that basically we will displace them all."
Andrew White
Full Transcript
3 Speakers
Speaker A

MD was supposed to be the protein folding solution. There is a great counterexample. The counterfactual is basically a group called D. E. Shaw Research. They had, you know, similar funding to DeepMind, probably more actually. They tested the hypothesis to death that MD could fold proteins. They built their own silicon, they built their own clusters, they had them taped out all themselves. They burned the algorithms to run MD into the silicon. They ran MD at huge speeds, huge scales. I remember David Shaw came to a conference once on MD, and he flew in by helicopter. He's this pretty famous guy, kind of rich, and he gave an amazing presentation about the special computers in a special room outside of Times Square and what they can do with it. It's beautiful, amazing. And I always thought that protein folding would be solved by them, but it would require a special machine. Maybe the government would buy like five of these things and we could fold, you know, maybe one protein a day or two proteins a day. And when AlphaFold came out and it's like you can do it in Google Colab, you know, or on a GPU or desktop, it was mind blowing. I forget that protein folding was solved. I always thought that was inevitable. But the fact that it was solved and you can do it on your desktop just completely floored me. It changed everything.

0:00

Speaker B

This is the first episode of the new AI for Science podcast on the Latent Space Network. I'm Brandon. I work on RNA therapeutics using machine learning at Atomic AI.

1:12

Speaker C

My name is RJ Haneke. I'm the co-founder of Miro Omics, where we build spatial transcriptomics AI models.

1:23

Speaker B

The point of this podcast is to bring together AI engineers and scientists, to bring together two communities which have developed independently for quite some time. There's been some attempt to combine them, and only now, after many years, are we starting to see some of the big developments play out in the real world and start to solve key scientific problems. There's no one-size-fits-all solution. You need domain expertise. You need people on both sides of the aisle who can really talk to each other, really work together, and understand both the modeling and all of the real subtleties of the system you're actually trying to work on. We hope that we can connect these communities and provide a starting point for this new era of AI and science to move forward.

1:30

Speaker C

So without further ado, let's get started on the first podcast. We're really happy to have in the studio today Andrew White, co-founder of Future House and the newly formed startup Edison Scientific. Rather than introduce him, I'll let him introduce himself.

2:17

Speaker A

Hi, I'm Andrew from San Francisco, former professor, now running two startups, one that's a nonprofit research lab and one that's a for profit venture backed company. And we're trying to automate science.

2:35

Speaker C

We're going to get into all those points.

2:47

Speaker A

Yeah, really happy to be here. Thanks for having me on.

2:50

Speaker C

I want to know personally about jump from academia to industry and or quasi industry. So I would love to hear that story.

2:52

Speaker A

Yes, I guess that's the whole story, right? So I did my PhD at the University of Washington, and I worked in a group with I think 19 people doing experiments and like two people doing simulations. And I was working on a topic called molecular dynamics, which I think is actually suddenly becoming interesting again as everyone's looking for ways to generate data from first-principles simulation. Molecular dynamics covers basically everything that's molecules moving around in dynamic systems, like biology, things like that. Of course the complement in materials science is things like density functional theory, where you can model chemical reactions in these solid systems. So I was working on that. We worked on biomaterials, and so the goal of my PhD was trying to find what are called non-fouling materials. In biological systems, whenever you put a foreign object into the body, it will trigger a response. And that response, called the foreign body response, basically encapsulates it in this layer of collagen. This is actually exploited for some implants: if you get a heart, sorry, pacemaker installed, it coats it with this collagen so that if you go to change the battery, you can almost change the battery out without even bleeding, because the body has completely encased it. And this is great for pacemakers, but for a glucose sensor or a brain-computer interface, BCI is what they call it now, there it's not so great. And so that's why some of those things have a limited lifetime, because eventually your body treats it like a wound and heals and rejects it. Some rejection is immune-based.

3:02

Speaker C

Okay.

4:38

Speaker A

And so that's where, like, if the body can see anything on it, if it can see some ligand that can bind with antibodies, then you get this inflammation, which is like the rejection response you see in organ transplants. But with materials, the body's just like, oh, there's just a wound or there's something here, and it just covers it up. Yeah, I think, you know, the research in that field has gone on a long time since I left my PhD, and there were a lot of theories about it: it's related to the mechanical properties of the material, like if it's spongy, or if it's trabecular, like it has a bunch of little pores in it. We worked on the theory that it had to do with how hydrophilic the material was. But anyway, I was the only one working on computers in this group. I couldn't figure out how to connect what's on the computer with what's done in the lab, because you can make a simulation of whatever, 10,000 particles, 10,000 atoms, and it's like, well, this is not going to model the human body. A lot more atoms involved. So I had a good time. We did some cool stuff, some bioinformatics stuff. I learned a lot. But then when I did my postdoc, I was like, okay, we're going to try to merge experiments and simulations. So I worked on this theory called maximum entropy. It's about how you take complex simulations and match them to limited observations. It's like the inverse of machine learning: machine learning is simple models fit to a lot of data, whereas I had complicated models and was trying to fit to very little data. It was fine, it was great. We wrote some papers, it was useful. And then I started my research group at the University of Rochester on applying these methods to model peptides. Yeah, and I'm always too early for things. We studied peptides for, I don't know, four or five years. And it was a cool niche field, not that popular. Now peptides are like the hottest thing ever.
I think there's even like a peptide rave I heard about a couple weeks ago. But when I was an assistant professor, nobody cared about peptides. So we worked a lot on different ways to combine them. We looked at different experimental methods that we could match to these molecular dynamics simulations of peptides. And then in 2019, I was out on a sabbatical at UCLA. They have a place called the Institute for Pure and Applied Mathematics there, which is this institute where people can go and do a sabbatical and learn new methods. And they happened to be doing machine learning for physics. The name of it was some symmetric thing, like machine learning for physics and physics of machine learning.

4:38
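[Editor's note] The maximum-entropy matching described above, fitting a complex simulation to limited observations, can be sketched as minimally reweighting simulation frames so an ensemble average matches an experimental value. This is a toy illustration, not code from his papers; the observable, the target value, and the frame data are all invented.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy "simulation": 1,000 frames, each with one scalar observable
# (say, a peptide end-to-end distance). All values here are invented.
obs = rng.normal(loc=2.0, scale=0.5, size=1000)
target = 2.3  # hypothetical experimental average we must match

def reweighted_mean(lam: float) -> float:
    # Maximum-entropy weights w_i ∝ exp(lam * f(x_i)): the minimal
    # perturbation of the uniform ensemble that shifts the average.
    w = np.exp(lam * (obs - obs.mean()))  # centered for numerical stability
    w /= w.sum()
    return float((w * obs).sum())

# Solve for the Lagrange multiplier by bisection: the reweighted mean
# is monotone in lam, so we bracket the target and narrow in.
lo, hi = -50.0, 50.0
for _ in range(80):
    mid = 0.5 * (lo + hi)
    if reweighted_mean(mid) < target:
        lo = mid
    else:
        hi = mid
lam = 0.5 * (lo + hi)

print(f"raw mean        = {obs.mean():.3f}")
print(f"reweighted mean = {reweighted_mean(lam):.3f}")  # matches target
```

The point of the exponential form is that it is the distribution closest (in relative entropy) to the original ensemble that satisfies the experimental constraint, which is why it perturbs the simulation as little as possible.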

Speaker C

Okay.

6:57

Speaker A

It's kind of a cool concept. But Yann LeCun was there, and Frank Noe was there, who's a big guy in Europe in this. I don't know, Terence Tao came by. It was a really great group and everyone's kind of jamming. It was 2019, so there had not really been the big hit yet, especially in non-computer-science fields. And then I came back from that and I was like, well, I've got to teach a class on this. I'm writing a book about how you can apply these methods in chemistry. It was a very niche field, because every machine learning class that my PhD students could take at the time, this is when I was a professor at the University of Rochester, would always end in, okay, this is an RNN and this is what you need to know, or this is how you do image classification. But in chemistry it's all about graphs, right? It's all about how you represent these graph structures. It's all about symmetry and geometry. And that was not a thing that was very popular. But you had Max Welling on before, the godfather of geometric deep learning. So I wrote this textbook about these methods, and there was a bunch of interesting mathematics to it. I had a good time and stuff. And then I was following the news in the space, and the original Codex came out, and I had been looking at transformers for a while. I just tinkered with them, and we started trying them on some chemistry tasks. We were really impressed, actually. And we wrote a benchmark, and this is like 2019, a benchmark of verifiable rewards in 2019. Maybe it was 2020 by then, but like.

6:58

Speaker C

Ahead of the curve.

8:25

Speaker A

Here's a task, sorry: there's a task which is like, I have the body of a function for a Markov chain Monte Carlo simulation. It's missing some pieces. Complete it. And then we had a verifier that would check: is it a valid MCMC simulation? We wrote this paper, and it ended up coming out, I think, 2021, 2022, because it took a long time to bank enough questions. But I wrote an opinion piece about how transformers could change how we think about chemistry and how we teach it. And then some people at OpenAI, Lama was there, saw this paper and reached out like, hey, we're building this new model and we think it'd be great to red team what could happen with these models if they're applied to chemistry or biology. And so I was a red teamer for GPT-4, and I was using it like nine months or something before release. It was like August. So GPT-4 came out in March and I was using it in August.

8:28

Speaker C

Yeah.

9:20

Speaker A

And then the ReAct and MRKL papers came out. I think Shunyu Yao wrote the ReAct paper, and I plugged it in with GPT-4 in the fall, you know, and I was like, wow, there's so much stuff coming out with ReAct. Yeah. And it was really exciting. And then when GPT-4 came out, I released this paper called ChemCrow. I worked with Philippe Schwaller in Switzerland on this, and IBM.

9:20

Speaker C

So that was like ReAct applied to chemistry.

9:41

Speaker A

Yeah. And what we had is, there was a cloud lab that IBM built in Switzerland. So we had GPT-4 operating the cloud lab. And then I had written a literature research agent that did agentic RAG. Again, nobody knew what agentic RAG was at the time. I think actually Harrison Chase had written a blog post about some ideas there. And so I stole some of the ideas, really smart guy, and basically applied that. And we saw some really cool stuff. It was really exciting. And then I wrote the paper. It set off this crazy storm where everyone had a lot of anxiety about AI progress.

9:43

Speaker C

Yeah.

10:20

Speaker A

And I ended up visiting the White House. I guess my paper was the only time a preprint or peer-reviewed paper was presented to the President, on their schedule for a certain block of time.

10:20

Speaker C

Wow.

10:31

Speaker A

And the National Security Advisor at the time, Jake. God, I was confused. Yeah, no, sorry. One of them is a talk show host and one of them was like the National Security advisor. I forget which is which.

10:32

Speaker C

That guy.

10:44

Speaker A

That guy, yeah. He had a presentation about our paper, and they presented it because there was a big tech CEO summit at this time, where they had Sam Altman and some other CEOs out there.

10:45

Speaker C

Is this the "future of chemistry as a language" paper, or a different one?

10:55

Speaker A

This is the ChemCrow paper. The ChemCrow paper, yeah. Sorry, I probably should name these things. And so it was crazy. And they had me go out there, and then I met a lot of three-letter agencies I didn't really want to meet. And, you know, somebody from one three-letter agency was like, how does this change explosives? And another three-letter agency was like, how does it change breakout time for nuclear weapons research?

10:57

Speaker C

Yeah, yeah.

11:19

Speaker A

I was like, guys, I'm not really sure. But it turns out that there were not that many people who were world experts on AI and science.

11:20

Speaker C

Right. So what's the answer?

11:28

Speaker A

Yeah, great. Good question. We'll come back to that.

11:30

Speaker C

Okay, yeah, let's come back to that.

11:34

Speaker A

In the end, I had a lot of energy, a lot of excitement about this area, so I took a sabbatical from the University of Rochester. I met with Sam Rodriques, and Sam had been talking to Eric Schmidt and Tom Kalil, who was also on the National Security Council in the Obama administration, about how to scale up these ideas. And so Sam had this concept of focused research organizations: how do you do science not in academia, and not in one of these kind of near-monopoly tech companies, these big labs? And he wanted to try this idea out. And I was like, hey, we should do this around agents for science, or AI for science. I love Sam. He pushes me to come up, you know, with really lofty ambitions. So we decided to automate science as the goal, instead of just seeing what fun stuff we could do with agents in science. But I think that was maybe the real mission. And of course, automating science is the long-term mission.

11:35

Speaker C

Yes.

12:27

Speaker A

And so that was what led to Future House. And that was a very long-winded answer.

12:27

Speaker C

Yeah, no, that's great. So you chose to leave a tenure-track position.

12:31

Speaker A

So I was on sabbatical, which is a beautiful concept. But then I did resign my tenured position when we co-founded Edison.

12:36

Speaker C

Yeah. Okay.

12:44

Speaker A

And, you know, I had been on sabbatical for a very long period of time, and so at a certain point I just had to resign my tenure. So I resigned tenure in June.

12:45

Speaker C

Okay. Oh, so that's only recently.

12:53

Speaker A

Yeah, only recently.

12:55

Speaker C

And you just felt like this is the direction of my career.

12:56

Speaker A

Yeah, yeah. I mean, I got tenure and I had these early career awards, like the NSF CAREER Award. It was great. And I think academia is really exciting, but I just thought that right now this kind of area, AI for science, is (a) difficult to do in academia and (b) so exciting that I think you can take bigger bets. And I think having a tenured position and writing research grants is maybe not the biggest bet you can take on a field. Now we have a venture-backed startup called Edison, which we spun out of Future House, and we took a lot of the ideas and we're trying to do this at an even bigger scale right now.

13:00

Speaker B

Edison was always kind of the plan, going back to Sam's idea of an FRO, a focused research organization. He always had this goal of: let's do fundamental research in this tightly scoped nonprofit, which can kind of explore, and then you have that as a natural arm for spinning off venture-backed companies.

13:40

Speaker A

Yeah, I think that's right. I think something that makes that not as clean these days is how expensive AI research is and how expensive GPUs are. So I don't think we can repeat it many times from Future House. It might be an N-of-one thing right now; maybe not, I don't know. If venture capital keeps growing, then maybe we can. But yeah, we took a lot of ideas from Future House. Another thing: I think we expected it to be harder to automate science. And actually it's really hard. I feel like I'm always miscalibrated in this domain, but it's always hard to predict progress. I think that I overestimate the speed of things on the month scale and underestimate things on the year scale. So the two years from 2023 to 2025 saw an enormous amount of progress. It always felt like things were not going as fast as I thought, but when you look back on it, wow, there's a lot of progress. And so, and Sam actually regrets us writing this, but in the original marketing, the announcement, it's our 10-year mission to automate science. And now, two years later, we had Cosmos and things are going so much faster. There's also this thing you notice in San Francisco: it's actually kind of hard to find problems which are so hard that they are a challenge for language models, but not so hard that they're impossible. We're in this gray zone. And I feel like that's where we are right now: we can actually automate so much of the scientific method. Because it turns out, especially in a field like biology, which is very empirical, the top 1% guesser of what will happen in an experiment and the top quintile or quartile are about equal. And so even if we wait 10 years and get even smarter models, I don't think it's going to really change the fact that we're ready to automate a lot of science with existing LLMs.

14:02

Speaker B

What do you mean by automate science? That's a pretty loaded statement. There are many ways of thinking about that.

15:54

Speaker A

So we try to draw a line between us and what I would call groups that are trying to model something, like the cell, or how proteins fold, or how antibodies can be designed, or maybe virtual cells as an example. They're trying to use machine learning or AI to model some very specific system. We're trying to automate the cognitive process of scientific discovery: making hypotheses, choosing experiments to do, analyzing the results from experiments and using it to update your hypotheses or your confidence in those hypotheses, and then leading to a world model of, like, okay, this is how I understand this process to be. And then that begets new hypotheses or new experiments. We want to automate that sort of loop. We thought that we would have to build up a whole new organization from the ground up for agents. So that means automated labs, it means putting all the papers in one spot, getting APIs wrapped around everything. But over time the models have gotten better and better, and we had to stop and rethink. Okay, we don't actually have to hold their hands so much anymore. They don't necessarily need an automated lab. They can write an email to a CRO or something, or they can tell you what experiment to do, and you can take a video of yourself doing it and then show it to the model, and it can be like, okay, well, this is what happened. So it's been a really interesting experience: sometimes we over-engineer things. Actually, basically we mostly over-engineer.

16:01
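[Editor's note] The loop described above, make hypotheses, choose experiments, analyze results, update confidence, fold it into a world model, can be written down as a plain control loop. This is a hypothetical sketch: every name below (the dataclasses, the four stand-in callables) is invented for illustration, with the LLM agents and the lab replaced by toy functions.

```python
from dataclasses import dataclass, field

@dataclass
class Hypothesis:
    claim: str
    confidence: float  # subjective probability the claim is true

@dataclass
class WorldModel:
    # Accumulated understanding: claim -> current confidence.
    beliefs: dict = field(default_factory=dict)

    def update(self, h: Hypothesis, supports: bool) -> None:
        # Crude fixed-size nudge; a real system would weigh evidence quality.
        h.confidence = min(1.0, max(0.0, h.confidence + (0.2 if supports else -0.2)))
        self.beliefs[h.claim] = h.confidence

def discovery_loop(propose, choose_experiment, run, analyze, rounds=3):
    # propose / choose_experiment / run / analyze stand in for LLM agents,
    # a cloud lab or CRO, and an analysis agent, respectively.
    world = WorldModel()
    for _ in range(rounds):
        for h in propose(world):              # make hypotheses
            exp = choose_experiment(h)        # design the experiment
            result = run(exp)                 # lab, CRO, or simulation
            supports = analyze(h, result)     # does the data support it?
            world.update(h, supports)         # fold into the world model
    return world

# Toy run with hard-coded stand-ins for every step:
world = discovery_loop(
    propose=lambda w: [Hypothesis("compound X inhibits target Y", 0.5)],
    choose_experiment=lambda h: {"assay": "dose-response"},
    run=lambda e: {"ic50_nM": 40},
    analyze=lambda h, r: r["ic50_nM"] < 100,
)
print(world.beliefs)
```

The design choice worth noticing is that the lab step is just another callable, which is exactly why, as he says later, it can be a robot, a CRO email, or a human with a camera.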

Speaker C

So I always think about systems, and the scientific process is a system. I always think of it in terms of constraints.

17:20

Speaker B

Right.

17:28

Speaker C

And what is the bottleneck in the system? So what is your hypothesis about this?

17:28

Speaker A

Right.

17:34

Speaker C

In my mind, not knowing a ton, the constraint in the scientific process is the work you do in the lab. And that's sort of notably missing, well, not entirely, you mentioned automating the lab and whatever. So how are you thinking about this?

17:34

Speaker A

Yeah, I think you're right. Basically the best model, whatever, Opus 7 or GPT-10, really can only propose the first experiment, maybe slightly more cleverly. At a certain point you just need information, right? There are some little calculations you can do, like: there are more atoms in the brain than you could ever simulate, even if you had all the energy from the sun. I think you could simulate maybe a thousand brains in real time with all the energy in the sun. There's just too much information.

17:50

Speaker C

Yeah.

18:18

Speaker A

So science really hits these, like, bottlenecks where you just actually have to go measure things.

18:19

Speaker C

Yep.

18:22

Speaker A

We definitely think about lab-in-the-loop sort of situations. In one of our papers, which was called Robin, we had one of our agents propose an experiment, we did the experiment, and then we had our agent analyze the experiment and propose the next experiment. And that kind of loop, I think, is where you want to get to. So what is the bottleneck in that? I don't think it's the intelligence of the first experiment. Right now I think the bottleneck is something silly, like knowing what's the lead time on all the reagents that you need and what is available in the lab.

18:22

Speaker C

Yeah, yeah, yeah.

18:54

Speaker A

I think whether GPT-5.2, Codex Max, or Opus 4.5 is going to do better probably doesn't matter. It's just a matter of which one has all the information about what's in the lab, how much it will cost, how long it will take. And also, I guess, the kind of frontier that I think about for these models is taste. Of course we want to accelerate technology, we want to improve the economy, we want to improve people's life expectancies, we want everyone to be happier. But a lot of what is done in science is based around human preferences. Why do people study, I don't know, a particular worm? Well, there is a theory that studying the worm has led to good medicines or to discovering new genes. But also people studied it in the past; people's careers depend on that worm, and people want to write papers about that worm. And so there's a human element to some of this. And I think the models don't capture that so well, knowing what is an exciting result and what is a boring result. So scientific taste is a broad category of all these things.

18:55

Speaker B

How do you define, like, do you try to quantify taste in any way? I mean, I know I have some fun anecdotes about this, but maybe, yeah, I'd just like to hear what you think.

19:55

Speaker A

Yeah, actually we sat on this idea. We sat on it, but we argued about it for a long time. Usually every Monday morning at 8 o'clock, Sam and I meet, and we're both, you know, caffeinated and ready, and we argue about stuff like this. And we had a lot of Mondays where we talked about scientific taste. And in the end we were like, okay, let's just do the dumbest thing, which is to have our agents make hypotheses and put them in front of humans and have them be like, I like this one or I like that one. Right? So we just did, whatever, RLHF on hypotheses. And we learned a lot about how bad RLHF is with people. People really pay attention to the tone, to the details, to how many specific facts or figures are in the hypothesis, to actionability, whether the experiment is feasible. But what people didn't really pay attention to is, I don't know how to describe this, but: if this hypothesis is true, how does it change the world? If the hypothesis is false, how does it change the world? How much information do you gain? It's not really information but impact or something. And that really didn't come through. So then we were like, okay, this is maybe one strategy, let's go back and think about it more. We took a pause from that research, and then we made Cosmos, and Cosmos has taste baked into it. At the end of the day there will be some report, and we're working on generalizing this, but basically at the end of the day it's like, okay, I made these discoveries, and a person will be like, great, I'm going to download that one, or I like that one, I don't like this one. And that rolls up to some hypothesis that came earlier in the process. And so we think we can get to end-to-end on this, as opposed to human preferences.

20:06
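[Editor's note] One minimal way to turn "put hypotheses in front of humans and collect picks" into scores is a Bradley-Terry fit over pairwise preferences, sketched below. This is not their pipeline, just an illustration of ranking from pairwise judgments; the comparison data is invented.

```python
import numpy as np

# Hypothetical pairwise judgments (winner_index, loser_index) from
# reviewers shown two hypotheses and asked which they prefer.
comparisons = [(0, 1), (0, 2), (1, 2), (0, 1), (2, 1)]
n_hypotheses = 3

def bradley_terry(comparisons, n, iters=50):
    # MM iteration for the Bradley-Terry model: strengths s such that
    # P(i beats j) = s[i] / (s[i] + s[j]).
    s = np.ones(n)
    for _ in range(iters):
        wins = np.zeros(n)
        denom = np.zeros(n)
        for w, l in comparisons:
            wins[w] += 1.0
            denom[w] += 1.0 / (s[w] + s[l])
            denom[l] += 1.0 / (s[w] + s[l])
        s = wins / denom
        s /= s.sum()  # fix the scale; strengths are only relative
    return s

scores = bradley_terry(comparisons, n_hypotheses)
# Hypothesis 0 won every comparison it appeared in, so it ranks first.
print(scores.argsort()[::-1])
```

The weakness he describes shows up directly in a setup like this: the fitted scores reflect whatever reviewers actually attend to (tone, detail, feasibility), not the downstream impact of a hypothesis being true or false.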

Speaker C

So you mean the feedback loop is the click?

21:38

Speaker A

It could be the click. It could also be, you know, sometimes in Cosmos you could ask for an experiment and go see if the experiment is a success or failure or something. But I guess we've brought it out of this hard-to-quantify "is this a good hypothesis or a bad hypothesis" and into something where you can see some downstream consequences of the hypothesis.

21:42

Speaker B

So, yeah, humans have, I think, a very well-calibrated nose for science. Maybe you could argue there are sociological effects across the community, but ultimately, oftentimes really good scientists know right off the bat whether something is likely to be useful or not. How many attempts did it take before you started to see results that seemed useful to you? Even working on this for, I guess, two years now.

22:02

Speaker A

I think when the Co-Scientist paper came out from Google, it was a really interesting idea to do this tournament-style, or just pairwise, ranking of hypotheses. So I think Co-Scientist is a very interesting counterexample to what we built. What we built is something with either lab in the loop, or data analysis in the loop, or literature research in the loop, where you're iterating on an idea. And Co-Scientist took this very different approach of: let's list all the ideas and then try to come up with a filtration process to get the best hypotheses. So Co-Scientist will produce very long reports, like, oh, we really tested this idea with lots of dialogue, and it's very interesting stuff. I was really impressed with the paper that came out. And then we had this Robin paper, and one of the things that came out of the Robin paper is that the hypothesis that people thought was best was not the one that led to success in that paper.

22:35

Speaker C

Interesting.

23:25

Speaker A

It was in age-related macular degeneration. Basically it's part of the eye. You're going blind because you have this accumulation of debris in the eye and can't clear it out. That's the major cause of blindness in people over 60. Ali, who works on this.

23:25

Speaker C

Yeah.

23:42

Speaker A

He'll cringe when he hears me say that. But something like that. Sorry, Ali. In that one, we went to optometrists or ophthalmologists, I actually get confused on that as well, sorry, Ali, and essentially asked them: which hypotheses do you think are good hypotheses, which do you think would lead to a good mechanism for treating dry AMD? And yeah, they agreed on the top 10. But beyond that, it was kind of noise.

23:43

Speaker C

Yeah.

24:09

Speaker A

And then what we found was that ripasudil was a very good medicine, and it had a mechanism that I think is novel. Although there was lots of debate on X, because I think in 2012 there was a master's thesis which proposed this mechanism on like page 38. I actually think it was a typo. I think they meant wet AMD. But anyway, I won't belabor the point. I will concede that maybe there is one reported example of it in the past. That was a really eye-opening experience for me, because that was the first really serious test where we really went to the lab, and we spent like four weeks on a battery of experiments to see which hypothesis led to a good mechanism and a good repurposed drug. And it was not as correlated with human opinions as I expected. And so since then, I have a lot more faith in these verifier-in-the-loop kinds of scenarios, where you have either data analysis, literature search, a unit test, or whatever, you're going and running the experiment. Anything like that, I think, is going to give you a higher signal than the vagaries of "this one is rated higher" or "we like this one better."

24:10

Speaker C

Yeah. Max Welling called it nature's computer.

25:19

Speaker A

Yeah.

25:22

Speaker C

You have this computational cycle you're running, and nature is part of that.

25:24

Speaker A

Yeah.

25:29

Speaker B

I am curious. So you said that there is a paper which maybe proposed where this molecule came from. But do you have some way of interpreting or understanding where that hypothesis originated? In the absence of that, is there a trace of the train of thought?

25:29

Speaker A

Yeah, yeah, yeah. Actually this is something we pay really close attention to at Future House and at Edison: provenance of information. So our first sort of agent was PaperQA. Sorry about the name; PaperQA sounds like an Ethel set, but it was an.

25:47

Speaker C

Agent, it really does.

26:00

Speaker A

PaperQA, like, every sentence that it outputs has a citation to a page. Right? So there's a lot of provenance, and then we basically built along that philosophy for everything. So Robin, which is the name of this workflow, or whatever you want to call it, that led to this result of ripasudil being a good therapeutic for dry AMD: it has data analysis that shows you which lines of Python code led to the result. And then, okay, it goes to this other model which says, well, based on this literature finding and this result from the data analysis, I believe this is the right thing. But where does the original idea come from, like going after these ROCK inhibitors, which is the mechanism, or the target? Basically enumeration. And so this is like: if you can't be smarter, you can, I don't know, try more times. And I think that was the theory of the Robin paper, that we can put out a whole bunch of hypotheses and then we can filter them. It's like how Co-Scientist did it: you go through a filtration process. But the difference is that in Co-Scientist their filtration process was other LLMs ranking hypotheses with rubrics or personas, and our filtration process was literature search and data analysis. Like, here's some data: is it consistent with the data? Go see if anyone's discovered it in the literature, or if they've disproven it. And I think that's the easy way for AI to succeed over humans: you can try more ideas faster.
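The enumerate-then-filter idea described here can be sketched in a few lines. This is purely illustrative (the class, the field names, and the example hypotheses are all hypothetical, not Future House's actual code); the point is that hypotheses are cheap to generate, and the verifiers (literature search, data analysis) do the real filtering work:

```python
from dataclasses import dataclass

@dataclass
class Hypothesis:
    mechanism: str
    literature_support: bool    # did a literature search corroborate it (vs. refute it)?
    consistent_with_data: bool  # did the data-analysis agent's result agree?

def filter_hypotheses(candidates):
    """Keep only hypotheses that survive both verifier checks."""
    return [h for h in candidates
            if h.literature_support and h.consistent_with_data]

# Enumerate cheaply, then let the verifiers do the work.
candidates = [
    Hypothesis("ROCK inhibition improves debris clearance", True, True),
    Hypothesis("Mechanism refuted in prior literature", False, True),
    Hypothesis("Mechanism contradicts the assay data", True, False),
]
survivors = filter_hypotheses(candidates)
print([h.mechanism for h in survivors])
```

In the real system each boolean would be replaced by an agent call (a literature agent, a data-analysis agent), but the shape of the loop, many cheap candidates in, few verified survivors out, is the same.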

26:02

Speaker B

Something I've heard people say, and maybe I've experienced this in my own life: sometimes hypotheses are kind of cheap, especially in biology. In many ways it's actually easy to come up with what you think could be happening. And it seems to me that verifying is often a big bottleneck, maybe the biggest bottleneck. If you have lots of hypotheses and it costs 1/100th of your runway to test each one of them, you don't have many shots on goal. So how do you make sure that you are actually enriching for good hypotheses?

27:25

Speaker A

Literature and data analysis. Right. There was a time when we used something called tiling trees. A tiling tree is like a literal brute-force method invented by Ed Boyden, Sam's PhD advisor. And basically the idea is: okay, I want to accomplish X; okay, I could try these methods. And then once you pick "I'm going to try this method," you split into two different paths: I'm going to use this method, or not use this method. If I'm using this method, I need to have, I don't know, some kind of substrate. I'm going to try this substrate, or this substrate, or this substrate. And you can basically try to really tile the space of all the options. We tried some early experiments there, and you're right, you run into this thing where some of the hypotheses that come out just don't make any sense, and you are going to waste a ton of effort if you actually test them all. Nowadays, I actually would argue that if you go to an LLM and you ask it to evaluate hypotheses, including some garbage ones, it will probably do as good a job as an expert in the field at filtering them out. That's not always the case.
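The branching structure of a tiling tree, pick a method, then pick a substrate, and so on, amounts to enumerating every leaf of a decision tree. A minimal sketch (the layer names and options here are made up for illustration):

```python
from itertools import product

# Hypothetical decision layers: at each level of the tree you branch on
# one choice, so the leaves tile the full space of experimental plans.
methods    = ["method_A", "method_B"]
substrates = ["substrate_1", "substrate_2", "substrate_3"]

def tile(*layers):
    """Enumerate every leaf of the decision tree (one pick per layer)."""
    return [list(leaf) for leaf in product(*layers)]

plans = tile(methods, substrates)
print(len(plans))  # 2 methods x 3 substrates = 6 plans
```

This also makes the failure mode concrete: the leaf count is the product of the branch counts, so the space explodes quickly, which is why filtering the leaves (by an LLM, by literature, by existing data) matters more than enumerating them.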

28:00

Speaker B

I've actually seen that myself.

28:57

Speaker A

Yeah. But there's a lot of gotchas, and I think people can miss those. But I think they're actually pretty good. And so I'm not as worried about hypotheses that can be failed fast by an expert looking at them. I think now the filtration process really happens in literature, and in looking at biobank data, or what we know from GWAS, or other sources of existing data, as much as you can draw upon.

28:58

Speaker B

So with regard to existing data, maybe another contrarian take is that oftentimes the hardest part is just understanding the context of the data: where it comes from and how you interpret it. I can also think, from my own life, of multiple cases where the data in some sense was there, and you had two people, both experts and very smart, who looked at it and drew very different interpretations. In fact, when we were interviewing Heather Kulik, she had some fun stories about using LLMs. She would find that there would be raw data in a paper which wouldn't agree with the conclusions of the actual paper. And it's straight from the paper. It's not even cross-paper talk or something.

29:24

Speaker A

I'm going to be a really boring interviewee and be like, yes, you're right. This is a hard question. Yeah, to give you something concrete, we have a bioinformatics benchmark. We call it BixBench. We put it out, we've updated it a few times. Some frontier LLMs, when they release their system card, will mention BixBench as one of the things they test on. And we're getting to 60%, 70% correctness on BixBench. And we found that actually we're at the point where humans disagree at this level: humans only agree on 70% of the analyses. And so it's true that when it comes to analyzing data, humans do not agree 100% of the time. There is a certain amount of choice that goes into it. And we try to... so Edison is a for-profit company; we try to sell some of this stuff to companies, and we'll go to some companies who are like, oh, well, we never impute data, imputing data is bad, or whatever. And okay, well, we'll have to change our agents so we don't impute data for them. But then some other companies are like, oh yeah, we impute data, it makes everything easier.

30:09

Speaker B

Right.

31:15

Speaker A

And you know, you want to know what the real modern dark arts are? The AI-resistant area of the world is medicinal chemistry. That is the spot where there's so much superstition.

31:17

Speaker B

Oh yeah, yeah. Everyone is like pseudo-religious.

31:26

Speaker A

Yeah, exactly, exactly. You have to be a survivor, I feel. Otherwise you get burnt out.

31:30

Speaker B

But the religions never agree. Two medicinal chemists will have completely different viewpoints about a functional group.

31:33

Speaker A

Yes, exactly. And I remember this: I talked to somebody who worked with a CRO, and they're like, oh, whenever company X orders anything, we never put boron on any of the compounds, because they hate boron, because there was one program that was killed because there was a boron somewhere in the core and it led to some toxic side effects. So: no boron for this company. This other company, they love things to be fluorinated, because they think it's great for the ADMET properties.

31:39

Speaker B

Right.

32:02

Speaker A

And so there's all this stuff where you reach the point where you're at, I don't know, human bias level, or human disagreement level. And I think we're getting to that point in data analysis. And so of course you will see that if I take the raw data from a paper and I analyze it myself, I will get a different conclusion. One of the cool tricks you can do, back to this brute-force thing, is that I can go to our agent and run it 100 times and take the consensus analysis. Or I can say: even if you make these three different choices in your data analysis, you get the same conclusion, or, this conclusion is somehow sensitive to those choices. There are even words for this: epistemic versus aleatoric uncertainty. Right? Aleatoric means I think it's noise from the data; epistemic uncertainty means I think there are some choices being made, some model differences, that lead to the disagreement. There's a Donald Rumsfeld formulation of this as well, known unknowns and unknown unknowns; same aleatoric-versus-epistemic debate there.
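The run-it-many-times trick, take a consensus, then check whether the conclusion is sensitive to analysis choices, can be sketched with a toy analysis. Everything here is invented for illustration (the data, the impute-vs-drop choice, the 0.5 threshold); the point is separating disagreement that comes from analyst choices (epistemic) from noise in the data (aleatoric):

```python
from collections import Counter

def analyze(data, impute_zero):
    # Toy analysis whose conclusion hinges on an analyst choice:
    # treat missing values as zero, or drop them entirely.
    if impute_zero:
        values = [0.0 if x is None else x for x in data]
    else:
        values = [x for x in data if x is not None]
    return "effect" if sum(values) / len(values) > 0.5 else "no effect"

data = [0.9, 0.8, 0.7, None, None]

# Run the analysis many times under a mix of choices, then vote.
choices = [False] * 70 + [True] * 30   # suppose most runs choose to drop
runs = [analyze(data, c) for c in choices]
consensus = Counter(runs).most_common(1)[0][0]

# If the conclusion flips when only the choice changes, the disagreement
# is epistemic (modeling decisions), not aleatoric (noise in the data).
epistemic = analyze(data, True) != analyze(data, False)
print(consensus, epistemic)  # effect True
```

Here the same data yields "effect" or "no effect" depending only on the imputation choice, which is exactly the kind of sensitivity the consensus trick is designed to surface.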

32:02

Speaker C

Interesting. This kind of leads into digging into Cosmos.

33:01

Speaker A

Yeah.

33:04

Speaker C

So I glanced at the paper, and one of the things that jumps out is that there was a certain class of problems for which it was only 50-some percent accurate.

33:05

Speaker A

Oh yeah, yeah.

33:16

Speaker C

And can you talk a little bit about that? Okay, so if I'm just raw getting 50% accurate answers, and then I'm going into the wet lab and being like, okay, try this, and then it's like, the stupid thing told me to do it. Well, how do you.

33:17

Speaker A

I would say, first of all, that 50% is actually pretty good, because it's rare that experiments in the lab are actually coin tosses. Right? There are usually a lot more outcomes than binary.

33:31

Speaker C

Yeah, yeah, sure. Okay. Yeah.

33:41

Speaker A

But that particular number was human agreement on the interpretation of the results. Okay. And so we asked people to evaluate different aspects of Cosmos. We had them evaluate the data analysis decisions; people were asked to evaluate the literature work, like: do you agree with its findings from the literature? That number that was 50% came from Cosmos' interpretation of some of the analyses. Yeah. So it might go into the literature and find this result and say, wow, this is super exciting, this is amazing. Or it might do data analysis and say, this is a novel discovery, we're really excited about it.

33:42

Speaker C

Yeah.

34:15

Speaker A

And then people would disagree: that's actually not interesting. Or, I don't agree with the interpretation of it.

34:16

Speaker C

So it's like picking bad problems maybe.

34:20

Speaker A

Yeah.

34:22

Speaker C

In the negative class.

34:22

Speaker A

And so I think that 52 or 55, whatever it is, that's interpretation. And so I agree. I think that's where, like I was saying, the frontier right now is scientific taste. Yeah. And so that's what we're working on right now: how do you get that interpretation to match?

34:24

Speaker B

Could you step back and just introduce Cosmos from a high level? Yeah, yeah.

34:39

Speaker C

I would actually be even curious to hear it starting from, like, ChemCrow. And you know, you have PaperQA, Aviary, Ether0.

34:44

Speaker A

Yeah, yeah.

34:53

Speaker C

I'd like to hear a little bit of the lineage and how those different decisions were made. What were the key learnings and how did you get to where you are now?

34:54

Speaker A

Yeah. So I could retcon and tell a really great story about how we arrived at Cosmos. But I will say that, like, to a large extent, we just try a lot of stuff and sometimes it works and sometimes it doesn't.

35:03

Speaker C

Okay.

35:12

Speaker A

You know, I'll say that I'm a builder. I like to build things piece by piece. There's probably some fancy word for it, but I'm like a Lego guy or something. My vision was that we would make an agent that does this part of the scientific process, an agent that does that part of the scientific process, whatever. And so we had ChemCrow, which was going to help us with setting up our medicinal chemistry work. We had ProteinCrow, which we haven't released and I don't know if we ever will, but ProteinCrow is for designing proteins we might need for some part of our workflows. Or we had a data analysis agent, so an LLM plus tools.

35:13

Speaker C

Okay.

35:47

Speaker A

Or we had Ether0. It was like, okay, we noticed that the frontier models can't work with molecules very well, so let's make a model with intuition for medicinal chemistry. And that was what led to Ether0. But then Sam actually really pushed us: let's just see if we can do the whole thing. Let's just try to build an AI scientist. And that was what led to Robin. And Robin was like, let's just take these agents we already have and put them in a workflow, basically. You could express it in a concise Python file: try a whole bunch of ideas, then go see if they all filter through the literature or if they've been disproven, then come up with experiments that you could do in a wet lab (and this is our inventory list), then go analyze all the data, then go back and repeat the process. Right? So that's what Robin was. And we came to Cosmos while trying to understand what is the process that Robin is automating. It came from this idea of a world model. When we first started Edison, we were thinking: what do we want to change about this? What is new here? And so we spent some time thinking about the scientific process, about what is actually going on in my brain, which is that I have some understanding of the world, or the phenomena I'm studying, and that's my world model. And a lot of the actions I take are about trying to update that world model. It's something that changes over time. But it's also something that is practical: I can use it to make predictions, like, I know from this experiment this will happen. That's why it's a model and not just memory or a bunch of papers or something like that. That's how it's supposed to operate in Cosmos.
We tried this idea out, and actually Ludo, who's the first author on the paper, tried a whole bunch of ideas around world models with us, and we kind of thought they weren't really working. We tried a lot of different ways to do this, method A, method B, method C, and they were just okay. And so we all had to take a break. Ludo's project on this world model stuff didn't work, but he's like, I'm going to keep trying it. Ludo's a very stubborn person. So he tried it for, I don't know, a week or two. And then he kind of quietly was like, hey, can you guys come take a look at this? And we're like, wow, this is actually really cool. And then we started building on it and jamming, really. And I think what Ludo figured out is that you have to get this experimental loop going. And the data analysis agent is what got us the loop: if you put that in the loop, it can really update this world model. Because we were trying to build it around literature before, and when you build it around literature, there just aren't really experiments you can do and then see the results; literature was our surrogate, and it just wasn't working. Data analysis actually really lets you explore ideas. And so that was what led to Cosmos. In Cosmos, we basically had all the pieces sitting around: we were working on world models, we were working on a data analysis agent, a literature agent. And we had built a platform for scientific agents, so we had things that can write a LaTeX report, things that can make nice plots. Then we put that all together, and a world model was sort of the glue that allowed it to fit together. An analogy is coding agents: GitHub is sort of the glue. There's some shared repo and everyone works on the repo.
And software engineers have spent, whatever, lots of brain cycles thinking about the way to coordinate and organize working on code together for a long time.
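The loop described here, enumerate ideas, filter them against the literature, run and analyze experiments, fold the results back into a shared world model, can be sketched roughly as below. All the function names and stand-ins are hypothetical; this is not Robin's or Cosmos's actual code, just the shape of the loop as described:

```python
def robin_loop(world_model, propose, literature_ok, run_and_analyze, rounds=3):
    """Sketch: enumerate ideas, filter against the literature, run the
    surviving experiments, and fold the analyzed results back in."""
    for _ in range(rounds):
        ideas = [h for h in propose(world_model) if literature_ok(h)]
        for h in ideas:
            # Verifier-in-the-loop: each result updates the shared state,
            # playing the role the world model (or a git repo) plays.
            world_model[h] = run_and_analyze(h)
    return world_model

# Toy stand-ins for the real agents (all hypothetical):
wm = robin_loop(
    world_model={},
    propose=lambda wm: ["h1", "h2", "h3"],
    literature_ok=lambda h: h != "h2",        # "h2" already disproven
    run_and_analyze=lambda h: f"result({h})",
    rounds=1,
)
print(wm)  # {'h1': 'result(h1)', 'h3': 'result(h3)'}
```

The `world_model` dict here is the "glue" in the analogy: every round reads from it and commits distilled results back to it, the way coding agents coordinate through a shared repo.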

35:47

Speaker C

So the world model is actually like a memory system kind of.

38:56

Speaker A

Yeah, you can think of it as a memory system. We think about it as a model. Like, you can actually put in input and it will output predictions. We think about calibration.

38:59

Speaker C

Yeah, yeah.

39:07

Speaker A

But really it is a big bundle of information that we accumulate over time. It's distilled in some way, and that is what allows us to do this. And I think you can think of a GitHub repo as a distillation. Right? Really there's a long graph of commits that lead up to the current file system in that GitHub repo. Or, I keep saying GitHub, I'm such a corporate shill here. Git. Your git repo is a distillation of all of the work that people put in, in the PRs and the commits. And so I think there's a nice analogy between a git repo and what a world model is. And I think that's just sort of what allows us to automate scientific discovery.

39:08

Speaker B

Well, can you talk about like kind of how you implement a world model or is that sort of like secret sauce?

39:48

Speaker A

That's our secret sauce right now.

39:53

Speaker B

That's fine.

39:55

Speaker A

No, it's fine. People have asked about it.

39:56

Speaker C

So one thing that's notably missing is the, like, simulation, right? Molecular dynamics, or, like, Boltz, or.

39:58

Speaker A

Yeah. I'll help you guys pump up your views here. So I think molecular dynamics is overrated.

40:09

Speaker B

In fact, coming from someone.

40:15

Speaker C

In the.

40:19

Speaker A

Thumbnail, you know. And DFT is overrated. In fact, DFT may be even more overrated than, like, molecular dynamics. I think these methods for materials or.

40:20

Speaker B

For biology or for both.

40:29

Speaker A

For materials.

40:30

Speaker B

Okay.

40:31

Speaker A

And I can explain more about that. Basically, MD and DFT have consumed an enormous number of PhDs and scientific careers at the altar of, you know, the beauty of the simulation.

40:32

Speaker B

Also, random interjection: once I did an estimate, I think pre-ChatGPT, that something like 20% of the world's computing power just went to simulating water.

40:43

Speaker A

Oh, my fucking God. Water.

40:52

Speaker B

Yeah.

40:53

Speaker A

Yeah. I had to deal with so many water simulations. I did DFT simulations of water, and they are so annoying. I used these big computers from the Department of Defense, and I spent like, I don't know, five months. And by the way, this is pre-LLM-training days; five months of compute is actually a really long time. I simulated water with quantum effects, with the Grotthuss mechanism for how a proton hops through water. And it's on YouTube. It's my number one YouTube video, even until now. And it represents, I don't know, a million CPU hours of compute. It was one of the biggest computes I've done, probably the biggest one in my life so far. Maybe Ether0 is bigger, but that took a lot more work. Anyway, and what's the point, you know?

40:54

Speaker B

What'd you learn? What'd you learn?

41:39

Speaker A

All I learned was what set of hyperparameters reproduce some physical effects of water, but none of it was de novo. Right? And this is the issue with molecular dynamics and DFT: they don't model the world correctly. And so we have to invent little stories we tell ourselves about how we're making good inductive biases so it models the world more correctly. Like in DFT, you simulate water at 330 Kelvin when you want room-temperature water. Is room temperature 330 Kelvin? No, it's not. That's a little too hot. Right? And so the issue is that people just make up these things, like, I don't know, GGA or BLYP or B3LYP, all these different methods. They're clearly empirical, and then people bolt them onto DFT and say, look, it's a first-principles method. But actually you made a whole bunch of choices and, whatever, overfit to the validation data to get this to work. And I think MD and DFT are like that, because if you go look at catalysts, which catalysts changed the world? None of them are single-crystal materials that are really well suited for DFT. They always have grain boundaries, they have dopants, they're complicated. Right? And you never capture that with DFT. So I think this is one of the fundamental, I don't know, dichotomies of the world: simulations simulate really boring things really well. They don't simulate interesting things very well. And so that's why I don't do DFT and MD anymore.

41:40

Speaker C

What about the machine learning stuff, like AlphaFold? AlphaFold was trained.

43:07

Speaker A

On X-ray crystallography data. And I think this is the story of MD: MD was supposed to be the protein folding solution. There is a great counterexample, a counterfactual, basically: a group called D. E. Shaw Research, DESRES. They had similar funding to DeepMind, probably more, actually. They tested to death the hypothesis that MD could fold proteins. They built their own silicon, they built their own clusters, they taped them out all themselves. They burned into the silicon the algorithms to run MD. They ran MD at huge speeds, huge scales. I remember David Shaw came to a conference on MD once, and he flew in by helicopter, pretty famous guy, kind of rich. And he gave an amazing presentation about the special computers in a special room just outside of Times Square and what they can do with them. Beautiful, amazing. And I always thought that protein folding would be solved by them, but that it would require a special machine. Maybe the government would buy like five of these things and we could fold maybe one protein a day, or two proteins a day. And when AlphaFold came out, and you can do it in Google Colab, you know, or on a GPU on your desktop, it was so mind-blowing. I always thought that protein folding being solved was inevitable. But the fact that it was solved, and that you can do it on your desktop, just completely floored me. It changed everything.

43:13

Speaker C

This is like the bitter lesson on steroids.

44:35

Speaker A

Yeah, I don't even know what it is, but imagine ChatGPT came out, but instead it was like, oh, you can just run it on your phone or locally on your own desktop. That's the level of shock. And it gets down to this thing: humans are really bad at estimating problems that aren't human-made problems. Protein folding, we all thought, would require a huge amount of compute. Very challenging problem. The hardest problem in the world. Right? And it turns out that, I think the numbers are now like 10,000 GPU hours, you can train a good protein folding model. It actually turned out to be barely an inconvenience.

44:38

Speaker C

Therefore. Why not?

45:07

Speaker A

Oh, therefore: protein folding was solved highly efficiently based on experimental data. They took X-ray crystallography; that's what DeepMind did, they took X-ray crystallography data. DESRES tried the first-principles method, and it's a nice head-to-head comparison: two very well-resourced groups, they tried different ideas, and machine learning on experimental data beat out first-principles simulation by a very large margin.

45:08

Speaker C

And so why isn't, like, Boltz or whatever inside of Cosmos? Like, why isn't there a tool that can run it?

45:32

Speaker A

Oh, we have Boltz inside of it. We have Boltz, BoltzGen. Yeah, we have that inside of Cosmos.

45:38

Speaker C

Okay.

45:43

Speaker A

I mean, I think in the version that we have for people to just sign up and use, it's not in there. Yeah. But you know, you can imagine: Modal or Lambda or Tamarind or 310, there are all these companies that basically wrap a lot of these deep learning protein design tools or chemistry design tools. They wrap them in an API, and you can just give that to Claude Code if you want, or you can give it to Cosmos, and you can be like, hey, if you want to design a protein for X, use.

45:43

Speaker C

These tools. Your mechanism, it sounds like, or one of the primary mechanisms that has been successful, is: enumerate a whole bunch of possibilities and filter. So how do you think about serendipity and out-of-distribution thinking, and how far have you gotten with that?

46:09

Speaker A

That's a great question. I guess the short answer is... so this is the domain of CBRN, chemical, biological, radiological, nuclear weapons, or, I don't know, safety.

46:26

Speaker C

Yeah.

46:38

Speaker A

This domain has been explored a lot in history by a lot of organizations, and I would say that a big question mark for us a few years ago was: how much of this stuff is intellectually bottlenecked?

46:38

Speaker C

Yeah.

46:52

Speaker A

Like, how often are people like, oh wow, I want to cause harm, but I need to know some facts, and could LLMs make that easier or faster or anything like that? I think the first set of answers, in 2023, was basically no. You can go find the synthesis route for many dangerous compounds on Wikipedia; people know what the targets in the human body are that are hit by most biological weapons. It's not really that much of a mystery. So I don't think there was a lot of new ground when LLMs first came about. Then there was a lot of concern about laboratory protocols: could agents or LLMs reveal some tacit knowledge that maybe people couldn't find on Wikipedia? Or maybe for making something there's some technique that is required when you scale it up in size, or maybe there's some way to get around tracking lists by ordering different compounds. And that, I think, was really well tested by a few different labs, not me, but there were some groups that spun up that started making tests for this, and labs pay attention to it. I think it's really been put into process, where LLMs will shut down or be filtered in those scenarios. But I think that is actually an area where there is some risk. And so this is something that people pay attention to for open-source models, and there's still some discussion there. But to a large extent it's not really been greatly accelerating in practice, or at least I haven't seen much evidence of it. And again, I think it comes down to the fact that the information is not really gated: if you look hard enough, you can find most of what you would need to get up to no good in the public domain already. But I think now the next frontier is: can it somehow help you with real-time protocol troubleshooting, more in the loop, and especially on the computational side of things?
There are some scenarios now coming into focus that could be more dangerous, or more intellectually bottlenecked. And so I think people are trying to pay attention to that to some extent. There was a first wave where we thought this could unlock a lot of stuff, and I don't think it came to pass. I think there's now an emerging second wave: there are some actually new scenarios that were just too far-fetched to consider two years ago that I think are now realistic. Some smart people are paying attention to it, but I don't think it's solved yet. I don't know, it's very vague.

46:53

Speaker B

So I guess one kind of differentiator: there's a lot of talk about AI safety in the broader LLM and ASI space, and there it's jokes about paperclip-maximizing robots or something. But the core threat here is more a malicious actor using this as a tool to accelerate something dangerous. And kind of the first-order hypothesis is that you basically already have to be an expert to effectively create a bioweapon or a chemical weapon, and experts already know how to do this.

49:18

Speaker A

Yeah, I think, you know, each of the categories in CBRN is a little different. But to a large extent it's a lot of pushing material around. The classical example, nuclear: it's a lot of centrifugation, a lot of ultracentrifugation, a lot of high pressures or high RPMs.

49:52

Speaker B

Yeah.

50:09

Speaker A

And so maybe you can get smarter about how to set up the economy of scale to do that with an LLM. But to a large extent, I think you can call your friend in country X and they can tell you what the steps are. I don't think it's that much of a secret. It's just a lot of moving material around, and I don't think it's meaningfully accelerated. Now, that said, there are all kinds of dumb dual-use things, like maybe you want to call a company that makes centrifuges, and you want to make sure that they sell them to you, and they go through some KYC steps, and maybe an LLM can get you through the KYC faster. And that's a dumb thing: okay, yes, email makes it so that you can order centrifuges off the Internet more easily; is email a dual-use technology? Yeah, to some extent it is. So I think there are a lot of weird second-order things that we don't pay attention to in AI safety. Does it make KYC easier? Does it make it easier for people to know where to order this from, or what the expected price is, or what you should order first? All those simple logistical things, I think, are accelerated by AI just as a consequence of AI being an accelerating technology. But certainly, shit, guys, there's some scary stuff. I try not to think about it too much. I don't want to get too political, but I do think that right now the United States government is maybe taking a slower, less intensive look at safety. But there are definitely people, I think, in other spaces than the US government thinking about it hard.

50:10

Speaker B

Do you think it's a thing people need to spend more time on?

51:45

Speaker A

I do get waves of angst about AI. I'm sure many people living in San Francisco get waves of it. And sometimes I think that there isn't enough work being done on it, and then sometimes I think, wow, I need to mellow out; we have lots of time to think about it. So what is my opinion? I don't know. I think my opinion is not fully formed.

51:48

Speaker C

Yeah. You and Sam have done a lot of thinking about funding science and the future of science. You've been vocal about the reproducibility crisis and other things. First question: why this focused research organization, or FRO, structure?

52:13

Speaker A

Yeah, focused research organization.

52:30

Speaker C

Yeah. What does that get you that you don't get from academia or a big lab or whatever?

52:31

Speaker A

A nice network of people. I think Edison is a real... of course, I think Edison's going to do great, but it's a mystery what's going to happen. So I don't think we've had as much friction there as you might expect. But yeah, this is all stuff that Sam and I think about all the time: how do you balance stuff like this? How do you balance the economics? You know, there are some venture-backed companies that are paying cash salaries over a million dollars, and it's insane to me that you would use all of the cash from your equity financing on these.

52:38

Speaker B

Insane salaries. Although in terms of total spend, compared to GPUs it can still be a small fraction of your burn. So sometimes it kind of makes sense.

53:12

Speaker A

Yeah, that's one way to think about it.

53:20

Speaker C

So this is a good lead-in to: you are automating science in some capacity.

53:24

Speaker B

Yeah.

53:33

Speaker C

So where does that leave scientists?

53:33

Speaker A

So I think this is Jevons paradox — we can try it here. Let me start with the contrast: if we automate taxi cab drivers, there's not going to be an increase in people needing to go places. Maybe there'll be somewhat of an increase, but there is a finite amount of time people will spend in cars, so there's an upper limit. When you automate that, it's a scarcity thing — you're displacing jobs when you automate driving. In science, I don't think there is a finite appetite or a finite capacity. Science is not a scarcity thing where there are 100 more discoveries left to be made and then we'll be done, and so we're displacing jobs. Instead, I think if we can make science go much, much faster, there will be no decrease in demand — there will actually be an increase in demand that matches whatever amount of automation we have. And so my vision for what a scientist would be in the future is that they will be, I don't know, agent wranglers or Cosmos wranglers: they're exploring 100 ideas simultaneously, or they're working with systems like ours to make 10x the discoveries, 100x the discoveries. Because I think there's an unlimited amount of scientific discoveries to be made, there's no scarcity set where basically we will displace them all. Now, that's what I would tell a first-year PhD student: everything's going to be just fine. But when it gets into the nuts and bolts, I do agree that this is going to be a really hard thing. If I am CEO of a company that does science — a pharma company or a materials science company, or the R&D arm at IBM — I might think, well, I could spend a million more dollars on compute for the AI scientist, or I could hire 10 more people.
I might just choose to go with the AI scientist, because to a large extent hiring people is hard, right? And hiring an AI scientist is probably a little bit easier. So I think there could be some friction. But another thing is, science is in some ways closer to art, in the sense that there is a large number of people who just appreciate good science. If you get published in Nature, it's not just because it's really going to be world-changing — of course that's part of it — it's also because people are like, wow, this is really interesting science. So the enjoyers of science are also scientists. And so it's kind of hard to imagine a scenario where there aren't scientists as the consumers of science. And if they're going to be consumers of science, they're also going to be some of the producers who are involved in the process itself. I don't know if that makes any sense.

53:35

Speaker C

Yeah, you've touched on this. The question in my mind is just: what does a scientist do then?

56:11

Speaker A

There's a great short story by Ted Chiang, I think from around 2003, about this. At first, scientists were displaced and became the interpreters of what the AI scientists were doing — the scientists read the AI scientists' papers and translated them for popular science or whatever. And then after that, they couldn't read the papers anymore, so they were left behind and had nothing to do; they just sat around. But the thing is, you have to translate science for it to make any impact. Science cannot exist by itself. Engineering can exist by itself: if you give some kind of system the goal of making a material I can build a space elevator out of, you could not participate at the beginning of the process or in the middle, and just come by at the end and say, okay, follow this recipe. But science — what's the origin of life? Has water run on other planets? Why is one catalyst better than another? — that has to hit human eyes and human brains at some point. So I think a human has to be involved in the process.

56:15

Speaker C

I don't want to be contrarian — but yeah, be contrarian. Why does a human have to be involved?

57:20

Speaker A

Why does a human have to be involved? Well, the human has to be involved at some point to say: yes, this is good science, or this is bad science.

57:24

Speaker C

Okay, so it goes back to taste.

57:29

Speaker A

Yeah. But I don't know, maybe you're right. Maybe there is no point for humans. Maybe we'll be like — what is it — Sora, the AI slop app. But even in Sora there are still humans at the end clicking on the videos or something.

57:32

Speaker C

Yeah.

57:42

Speaker B

So the Sora analogy brings up an interesting point. Is it possible that, due to the biases of AI science — if we really go all-in on it — there's still a market for boutique human science, the way there are still people who want to paint things the old-fashioned way? But more to the point: does it become even more important to have a human who is actively doing their own exploration? Because there will be large blind spots and biases, baked in by the models' training data, that you'll never be able to overcome — and without a human, you'll always get stuck on a blind spot that will never go away.

57:44

Speaker A

Bio, which is a company in Oakland or in Emeryville — they do really cool stuff with automation. I think they're going to be testing this theory: if that's the bottleneck, we can see evidence of it, because they're going to start doing really well. It could be true.

58:28

Speaker B

I still want to say that all of those, in my mind, are scoped in terms of R&D for pharma or bio; none of them are attempting to answer big fundamental questions. And maybe there are different levels when I think about that — it seems like the focus of Future House and Edison is much more towards the R&D end of science. But I have some background in fundamental physics, so: is there any thought about how you take on, say, dark matter candidates? I just think the data to really give us a complete story is not there yet.

58:42

Speaker A

You know, I'm sure everybody at every company is the biggest critic of their own product. We think Cosmos is great, but there's a very large area for improvement.

59:26

Speaker C

So with Cosmos, there's an open, accessible-to-everybody version. Do you provide other labs access to a version that is less open?

59:39

Speaker A

We have a version of Cosmos that has bigger resources: it can run for longer, and when it does data analysis, it'll have a GPU. We use that for things like machine learning experiments — if you want to know whether it's better to pre-train first on noisy data or not. We have pre-release models that are coming out, and we try those. So I guess yes, we do, and we have research partnerships with companies where we build something specific for them, and that is something we think about. But broadly I would say the Cosmos that's on the website is pretty close to the best we have internally.

59:52

Speaker B

I have a question. You previously stated that you think natural language is the language of chemistry — that the future of chemistry is language.

1:00:35

Speaker A

Yeah. Yeah.

1:00:47

Speaker B

Okay. So I wonder, do you still believe that?

1:00:48

Speaker A

Good question. I would say yes, I still believe that. In that opinion article — at the time I wrote it, maybe three years ago now, maybe 2023 — my point was that we have models for predicting solubility of compounds, we have data about very large populations, we have papers, and we have code, and the only way to bridge all that information is natural language. The argument was that humans, whenever we can't bridge information — if I can't talk about my code or some idea with you — will invent words until we can get the point across. Humans are always innovating on language to make it represent all known observations; people innovate on language to represent whatever code pattern they have. This is the only shared activity we've been doing for this long: coming up with words to represent everything we know. So I think, for that reason, natural language is the only possible way to connect all the different pieces of data we need in biology, medicine, or any domain for that matter. There are some caveats to this. If Yann LeCun were here, he would make an argument about world models, or vision, or embodiedness. There are arguments against natural language: maybe there's something more, and it's not the complete story. Or maybe natural language imposes limitations you cannot exceed, because you are stuck in this abstract space that was invented by humans and you can't escape it to, like, touch something.

1:00:52

Speaker B

Yeah, it is an abstraction, right? And scientists basically work exclusively in abstractions, to some degree. I found that interesting, because it seems like most scientists, when they explain things, do explain them through language. But many conversations — maybe most — at some point result in people drawing diagrams or something. Chemistry — biochemistry largely, or medicinal chemistry — is oftentimes a language of graphs, right? I mean, bonds are abstractions, yes, but they're pretty good abstractions for many cases. Or geometry — thinking about a protein as the geometry of a protein. I think that's how a lot of scientists like to think about things. So I find it interesting that you are focusing primarily on language. Have you thought about essentially a multimodal version of this, where when it comes across a SMILES string, it doesn't just say, oh, this is a SMILES string, but: this is a graph, this is a representation of some higher abstract object?

1:02:27

Speaker A

You're absolutely right. And the problem with this — Jacob's ladder or whatever you want to call it — is: yes, you can call a molecule by its name, you can show the graph. Then you go to a molecule like ferrocene — well, it doesn't really have bonds in the usual sense, so you're like, well, we need to draw it differently. Then you go to a molecule like, I don't know, glycine betaine, with this dihedral angle — it's not actually the one thing I drew, it's actually an ensemble between this conformer and that conformer. Then you go to benzene: not only is it an ensemble of different conformers, it has electron density, and you can't really ignore the electron density in benzene — you need to treat it correctly. And then, well, you can't actually represent the electron density that way; you have to look at the correlation of the electrons individually, because you can't really model benzene with, like, DFT — a density functional — you have to look at the electron correlation. And then, well, you can model the electron correlation, but these things in solution have relativistic effects, because there's a whole bunch of stuff around them, so you really have to have relativity in there. And now you've got the relativity and the electron correlation, you have the bonds and the conformers — but you really need to think about the cosmic radiation background, because it does actually impact everything, and there is some energy there. And before you know it, you've run out of compute, or whatever resource you're using to model this. So you have to draw the line somewhere. Natural language, like I said, is where humans have worked for a long time to make it — what's the word? —
sit right at that boundary: still abstract enough that you don't need to know all these details, but still granular enough, concretized enough, that you can actually make use of it. There may be some other representation — multimodal might turn out to be it, or video, or, I don't know, some other fusion you can make. I like natural language because we all worked really hard to put it right at that boundary. And I do agree, sometimes ideas slip and they can't be in language — you have to get out the whiteboard, or you have to wave your hands around; maybe then you need that degree of freedom to communicate.
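The "SMILES as a graph" idea from the question above can be made concrete with a toy parser. This is a hypothetical sketch for illustration only — it handles the organic subset, branches, and ring closures, but ignores bond orders, charges, isotopes, stereochemistry, and implicit hydrogens, all of which a real toolkit like RDKit handles:

```python
import re

def smiles_to_graph(smiles):
    """Toy SMILES -> (atoms, edges) parser for illustration only.
    Handles the organic subset, branches (), and ring-closure digits;
    ignores bond orders, charges, isotopes, and stereochemistry."""
    token_re = re.compile(r"\[[^\]]*\]|Cl|Br|[BCNOPSFI]|[bcnops]|[()]|\d")
    atoms, edges = [], []
    branch_stack, rings = [], {}
    prev = None  # index of the atom the next bond attaches to
    for tok in token_re.findall(smiles):
        if tok == "(":
            branch_stack.append(prev)        # remember the branch point
        elif tok == ")":
            prev = branch_stack.pop()        # return to the branch point
        elif tok.isdigit():
            # Ring closure: bond the two atoms carrying the same digit.
            if tok in rings:
                edges.append((rings.pop(tok), prev))
            else:
                rings[tok] = prev
        else:
            # Atom token: pull the element symbol, e.g. '[nH]' -> 'N'.
            symbol = re.match(r"[A-Za-z][a-z]?", tok.strip("[]")).group(0)
            idx = len(atoms)
            atoms.append(symbol.capitalize())
            if prev is not None:
                edges.append((prev, idx))
            prev = idx
    return atoms, edges
```

The point is just that the same string supports two readings: a token sequence for a language model, and an explicit graph for anything that wants topology.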

1:03:35

Speaker C

Just digging in on this a little bit: famously, quantum mechanics is, like, indescribable, right? There's an argument that you cannot understand quantum mechanics with words, or with our preconceived understanding of the physical world, because it doesn't behave like the macroscopic world — so the only way to understand it is through mathematics. And I largely see language as the joining language of science as well. But I wonder if that's not true for many domains, and quantum mechanics is just the one that hits you in the face.

1:05:42

Speaker A

I mean, I don't know, actually. I think there are, like, seven principles of quantum mechanics — or five, or something like this — that you can actually express pretty concisely in language. I agree that you need to look at the consequences of them; you need some mathematics. I don't know — this is like a challenge. I think you could actually describe a lot of quantum mechanics in language. But I see your point. And I guess I'm a realist: when I talk to my kids, maybe I will say, okay, let me draw this for you. I don't insist that in our house everything is described with natural language. So I agree with you there. I think maybe we can be a little flexible with natural language and include equations and SMILES strings in it, and we can get a little bit farther. So maybe that's okay. But some people like optionality — oh, it could be this, or it could be that. I'm somebody who likes to take strong opinions and see how much farther they can get me. And I think in my career it's actually been better for me to take strong opinions which, in my deepest of hearts, I know are maybe not correct or not fully correct — but once you take these strong opinions, you can move many steps down the road. For example, at Future House, we took the opinion that scientific agents are the future, and that allowed us to skip a lot of steps, because a lot of other people were saying, we need to build a foundation model for X, and we just skipped all that. And if you were unopinionated and kept your optionality — I can think of a famous example of a different company that liked the optionality, and they wasted a lot of time on foundation models or something — then I think you get stuck.
So that's one of my strong opinions: natural language is a way to join all these different domains. It may not be a correct opinion — it may be more subtle or more complicated — but it's allowed me to get very far. I'll drop it someday and maybe find a new one.

1:06:17

Speaker C

Yeah, not yet, though.

1:08:11

Speaker A

That's my meta opinion on the matter.

1:08:13

Speaker B

The ether0 story on your blog I find hilarious and kind of awesome. When I was a kid, I loved the genie, monkey's-paw concept of be careful what you wish for, because you just might get it.

1:08:15

Speaker A

Yes.

1:08:29

Speaker B

Maybe just a quick story — can you talk about that? It was really fun.

1:08:30

Speaker A

ether0 was a hell of a project, because conceptually it was a very short project: hey, people have made a lot of progress on verifiable rewards in math and code — let's see if we can do it in chemistry. Chemistry is not a verifiable field, right? Of course you can go test something in the lab, but we had to think about all these ways we could make chemistry verifiable. One of the ones we settled on was: make a molecule that has three nitrogens, two oxygens, ten hydrogens, or something. We thought that was a pretty verifiable question. But every time we would train a model, it would find some new, insanely weird trick to generate these molecules. I'll just tell you one of the examples. It would make these molecules, and we would do some checks to make sure they had the right bonds, the right number of electrons, the right number of atoms, and stuff like that. But it would solve the problem in any way possible — it would just put all the nitrogens over here, put all the oxygens over here, which gives you things that don't look good. So we started coming up with these rules: let's check that it followed these good practices, and these good practices. We found ourselves in — it's like the opposite of the bitter lesson, the boutique lesson, where you try to make everything custom. But one of the things it kept doing is putting these nitrogens in a row — one nitrogen, two nitrogens, three nitrogens, all in a chain. And if you have three nitrogens, it's explosive; two nitrogens, it's bad; four nitrogens, you can't make. And I kept telling everyone, when it made these six-nitrogen compounds, that they're just literally impossible.

Many of the people on this team were computer scientists, and one of them one day sent me this: on the cover of Nature — that day on Nature's website — somebody had made a six-nitrogen compound. This was somebody's whole career, delivering this compound, because it's the most unstable, insane compound you can make. It's some ridiculous setup, and the spectroscopy to prove it was very difficult — I don't know how they did it; it was an amazing accomplishment. "Look, Andrew, it's not actually impossible." And it was so funny to me that our model was sitting here spitting out these six-nitrogen compounds in 2024 or 2025, and the paper just happened to come out that year that mankind had finally made a six-nitrogen compound.
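The kind of check described here — verifying a proposed molecule's elemental composition — can be sketched as a toy verifier. This is a hypothetical illustration, not the actual ether0 code: it counts only heavy atoms with a crude regex over a SMILES string (implicit hydrogens are ignored, and a real verifier would use a proper cheminformatics library such as RDKit):

```python
import re
from collections import Counter

def heavy_atom_counts(smiles):
    """Crude heavy-atom count from a SMILES string (toy sketch).
    Implicit hydrogens are NOT counted, so H targets can't be checked."""
    counts = Counter()
    # Bracket atoms like [NH4+] or [nH]: grab the element symbol only.
    for m in re.finditer(r"\[([A-Za-z][a-z]?)[^\]]*\]", smiles):
        counts[m.group(1).capitalize()] += 1
    # Strip bracket atoms, then tokenize the bare organic-subset atoms.
    bare = re.sub(r"\[[^\]]*\]", "", smiles)
    for m in re.finditer(r"Cl|Br|[BCNOPSFI]|[bcnops]", bare):
        counts[m.group(0).capitalize()] += 1
    return counts

def composition_reward(smiles, target):
    """Binary reward: 1 only if every targeted element count matches.
    Only the elements named in `target` are constrained."""
    got = heavy_atom_counts(smiles)
    return all(got.get(el, 0) == n for el, n in target.items())
```

A constraint like "three nitrogens" then becomes `composition_reward(smiles, {"N": 3})` — and, as the episode describes, a model will happily satisfy it with chemically absurd structures unless you pile on further checks.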

1:08:36

Speaker C

Do you think that those were actually synthesizable, even under these extreme circumstances?

1:10:44

Speaker A

Our model? It was just reward hacking.

1:10:49

Speaker C

Okay.

1:10:52

Speaker A

The model was just so creative in ways to reward hack. Another one we did: when it would propose a reaction — make this compound, tell me how to make this compound — we would try to make sure that all the reagents were purchasable. Like, you could actually purchase them; they were not made up.

1:10:52

Speaker C

Yeah.

1:11:10

Speaker A

And the reason we came up with that is that originally it would just take the end compound, remove one atom, and say: buy this, then put the atom on. And it's like, okay, well, I wish it worked like that. So they have to be purchasable. And then we thought it might be too hard if they all had to be purchasable, because sometimes you actually order things custom — so we just required that one be purchasable. And the first thing it starts doing is putting nitrogen in there, because nitrogen is purchasable — and it has no participation in the reaction.

1:11:10

Speaker B

Right.

1:11:37

Speaker A

Like, oh my God. Okay, so then it's: it has to be purchasable, and it has to participate in the reaction. It starts putting in acid-base chemistry — put an acid here; acids are purchasable, and it'll move one atom. Okay, fine — it can't be that; everything has to be purchasable. Then I find myself sitting there one day building this ridiculous catalog of purchasable compounds, and a Bloom filter so it can go fast enough in our training loop, and I'm like, why am I doing this? How did I get here? It was really funny, because training transformers on just data — supervised training, where you have the inputs and the outputs directly — is very nice, relaxing; things are robust, things go pretty smoothly. When we do these verifiable rewards, where you have to write a bulletproof verifier, it is really difficult. We had so many models trained only to find out they were hacking some other random thing in our setup. It's really hard, and I don't envy the frontier labs that have to do this at massive scale, because we had a lot of adventures in ether0. You guys should read the blog post.
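A Bloom filter like the one mentioned here gives constant-time, memory-cheap membership checks against a huge catalog, at the cost of a small false-positive rate — it can occasionally say "purchasable" for something that isn't, but it never misses an item that was actually added. A minimal sketch, with made-up sizes and catalog entries:

```python
import hashlib

class BloomFilter:
    """Minimal Bloom filter sketch: k hash positions per item over a bit
    array. False positives are possible; false negatives are not."""

    def __init__(self, size_bits=1 << 20, num_hashes=7):
        self.size = size_bits
        self.k = num_hashes
        self.bits = bytearray(size_bits // 8)

    def _positions(self, item):
        # Derive k bit positions from overlapping windows of one SHA-256.
        digest = hashlib.sha256(item.encode()).digest()
        for i in range(self.k):
            window = digest[2 * i : 2 * i + 4]
            yield int.from_bytes(window, "big") % self.size

    def add(self, item):
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, item):
        return all(
            self.bits[pos // 8] & (1 << (pos % 8))
            for pos in self._positions(item)
        )
```

In a training loop you'd load the whole purchasability catalog into the filter once, then test each proposed reagent in effectively constant time — which is the point of reaching for this structure instead of a database lookup inside the reward function.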

1:11:37

Speaker B

Definitely read the blog post.

1:12:40

Speaker A

It's very fun.

1:12:42

Speaker B

Great read.

1:12:42

Speaker C

GRPO?

1:12:43

Speaker A

We did make some modifications to GRPO, yeah. I used to know all the names of these modifications — I think DAPO is one, and the clipping we did was special. We explored a lot of that stuff.

1:12:44

Speaker C

Yeah.

1:13:00

Speaker A

And it was also one of those things where you think the hyperparameters are wrong, the algorithm is wrong, and then you find out it's just because you had somehow sorted the reagents alphabetically when you made your training data, but you didn't sort them in the test data — and the model was just barfing, because its whole strategy was to exploit something in the way you sorted things. So yeah, we explored a lot of different methods, and I learned a lot about chemistry, a lot about nomenclature, and actually a lot about medicinal chemistry as well — more than I ever wanted to.
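The sorting bug described here is a general data-hygiene lesson: any normalization has to go through one shared function applied identically to every split, or the model can key on the incidental difference. A hypothetical sketch of that pattern (the field names are made up):

```python
def canonicalize_reagents(reagents):
    # ONE shared normalization used for every split: sort reagent strings
    # so the model cannot exploit incidental ordering differences
    # between training and test data.
    return sorted(reagents)

def make_example(product, reagents):
    # Build a train/eval record from the canonical form only, so the
    # same raw data always serializes to the same example.
    return {"product": product, "reagents": canonicalize_reagents(reagents)}
```

With this in place, two records that differ only in reagent order serialize identically, so ordering can never become an exploitable train/test artifact.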

1:13:01

Speaker C

Awesome.

1:13:32

Speaker B

If you want to do some engineering, check out Edison Scientific — I think they're hiring for lots of interesting things, everything from scientists to infrastructure engineers.

1:13:33

Speaker A

Yeah.

1:13:44

Speaker B

Thanks again, Andrew.

1:13:45

Speaker C

Yeah, thank you very much for joining us.

1:13:46