🔬Why There Is No "AlphaFold for Materials" — AI for Materials Discovery with Heather Kulik
Heather Kulik, MIT professor of chemical engineering, discusses the intersection of AI and materials discovery, explaining why there's no 'AlphaFold for materials' yet. She covers her work using machine learning to accelerate materials discovery, including a breakthrough in creating polymers four times tougher through AI-discovered quantum mechanical phenomena.
- AI in materials science is most powerful for solving multidimensional optimization problems with 7+ objectives, providing 100-1000x speedup per dimension
- Current machine learning models for materials often fail catastrophically when applied to real problems, with foundation models being only 5x faster than traditional methods
- The biggest bottleneck in materials discovery is the interface between computational predictions and experimental validation
- LLMs have Wikipedia-level chemistry knowledge but fail at basic tasks like designing molecules with specific atom counts
- Materials science lacks the experimental ground truth datasets that enabled AlphaFold's success in protein folding
"The thing I constantly do every time an LLM is updated is I just ask it, please design me a ligand that has 22 atoms. I can never get an answer that has 22 atoms."
"ChatGPT is super good at Wikipedia level chemistry knowledge."
"If I could just give up ever doing a DFT calculation again and just rely on machine learned potentials and if they were two orders of magnitude faster than the traditional approach, that would change how we're doing science."
"You should learn chemistry well enough to know when these models are right or wrong. And if you don't know any chemistry at all, it's hard to know if you're assessing correctly."
"There's a school of thought that why should I bother to learn chemistry or physics or whatever when ChatGPT has PhD level understanding of that?"
0:00
Anyway, ChatGPT is super good at Wikipedia level chemistry knowledge. I'm really interested in molecular design. Like, how do you find a new ligand that can go into a transition metal complex? And what that means is that some combination of atoms and it's going to bind to the metal and it's going to change its properties. The thing I constantly do every time an LLM is updated is I just ask it, please design me a ligand that has 22 atoms. I can never get an answer that has 22 atoms.
0:12
All right, we're really excited to have Heather Kulik here. She's a professor of chemical engineering at MIT. Heather has done some amazing work in materials science and computational chemistry. But we're particularly excited to have her today because, for almost her entire career, she has been working at the intersection of data-driven methods and AI, applying them to improve and understand materials. And she has a lot of really interesting opinions about what works and how you approach these problems to get the most out of them. So, yeah, we're really excited to have you here. And maybe to get started, can you just tell us about one of the coolest things you've done in your career, for a kind of AI engineering audience?
0:45
Yeah. So my group, we work a lot in accelerated discovery of new materials. When I first started out, we were just really using AI to make predictions we'd normally make with computational models, just make them faster. But the question I would often get when we were doing that was, okay, but what's surprising? What's something from AI that I wouldn't have already known if I were a really smart chemist or a really smart materials scientist? And you make all these computational predictions; has anyone actually made in the lab something that you predicted? Recently I was able to do a really nice demonstration where the answer to both of those questions was very clear from the work. We were able to screen with artificial intelligence, at scale, a set of tens of thousands of materials, where each individual experiment, if it were done in the lab, would have taken months to years. And through AI, we uncovered this sort of unexpected chemical phenomenon that led to an emergent property in what's known as a polymer network, so, plastics, that would make the polymer about four times tougher. And when we showed the design that AI had come up with to the experimentalists, they were really surprised. They would never have come upon this on their own. And then we were able to convince them to make it in the lab, and in fact, it was this tougher material. And where this has applications is, if we can make plastics tougher, then we can get more use out of them, and it'll ultimately address some of the problems we have with overall durability and use of plastics. So I think that's an example of some of the promise of AI in materials discovery.
1:36
Cool. So can you dig into that a little bit? What was the surprising chemical discovery there?
3:23
So it's sort of hard for me to think about how to explain it without getting too deep into the chemistry. But basically these are molecules that have to break apart. And when they break apart, they make the overall structure that they're in tougher. So a little part of the material breaks and that helps to dissipate the force. Normally the way you would think about making it easier to break apart these small molecular components might be to create a hinge so they can kind of peel open instead of sliding apart. But what we discovered was that there was a fully quantum mechanical phenomenon. There was really no way for us to predict this, you know, based on anything else where the electrons just move around in a different way. So that at this moment where the molecule is going to break apart, it's a lot more stabilized. These types of concepts, they're sort of similar to what's kind of known about how catalysts and enzymes work. But it had never before been shown in these polymer materials.
3:30
So this is sort of like the fuse in the Bay Bridge that allows the bridge to keep its structural integrity during an earthquake by having a controlled break. Is that kind of the idea?
4:30
Yeah, yeah. So we weren't the first ones to discover that phenomenon on its own. The general phenomenon that putting little places that could break to make the network stronger, that was published in Science magazine a couple years ago. But the specific way we came up with to design the material to do this, that was. That was our new contribution.
4:43
How did you. You mentioned that, you know, you started off in accelerating kind of existing methods using sort of enhanced computation. What caused you to take that leap to more machine learning based methods?
5:02
So, you know, I was drawn to data-driven discovery pretty early on, sort of before I even knew the phrase machine learning. And I guess I was just really excited by what you could learn from patterns and data. Back then we were trying to call it cheminformatics, and just sort of trying to think about, you know, in what ways could you unearth trends in data? Because I started my career actually working kind of one molecule at a time or one material at a time. And I was just impatient. I wanted to be able to understand not just one molecule at a time and write one paper about it, which is something people would have been happy to do back when I was starting my career in the mid-2000s, but to actually unearth broader trends in how you understand how a material is going to behave. Somewhere around 2015, 2016, I realized it was a bad idea to call things cheminformatics, and it was a good idea to start calling things machine learning. And I had a brilliant student, Jon Paul Janet, who is now, I think, an assistant director at AstraZeneca in Sweden, running their inverse design program. He and I originally talked about all sorts of ways of thinking about materials design, and he very quickly adapted that into training neural networks. And that's when I thought we were in the first hype cycle, the first wave. But I think compared to what's going on right now, it was a tiny baby wave.
5:17
I read in your paper that that was actually a class project or something.
6:55
Yeah, yeah, that's right. You know, he just said, I have to do something for my homework. And that's how we got into it.
6:59
I've also read in your papers that you've done a lot of work, slightly more recently, on active learning. Can you talk a little bit about that?
7:06
Yeah, yeah. So even that polymer example I was giving, that would have been active learning in principle, but we sort of stopped after one generation because we had exhausted the space. But I think one of the areas where machine learning, just with what's out there right now, has the most promise in the chemical sciences is in solving multidimensional challenges. So right now we're working on a project in metal-organic frameworks where we're trying to solve trade-offs relevant for direct capture of CO2 from the air. In order to find a material that's good for that, we would worry about its cost; its stability in, say, aqueous, humid environments; its ability to take in CO2 over other molecules; its mechanical stability, is it going to hold up under force; is it thermally stable, can you heat it up and will it be okay? I'm just naming a few, but in total, right now, in an active learning campaign, we're working on seven different objectives. And usually, even for a not-so-accurate machine learning model, you get at least a 100- to 1000-fold speedup for every dimension you're optimizing over. So the real promise is going to be in searching for that needle in a haystack with, say, seven objectives, and doing something where you're not waiting for the models to be accurate before you start doing that optimization. That's really the promise of active learning. Yeah.
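To make the active-learning idea concrete, here is a minimal sketch of one acquisition step. Everything here is illustrative, not the group's actual pipeline: the toy surrogate, the seven objectives, and the upper-confidence-bound scoring are just one common way to pick which candidates to evaluate next when the model is still inaccurate.

```python
import random

def acquire(candidates, predict, n_pick=3, kappa=1.0):
    """Rank unlabeled candidates by an upper-confidence-bound score
    summed over all objectives (higher is better), and return the
    top n_pick to send out for (simulated or real) evaluation."""
    scored = []
    for x in candidates:
        means, stds = predict(x)          # one (mean, std) per objective
        score = sum(m + kappa * s for m, s in zip(means, stds))
        scored.append((score, x))
    scored.sort(reverse=True)
    return [x for _, x in scored[:n_pick]]

def toy_predict(x):
    """Toy surrogate: 7 made-up objectives with made-up uncertainties."""
    rng = random.Random(x)
    means = [rng.random() for _ in range(7)]
    stds = [0.1 * rng.random() for _ in range(7)]
    return means, stds

picked = acquire(range(100), toy_predict)
```

In a real campaign the picked candidates would be simulated or synthesized, their labels fed back into the surrogate, and the loop repeated; that outer loop is the "active" part.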
7:14
That has an interesting parallel in my mind to the pharma world, where you have a lot of computational work in the discovery process, but actually getting the drug into people's hands is often the bottleneck for a drug.
8:48
And also, you know, what happens to the drug when it sits on the shelf for three months, that kind of thing. Yeah.
9:04
These metal-organic frameworks, what are the kinds of things that they're used for?
9:12
They're used most in gas storage, sensing and separations. They're used in combination with polymer composites. They have really strong promise for CO2 capture especially. But people have looked at them for catalysis. The limitation on catalysis has been how stable are they. So one of the things we've spent a lot of time on is trying to be able to predict their stability. But they're used for all sorts of things, even drug delivery. What they have the opportunity to do is really place precise chemical groups in specific orientations that can ultimately allow for what's known as host guest interaction. So basically kind of create a glove to have a targeted interaction with a guest molecule in the metal organic framework.
9:18
I see. And just for the non-chemists, are metal-organic frameworks like Legos for chemistry? Is that right?
10:10
Yeah, yeah. Metal organic frameworks, I think are going to be a little bit more of a household name among some engineers because the discoverers of those materials just won the Nobel Prize in chemistry this year. So as much as that can make something in chemistry a household name. But they're basically like Tinker toys or Legos and they have different building blocks that can be combined in basically infinite ways to create very precise chemistry.
10:19
I see.
10:47
Maybe for context, could we see if I can, like, what are the techniques you were using before you started? Or maybe in parallel with machine learning? And how does machine learning help you advance those? Like, what are the roles of the two?
10:48
So I started my career studying what's known as transition metal catalysis. If you look at the periodic table, the middle of it contains a bunch of metals; a good example would be iron. And all of those things sitting in the middle of the periodic table have what's referred to as an open shell. The electrons in those materials are not paired, and they're as a result more reactive. Different combinations of these metals give rise to the catalysts that are used in a large number of transformations, including the things that, say, feed and sustain most of the world's population, such as the Haber-Bosch process for ammonia synthesis. And the way, going back 20, 30, 50 years, that people understood these materials and could enable their rational design is through quantum mechanical modeling. Quantum mechanical modeling, by using approximations to the Schrödinger equation, can be very accurate, but it's very computationally costly. So a single quantum mechanical prediction, depending on the level of fidelity used, could take hours to days to weeks. And that's what I would have normally been doing before I got started in AI. Some of what we do these days is accelerating those quantum mechanical predictions. As well, an area that I'm particularly excited about is that not all quantum mechanical approximations are equal, and you can actually use ML models to predict what the best approximation to use is, depending on the material studied.
11:02
Is that like, closer is better? Or is it not really distance related, in terms of which method is the right method to use?
12:48
Yeah. So it actually turns out to be quite complex. You can't just determine it from heuristics. So in one area we actually use the quantum mechanical wave function as inputs to neural networks to predict what the right method to use is, and learn that mapping.
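The shape of that learning problem, mapping wave-function-derived features to a recommended method, can be sketched with a deliberately tiny stand-in model. The features, the labels, and the nearest-centroid rule below are all illustrative only; the actual work uses neural networks on wave function inputs.

```python
def nearest_centroid_fit(X, y):
    """Fit a nearest-centroid classifier: average the feature vectors
    per label, and classify new points by the closest centroid.
    A trivial stand-in for a neural network, to show the mapping."""
    by_label = {}
    for xi, yi in zip(X, y):
        by_label.setdefault(yi, []).append(xi)
    centroids = {lbl: [sum(col) / len(col) for col in zip(*pts)]
                 for lbl, pts in by_label.items()}

    def predict(x):
        def sq_dist(c):
            return sum((a - b) ** 2 for a, b in zip(x, c))
        return min(centroids, key=lambda lbl: sq_dist(centroids[lbl]))
    return predict

# Made-up features (e.g. wave-function-derived descriptors) and made-up
# labels naming which family of DFT approximation performed best.
X = [[0.1, 0.9], [0.2, 0.8], [0.9, 0.1], [0.8, 0.2]]
y = ["hybrid", "hybrid", "GGA", "GGA"]
recommend = nearest_centroid_fit(X, y)
print(recommend([0.85, 0.15]))  # GGA
```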
12:55
I see that's probably going to be.
13:15
That sounds like the next challenge. But cool, like the 22-atom ligand challenge.
13:16
I have a spicy question I want to ask. So there's a school of thought that says, why should I bother to learn chemistry or physics or whatever when ChatGPT has PhD-level understanding of that anyway? Shouldn't I just focus on being really good at using AI for stuff? So I want to hear your thoughts.
13:24
My personal experience, and this will date itself immediately, is that ChatGPT is super good at Wikipedia-level chemistry knowledge. But one of my favorite things to actually throw at GPT, as an anecdote: I'm really interested in molecular design. Like, how do you find a new ligand that can go into a transition metal complex? What that means is some combination of atoms that is going to bind to the metal and change its properties. And so the thing I constantly do, every time an LLM is updated, is I just ask it, please design me a ligand that has 22 atoms. There are many ligands out there that have 22 atoms, and I say I want it to bind to the metal with two nitrogen atoms. I can never get an answer that has 22 atoms. So then you can try a range and see whether you can at least get something in the range. And that's maybe a trivial thing, but it's something that an expert chemist could do in a second. So there are really good introductions to chemistry that I think you can get through conversations with an LLM; you can get a lot of insight into an area you're unfamiliar with. And for sure, things have improved a lot. Like, when I first tried typing in, you know, which exchange-correlation functional should I use for this type of chemistry, the answers were completely wrong. They looked right, but they were completely wrong. I think things have gotten better because that knowledge is out there on the Internet and it's in the training data. But, backing up a moment, you should learn chemistry well enough to know when these models are right or wrong. And if you don't know any chemistry at all, it's hard to know if you're assessing correctly. But I think there are a lot of things that you don't have time to do a deep dive into that you can now get from, say, an LLM that can augment your knowledge.
But I think you have to start from somewhere and then use it as a tool rather than starting from zero and relying blindly on what an LLM will say. But one of my favorite things, if someone can get in one shot an LLM to generate me a 22 atom ligand, I would love to see it.
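The 22-atom test is easy to verify programmatically. Here is a minimal checker, assuming the candidate comes back as a plain molecular formula string; a real check would parse the actual structure with a cheminformatics toolkit, and this simple parser ignores parentheses and charges.

```python
import re

def count_atoms(formula, element=None):
    """Count atoms in a simple molecular formula like 'C10H8N2'.
    No parentheses or charges -- just enough for a sanity check.
    If `element` is given, count only that element."""
    total = 0
    for sym, num in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        if element is None or sym == element:
            total += int(num) if num else 1
    return total

def passes_22_atom_check(formula):
    """The interview's test: exactly 22 atoms, with at least two
    nitrogen donor atoms available to bind the metal."""
    return count_atoms(formula) == 22 and count_atoms(formula, "N") >= 2

# 2,2'-bipyridine is C10H8N2: only 20 atoms, so it fails the count.
print(passes_22_atom_check("C10H8N2"))   # False
print(passes_22_atom_check("C10H10N2"))  # True
```

A checker like this is also the kind of hard constraint one could wire into an LLM loop: reject and re-prompt until the formula actually satisfies the count.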
13:48
What do you think are the biggest gaps that machine learning has, from your experience? If you were an aspiring ML engineer looking to take on a new problem from the machine learning side, what do you think someone could work on which would really help the chemistry side?
16:07
There are a lot of challenges out there where the data sets aren't large enough or diverse enough, and so I think they've attracted less interest. The ones closest to my heart are reactivity predictions: predicting which reactions will occur and why, especially in complex phenomena like multi-element systems, and predicting those transformations. Another thing that I think there's not enough data on is just more diverse chemical bonding and more diverse chemistry. For me that's transition metals, but there are also questions of warm dense matter, sort of exotic phenomena. We have really good data sets out there for really boring chemistry. Even if you're not a chemist, you're probably familiar with organic molecule data sets and organic molecules binding to proteins; those are the common data sets out there. There are lots of challenges where the physics is much more complex, things like how does matter behave when you shine light on it and excite it into excited states. All sorts of things like that receive relatively little attention because there may not be a benchmark or a leaderboard yet for them. And so maybe it's on us chemists to generate more data sets, so those leaderboards are out there. But there's definitely a lot of interesting chemistry that has received less attention.
16:23
So in the protein world there's CASP, right? And people have been working on that for a while, and this led to AlphaFold; kind of without CASP, AlphaFold probably wouldn't exist. Is there an equivalent to CASP in the materials science world?
17:57
So there are all sorts of repositories of fairly low-fidelity DFT data on crystalline materials, so Materials Project, Open Catalyst Project. These do provide good leaderboards. But one of the limitations is that the data comes from not-very-high-fidelity density functional theory. So I'd say that's a second challenge: all the smartest ML engineers right now are learning on data that is not going to be reflective of experiment. There aren't big experimental data sets. For example, one of the advantages of something like CASP is that it comes from an experimental ground truth, whereas that aspect just isn't available in materials.
18:11
As much as we talked about CASP and the role of CASP in AlphaFold, do you think there is a problem, a way of phrasing this, where we could start collecting data at scale, where we could really have a community challenge which breaks open some open problem in your mind? And maybe even stepping back beyond that: if there were an AlphaFold for materials, what would you want it to do?
18:56
One kind of murky area. So maybe I'm not going to directly answer this question. One murky area for us is that electronic structure calculations are expensive, and they should in principle give you the right answer. They should, from first principles, give you the right answer for how a material is going to behave. And a lot of people are scaling these up right now with machine-learned interatomic potentials trained on that kind of data. And every time someone comes out with a model trained on a new data set, and they call it a foundation potential or foundation model, it looks really good. And then you get it into your lab and you say, okay, I want to use it for this problem I'm really excited about, and it starts doing kind of wacky things, like molecules fall apart. I won't name names, but there was one that made a huge splash this summer, and people started declaring, oh, this method is dead, that method is dead, we're all going to just use these neural network models now. Only, in my hands, the one I'm still not naming is only about five times faster than my fastest DFT calculation on a GPU. And it also doesn't work all the time. So I would say we need a more transparent way of trying to figure out if these models can really replace conventional physics-based modeling. If they could, if I could just give up ever doing a DFT calculation again and just rely on machine-learned potentials, and if they were two orders of magnitude faster than the traditional approach, that would change how we're doing science. But there needs to be a little more rigor on what we consider just fitting data when that data maybe lacks quality. Or there needs to be a little bit tougher a requirement for how we say this model can really replace the physics-based modeling.
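The kind of transparency being asked for can start small: hold out structures you know are stable and compare the machine-learned potential against reference DFT before trusting it. A sketch, with stand-in energy callables; the units (eV/atom), tolerance, and "3x tolerance means catastrophic" rule are all made up for the example.

```python
def validate_potential(structures, mlip_energy, dft_energy, mae_tol=0.05):
    """Compare a machine-learned interatomic potential against reference
    DFT on the structures *you* care about, not just its training set.
    `mlip_energy` and `dft_energy` are stand-in callables (eV/atom here,
    purely illustrative). Any per-structure error above 3x the tolerance
    is flagged as a catastrophic, 'molecules fall apart' style failure."""
    errors = [abs(mlip_energy(s) - dft_energy(s)) for s in structures]
    mae = sum(errors) / len(errors)
    failures = [s for s, e in zip(structures, errors) if e > 3 * mae_tol]
    return {"mae": mae,
            "ok": mae <= mae_tol and not failures,
            "failures": failures}

# Toy stand-ins: energies per structure id; structure 3 is badly wrong.
ref = {1: -3.10, 2: -2.95, 3: -5.40}
fit = {1: -3.08, 2: -2.99, 3: -4.80}
report = validate_potential([1, 2, 3], fit.get, ref.get)
```

The point is the workflow, not the numbers: a model is only a replacement for the physics-based method on the chemistry where a check like this passes.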
19:30
Yeah. So one of our theses is that the interface between bits and atoms is really the bottleneck: the actual activity of trying things in the lab is the bottleneck. And you've addressed that to some extent with active learning, but I think there's also an extent to which pure process and automation, good operational practice, are important things. You can push to automation on the one side, but on the other side that creates brittleness. So how do you think about bridging that gap to experimental chemistry, and using it sort of as nature's computer to figure out things for your design process?
21:30
Yeah. So there are a lot of really smart people working in high-throughput synthesis and experimentation and autonomous labs. The thing that's interesting to me in that space, at least, is that there are some types of experiments that, at least as of the last conference I went to on this, are really hard for autonomous high-throughput experimentation but are really easy for a human, and vice versa. And then there's the serendipity that a human might experience in the lab, which a couple of people have tried to think about: well, how do you introduce that noise into high-throughput experimentation? So I think that's a challenge. Your question also brought to mind another point that I am by no means an expert on, but most people who actually work on getting materials to the device scale, say something that would be in your television, will tell you that it's not just the material, it's the process. And I think we're at ground zero, we're nowhere, when it comes to, well, how do we machine-learn not just the structure and the properties, but also the role that processing plays. I don't think we know anything about how to do that.
22:23
Maybe for non-experts: with protein structure it's really easy to imagine, like, oh, you can see these proteins, and we can run some simulations and see them wiggling around, and the structures look really pretty. What does the data look like for materials science? There are the computations; DFT, I think, gives you something which looks like a crystal structure you can imagine. But is there also experimental data where you can observe that crystal structure, or is it mostly sort of probes where you're measuring individual properties, which are kind of collective and not fine-grained?
23:41
So experimental structures are available. And the example I was giving is something we know is stable, and we've seen a structure of it before, and it will fall apart with some of these models. The challenge here is that what AlphaFold has done really well is predict structures of globular proteins, primarily with 20 natural amino acids. I could actually point to lots of cases where AlphaFold fails too, for more interesting chemistry. The challenge is that you have a lot more than 20 building blocks when it comes to materials. And so there are lots of different ways to think about chemical bonding, and right now no potentials are really robustly encoding all of that bonding, especially with respect to metal-organic bonding.
24:14
Maybe a different way of saying it is, with AlphaFold, AlphaFold is solving ground state structures; it's not looking at dynamics, which is, I think, consistent with some of your statements about needing quantum mechanics for catalysts and enzymes. But you're saying that even at just kind of ground state properties, there are too many parameters, and there's not a clear set of interactions limited to a small number of building blocks.
25:00
The bonding is highly variable across all of materials space. Now, there are simple regions of materials space. You can pick aluminum; aluminum is very boring, and people in the 60s could write down on paper how you need to model aluminum. That's something that is pretty easy to fit a neural network potential to. But then if you want to get over to iron oxide, and then over to high-entropy alloys, there are definitely cases where people are using these methods. But I'd say a big challenge is that when you go to bigger length scales and time scales, there's no real way to know if you're right or wrong. The experimental data is not there. Even interpreting, say, an image of an experimental surface, which you would want to do, requires some degree of interpretation of that image. So it's just hard to know from experiment or from other computations if these types of models are correct. And they're certainly not correct across all of chemical space. And I'd say they could fail more catastrophically than AlphaFold fails, though there are definitely failures of AlphaFold too.
25:27
Switching gears a little bit: I read in a newer paper that you had done some work integrating textual information from papers into your models, so it's kind of the AI that we all know and love right now. Can you talk about what kind of lift that gives the models, and how did you actually do that integration?
26:42
Yeah, so we started, I guess, about five years ago. When we first started doing it, we were just doing sort of standard natural language processing and graph digitization; these days we use LLMs. But the idea is to extract from the literature data sets of properties, wherever people are widely reporting properties. And what we noticed is that there's a lot you can learn from these models. Even on the scale of a few thousand data points, you can do things like predict the temperature at which a MOF will break apart, based on experimental reports. But one of the funniest things we noticed is that you can get the temperature at which a material will break down two ways: one, you can get it from the graph, and two, you can get it from what the authors say about how they interpret the graph. And those two things do not line up. So one of the challenges with literature extraction from papers would be the obvious mistakes people make; you know, no one's perfect. But the other would be that people interpret their results in different ways, and if we're building models based on those interpretations, that's a challenge. In terms of LLMs, they've come a long way in literature extraction, but they're still definitely sensitive to false positives. And I think the amount of time we spend checking on LLMs to make sure the data we're ingesting is accurate is definitely an overhead on those types of workflows.
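A simple way to catch the graph-versus-text disagreements described above is a tolerance check on each extracted record. The tolerance and the toy records below are illustrative; the shape of the check is the point.

```python
def consistent(t_from_figure, t_from_text, tol_kelvin=15.0):
    """Flag records where the decomposition temperature read off a
    reported TGA-style curve disagrees with the value the authors quote
    in the text by more than tol_kelvin (threshold is illustrative).
    Disagreeing rows need a human look before entering a data set."""
    return abs(t_from_figure - t_from_text) <= tol_kelvin

# (name, temperature from figure, temperature from text), all made up.
records = [("MOF-A", 623.0, 630.0), ("MOF-B", 540.0, 585.0)]
needs_review = [name for name, tf, tt in records if not consistent(tf, tt)]
print(needs_review)  # ['MOF-B']
```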
27:01
I see. And what about the way that it might bias the discovery process? Right, because you have this known literature, your job as a chemist kind of sort of is to find new stuff. But if so, if you're emphasizing, if your computational method is pulling in literature, then maybe it's biasing you towards the previously reported results instead of something new.
28:31
You know, one of the ways we try to address that is we try to train a model on that literature, but then apply it to new structures that have never been seen before and try to really look at how far we can extend the model. But we are trying to answer this. In general, there are repositories out there of experimental data where you can have a sense of when it was published, what the structure is, what it was used for. And we're really trying to build generative models on top of that now to try to be able to say, well, if I know about the first 30 years of a field, can a model trained on that predict the next 20? I think that's an open question and what model is best at? You know, and maybe it won't get all of them, but maybe some of those discoveries that we think are new in the most recent 20 years, maybe, maybe some of them are trivial for a model to generalize to, whereas others are not. I think in an ideal case where we have the available literature data and we don't know, we could use uncertainty quantification to then identify, okay, these would be the most interesting materials to get into our data set.
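The "train on the first 30 years, predict the next 20" idea is a time-split evaluation. Here is a sketch with a trivial majority-class stand-in for the model; the records, cutoff, and labels are made up, and a real study would plug in an actual generative or predictive model.

```python
def time_split_eval(records, cutoff_year, train_fn, predict_fn):
    """Retrospective test: train only on entries published before
    cutoff_year, then score how many of the later 'discoveries' the
    model recovers. records are (year, features, label) tuples;
    train_fn and predict_fn are whatever model you want to plug in."""
    train = [(x, y) for yr, x, y in records if yr < cutoff_year]
    test = [(x, y) for yr, x, y in records if yr >= cutoff_year]
    model = train_fn(train)
    hits = sum(predict_fn(model, x) == y for x, y in test)
    return hits / len(test) if test else float("nan")

def train_majority(pairs):
    """Trivial stand-in model: always predict the majority label."""
    labels = [y for _, y in pairs]
    return max(set(labels), key=labels.count)

# Made-up records: (publication year, structure id, property label).
records = [(2001, "a", 0), (2005, "b", 0), (2010, "c", 1),
           (2020, "d", 0), (2021, "e", 1)]
acc = time_split_eval(records, 2015, train_majority, lambda model, x: model)
```

Beating a baseline like the majority rule on the post-cutoff years is the bar a model has to clear before its "new" predictions mean much.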
28:54
I see. On those data sets, just for people who are interested in getting involved: what are some of the best ones to get started with?
29:59
I don't know about the best. We've curated a few thousand data points of metal organic framework thermal stability, as well as metal organic framework activation stability, water stability. Other groups have curated other measures of stability. They're all out there, they're on our website, that kind of thing.
30:06
Awesome.
30:25
Do you imagine it being useful to create an initiative, or a multi-institutional funding source or something, which really tries to get data in a high-throughput, automated way? What would your dream be, if you could organize something which really would drive the field forward in your mind?
30:26
I think the National Science Foundation has one initiative. I've also heard about foundations being interested in putting together cloud labs, things that let users make use of high-throughput automation on demand. I definitely think having user facilities where a computational researcher like me could design an experiment and have it executed would be awesome. Having all that data collected in sort of a public way would be great. You know, the way that research right now gets published into papers, it's very hard to then extract back out; we spend a lot of energy trying to get it back out. And so some of this is a need also for systematization of how results get reported, so that they can be machine-learning-ready from day one when they're published. Some research subfields are trying to do that, but it's not really developed across materials science. But for sure, I think there will be more shared facilities where people can make use of data from high-throughput experimentation, and that would be really awesome. I don't know if it'll come from companies donating equipment, from the National Science Foundation, or from private foundations.
30:49
Yeah, there is a large philanthropic push in the biotech space. It seems like people haven't quite picked up on this as such an important field, especially with things like materials for climate change. You can imagine that in particular being a very important problem where we could use a lot of solutions.
32:11
Yeah, that kind of brings up the question, there's been a ton of very recent materials investment for private companies, startups. Where does that leave in your mind the role of the academic in chemistry?
32:29
I ask myself that all the time, or more so recently, in the past year. In particular, there's a lot of compute that companies have access to that academics don't. So I ask myself, what can we do that's more creative, that doesn't require just brute-force compute? And I think there is a lot of stuff that we can still do, but we have to ask those questions for sure. Microsoft, Meta, those are the companies that have basically infinite resources. And as an academic, I don't have infinite resources, you know, but we have an interest in problems that haven't crossed the radar of those companies yet. And whenever someone poses a problem to me now, versus a few years ago, I try to make sure that we're not just in the process of trying to do something that throwing a lot of compute at would solve.
32:45
Yeah, I think we're kind of running out of time, but we'd like to give you an opportunity for a call to action. What would you like our listeners to know or do? What should they do to get involved, or is there something that you're really passionate about?
33:50
I think I will stick to something kind of niche. There is still a place for chemistry, I will say that. But my group develops a code for transition metal complex structure generation and metal-organic framework screening. It's called molSimplify, or when we're working on MOFs, we call it MOFSimplify. There are website versions of it that you can look up without installing anything, but it's also on Conda and GitHub. And if you do have an interest in transition metal complexes, you know, just try it out. It includes machine learning predictions, but it also makes novel structures. And I'm just really interested to hear if people are using it; I know a lot of companies are using it, but we sort of find out after the fact. So if you're interested in this materials space, I'm definitely interested
34:05
and open to feedback. Great, awesome. Thank you very much. Take care, Doctor.
34:54