Latent Space: The AI Engineer Podcast

🔬Beyond AlphaFold: How Boltz is Open-Sourcing the Future of Drug Discovery

81 min
Feb 12, 2026
Summary

Gabriele Corso and Jeremy Wohlwend from Boltz discuss their journey from AlphaFold 2's breakthrough in protein folding to creating open-source alternatives like Boltz-1 and building a commercial platform for drug discovery. They cover the technical evolution from structure prediction to protein design, the importance of experimental validation, and their mission to democratize access to AI-powered molecular design tools.

Insights
  • Open-source AI models in biology can build thriving communities that accelerate research while still supporting viable commercial products through infrastructure and user experience layers
  • Experimental validation across diverse targets and labs is crucial for establishing credibility in computational biology, requiring significant coordination and partnership efforts
  • The transition from structure prediction to molecular design represents a shift from problems with evolutionary hints to truly novel design challenges requiring different validation approaches
  • Inference-time scaling and ranking models are becoming critical for improving molecular design results, similar to trends in other AI domains
  • Specialized architectures still outperform general transformers in structural biology despite the 'bitter lesson', due to domain-specific physics and geometric constraints
Trends
  • Shift from regression to generative modeling in protein structure prediction
  • Inference-time scaling becoming critical for molecular design quality
  • Open-source foundation models enabling commercial platform businesses
  • Integration of multiple AI models into agentic workflows for drug discovery
  • Experimental validation becoming a competitive differentiator in AI biology
  • Democratization of advanced molecular design tools beyond large pharma
  • Community-driven development accelerating AI biology research
  • Specialized GPU infrastructure becoming essential for molecular screening
  • Collaborative interfaces enabling medicinal chemist adoption of AI tools
  • Expansion from single-chain proteins to complex molecular interactions
Companies
DeepMind
Created AlphaFold 2 and 3, the breakthrough protein folding models that revolutionized structural biology
Boltz
Public benefit company founded by the guests to democratize AI-powered molecular design tools
Isomorphic Labs
DeepMind spinoff that kept AlphaFold 3 proprietary for commercial drug discovery applications
MIT
Academic institution where both guests completed their PhDs and conducted foundational biology AI research
Genesis
Provided computational resources to help complete Boltz-1 model training when academic compute was insufficient
Adaptive Biotechnologies
CRO partner that conducted experimental validation testing for Boltz models across multiple targets
Harvard University
Collaborated through Nick Polizzi's group on developing better benchmarks for protein-small molecule interactions
People
Gabriele Corso
Co-founder of Boltz, MIT PhD graduate who transitioned from theoretical ML to structural biology after AlphaFold
Jeremy Wohlwend
Co-founder of Boltz, MIT PhD graduate focused on generative biology and molecular design validation
Hannes Stark
Boltz team member who developed the innovative atomic encoding approach for simultaneous structure and sequence prediction
Sergey Ovchinnikov
MIT researcher who provided insights into AlphaFold's pairwise architecture and contact prediction mechanisms
Nick Polizzi
Harvard researcher who collaborated on developing improved benchmarks for protein-small molecule binding prediction
Andrew White
Researcher mentioned for discussing the extensive computational efforts that preceded AlphaFold's breakthrough
Tim O'Donnell
Community member who proposed innovative inference-time search techniques for improving antibody-antigen predictions
Devon
CEO of Genesis who provided crucial computational resources to complete Boltz-1 model training
Quotes
"Actually we only trained the big model once. That's how much compute we had. We could only train it once."
Jeremy Wohlwend
"It's impossible to reproduce now. Yeah, yeah, no, that model has gone through such a curriculum that, you know, it's learned some weird stuff. But yeah, somehow, a miracle, it worked out."
Jeremy Wohlwend
"When we say that we design new proteins or we say that we design new molecules, go and bind these particular targets, we should be very clear these are not drugs, these are not things that are ready to be put into a human."
Gabriele Corso
"I think at the end of the day, for people to be convinced, you have to show them something that they didn't think was possible."
Jeremy Wohlwend
"The great thing about kind of structure prediction is that, a bit like CASP was doing, basically the way that you can evaluate these models is that you train the model on structures that were released, across the field, up until a certain time."
Gabriele Corso
Full Transcript
4 Speakers
Speaker A

Actually we only trained the big model once. That's how much compute we had. We could only train it once. And so like while the model was training, we were like finding bugs left and right, a lot of them that I wrote. And like I would. I remember like us like sort of like, you know, doing like surgery in the middle, like stopping the run, making the fix, like relaunching and yeah, we never actually went back to the start. We just like kept training it with like the bug fixes along the way, which was.

0:00

Speaker B

So it's impossible to reproduce now.

0:26

Speaker A

Yeah, yeah, no, that model is like, has gone through such a curriculum that, you know, it's learned some weird stuff. But yeah, somehow a miracle it worked out.

0:29

Speaker C

It's a pleasure to have with us today Gabriele Corso and Jeremy Wohlwend. They recently founded Boltz, a company trying to democratize and bring state-of-the-art structure prediction in biology to, you know, the masses. They were both recent PhD grads from MIT and have been working on all sorts of foundational papers in, like, generative biology. Anyway, pleasure to have you here. Thanks for coming.

0:38

Speaker A

Thank you.

1:04

Speaker C

I guess we're maybe what, six years post AlphaFold 2 right now, which was kind of a big moment, is that right?

1:06

Speaker A

I think. Was it 2021? So, yeah, going on five years.

1:13

Speaker D

Five years.

1:18

Speaker C

Five years, yeah. So maybe for the audience, let's go back to that moment in time and explain: what was this big moment, and why was it interesting? Why was everyone so excited? And I think you two were probably quite excited. So why were you personally excited?

1:18

Speaker D

I would start on kind of why that was interesting from a scientific standpoint. So maybe first, as an introduction for the ones in the audience who are not structural biologists: the idea of structural biology is that we want to try to understand how proteins and other molecules take shape inside our cells and how they interact. And structural biology is sort of this beautiful discipline where we are somehow able to understand these minuscule structures at kind of atomic detail using incredibly complex methods like X-ray crystallography. And the dream of computational biology has always been: can we understand the structures without having to grow these crystals, shoot X-rays, and so on? And so AlphaFold was a real breakthrough in this problem of protein folding, which is trying to understand the structure of a single protein. And to me it was exciting across many dimensions. One, I was a computer scientist, I was working a lot on machine learning, and I saw the impact that work somewhat similar to what I was doing could have on a long-standing scientific problem. And on the second perspective, from a more personal side, seeing the structures coming out of these models, where you see this beautiful creation of life, is something that was very inspiring to me. And so that was one of the things that led me to start working on structural biology, and in particular with machine learning.

1:33

Speaker C

Were you a structural biologist before AlphaFold came out? I mean, you did machine learning, but it was not in structural biology. So that actually shifted your career quite dramatically.

3:26

Speaker D

Yeah, very dramatically. I was working on some pretty theoretical, methodological things, and I was starting to see some of the challenges in doing somewhat theoretical or methodological work, and seeing the potential impact of applied work. AlphaFold was really a machine learning breakthrough, but an applied machine learning one. And so that led me to want to start working on applications.

3:36

Speaker A

Our group at the time was working a lot on small molecules already. And I think AlphaFold is kind of what triggered this shift to working on biologics. And at the time, I think it opened as many questions as it answered. In a sense, the immediate follow-ups were: okay, can we do this on other things than proteins? Can we do interactions of small molecules with proteins, nucleic acids with proteins? Can we model more complex protein systems? And I think very rapidly after AlphaFold, people realized that machine learning could really target this problem very differently than previous methodologies.

4:06

Speaker C

Going back to the AlphaFold 2 moment, I remember this very well. I was at NeurIPS when, I guess, the results of this famous competition came out. So why don't we talk about CASP and what it is and why it was so interesting and exciting.

4:48

Speaker A

I think every couple of years, the goal has always been to find protein structures that are a little bit different from what's known. So CASP over the years has put in a lot of effort to gather structures from academic groups and even industry groups to try to create sort of a test set that would be difficult for different methods. And CASP14 was when AlphaFold 2 really blew everything out of the water. The improvement was so large over the previous methods and also over the previous competitions. And now CASP continues. We've had CASP15, we've had CASP16. And what's happened now is that it's really expanding to also all these other modalities, like I was mentioning: protein with small molecules, nucleic acids. But the goal remains to really challenge the models: how well do these models generalize? And we've seen in some of the latest CASP competitions that, while we've become really, really good at proteins, especially monomeric proteins, other modalities still remain pretty difficult. So it's really essential in the field that there are these efforts to gather benchmarks that are challenging. It keeps us honest about what the models can do or not.

5:05

Speaker C

It's interesting you say that. In some sense, at CASP14, a problem was solved, and pretty comprehensively, right? But at the same time, it was really only the beginning. So can you explain what was the specific problem you would argue was solved, and then what is remaining, which is probably quite open.

6:27

Speaker A

I think we'll steer away from the term solved because we have many friends in the community who get pretty upset at that word, and I think fairly so. But the problem that a lot of progress was made on was the ability to predict the structure of single-chain proteins. So proteins can be composed of many chains, and single-chain proteins are just a single sequence of amino acids. And one of the reasons that we've been able to make such progress is also because we take a lot of hints from evolution. So the way the models work is that they sort of decode a lot of hints that come from evolutionary landscapes. So if you have some protein in an animal and you go find the similar protein across different organisms, you might find different mutations in them. And as it turns out, if you take a lot of these sequences together and you analyze them, you see that some positions in the sequence tend to evolve at the same time as other positions in the sequence, sort of this correlation between different positions. And it turns out that that is typically a hint that these two positions are close in three dimensions. So part of the breakthrough has been our ability to decode that very, very effectively. What it implies also is that in the absence of that co-evolutionary landscape, the models don't quite perform as well. And so I think when that information is available, maybe one could say the problem is somewhat solved from the perspective of structure prediction. When it isn't, it's much more challenging. And I think it's also worth differentiating, because sometimes we confound them a little bit: structure prediction and folding. Folding is the more complex process of actually understanding how it goes from this disordered state into a structured state. And I don't think we've made that much progress on that. But the idea of, like, yeah, going straight to the answer, we've become pretty good at.

6:49

Speaker B

So there's this protein that is like just a long chain and it folds up, and so we're good at getting from that long chain, in whatever form it was originally, to the thing. But we don't know how it necessarily gets to that state. And there might be intermediate states that it's in sometimes that we're not aware of.

8:49

Speaker A

That's right. And that relates also to our general ability to model dynamics. Proteins are not static. They move, they take different shapes based on their energy states. And I think we are also not that good at understanding the different states that a protein can be in, and at what frequency, what probability. So I think the two problems are quite related in some ways. Still a lot to solve. But I think it was very surprising at the time, you know, that even with these evolutionary hints we were able to make such dramatic progress.

9:10

Speaker B

So I want to ask, why does the intermediate states matter? But first I kind of want to understand why do we care what proteins are shaped like?

9:45

Speaker D

Yeah, I mean, proteins are kind of the machines of our body. The way that all the processes in our cells work is typically through proteins, sometimes other molecules, through intermediate interactions. And through those interactions we have all sorts of cell functions. And so when we try to understand a lot of biology, how our body works, how diseases work, we often try to boil it down to: okay, what is going on in the case of normal biological function, and what is going wrong in the disease state? And we boil it down to proteins and other molecules and their interactions. And so when we predict the structure of proteins, it's critical to have an understanding of those interactions. It's a bit like the difference between having a list of parts that you would put in a car and seeing the car in its final form. Seeing the car really helps you understand what it does. On the other hand, going to your question of why we care about how the protein folds, or how the car is made, to some extent: sometimes something goes wrong. There are cases of proteins misfolding in some diseases and so on. If we don't understand this folding process, we don't really know how to intervene.

9:55

Speaker A

There's this nice line, I think it's in the AlphaFold 2 manuscript, where they discuss why we were even hopeful that we could target the problem in the first place. And there's this notion that for proteins that fold, the folding process is almost instantaneous, which is a strong signal that, yeah, we might be able to predict this very constrained thing that the protein does so quickly. And of course, that's not the case for all proteins, and there are a lot of really interesting mechanisms in the cells. But yeah, I remember reading that and thought, yeah, that's somewhat of an insightful point.

11:30

Speaker D

I think one of the interesting things about the protein folding problem is that it used to be studied, and this is part of the reason why people thought it was impossible, as kind of a classical example of an NP problem. There are so many different types of shapes that these amino acids could take, and this grows combinatorially with the size of the sequence. And so there used to be a lot of more theoretical computer science thinking about and studying protein folding as an NP problem. And so it was very surprising, also from that perspective, seeing machine learning succeed. So clearly there is some signal in those sequences, through evolution, but also through other things that we as humans are probably not really able to understand, but that these models have learned.

12:10

Speaker B

So Andrew White, we were talking to him a few weeks ago, and he said that he was following the development of this, and that there were actually ASICs that were developed just to solve this problem, and that there were many, many millions of compute hours spent trying to solve this problem before AlphaFold. And just to be clear, one thing that you mentioned was that there's this kind of co-evolution of mutations, and that you see this again and again in different species. So explain: why does that give us a good hint that they're close by to each other?

13:07

Speaker A

Yeah, like think of it this way, that, you know, if I have some amino acid that mutates, it's going to impact everything around it. Right. In three dimensions. And so it's almost like the protein, through several probably random mutations in evolution ends up sort of figuring out that this other amino acid needs to change as well for the structure to be conserved. So this whole principle is that the structure is probably largely conserved because there's this function associated with it. And so it's really sort of like different positions compensating for each other.

13:40
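The co-evolution signal described above can be sketched numerically: in a toy multiple sequence alignment, positions that mutate together carry high mutual information, while a fully conserved column carries none. Everything here, the tiny alignment and the scoring function, is an illustrative assumption, not the actual feature pipeline of AlphaFold or Boltz.

```python
import numpy as np

def mutual_information(msa, i, j):
    """Mutual information between columns i and j of an MSA.

    msa: list of equal-length sequences (strings of amino-acid letters).
    High MI between two columns is the classic hint that the two
    positions co-evolve and are therefore likely close in 3D.
    """
    n = len(msa)
    pi, pj, pij = {}, {}, {}
    for seq in msa:
        a, b = seq[i], seq[j]
        pi[a] = pi.get(a, 0) + 1 / n          # marginal of column i
        pj[b] = pj.get(b, 0) + 1 / n          # marginal of column j
        pij[(a, b)] = pij.get((a, b), 0) + 1 / n  # joint distribution
    return sum(p * np.log(p / (pi[a] * pj[b])) for (a, b), p in pij.items())

# Toy alignment: columns 0 and 2 always mutate together (A pairs with L,
# G pairs with V), while column 1 is perfectly conserved.
msa = ["AKL", "GKV", "AKL", "GKV", "AKL", "GKV"]

coupled = mutual_information(msa, 0, 2)     # high: co-evolving pair
uncoupled = mutual_information(msa, 0, 1)   # ~0: conserved column
print(coupled > uncoupled)  # True
```

Real contact-prediction methods use corrected statistics (e.g. direct-coupling analysis) rather than raw mutual information, but the underlying signal is the same.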

Speaker B

I see. Those hints in aggregate give us a lot of information about what is close to each other. And then you can start to look at what kinds of folds are possible given the structure, and then what is the end state, and therefore you can make a lot of inferences about what the actual total shape is.

14:16

Speaker A

Yeah, that's right. It's almost like you have this big three-dimensional valley where you're sort of trying to find these low-energy states, and there's so much to search through that it's almost overwhelming. But these hints, they sort of maybe put you in an area of the space that's already kind of close to the solution, maybe not quite there yet. And there's always this question of how much physics are these models learning versus just pure statistics. And I think one of the things, at least I believe, is that once you're in that approximate area of the solution space, then the models have some understanding of how to get you to the low-energy state. And so maybe you have some light understanding of physics, but maybe not quite enough to know how to navigate the whole space well. So we need to give it these hints to get it into the right.

14:34

Speaker B

Valley and then it finds the minimum or something.

15:28

Speaker D

One interesting explanation of how AlphaFold works, which I think is quite insightful, though of course it doesn't cover the entirety of what AlphaFold does, is one I'm going to borrow from Sergey Ovchinnikov at MIT. The interesting thing about AlphaFold is it's got this very peculiar architecture that we have since reused, and this architecture operates on this pairwise context between amino acids. And so the idea is that probably the MSA gives you this first hint about what potential amino acids are close to each other.

15:31

Speaker B

MSA is multiple sequence alignment.

16:06

Speaker D

Exactly, this evolutionary information. And from this evolutionary information about potential contacts, it's almost as if the model is sort of running some kind of Dijkstra-like algorithm where it's sort of decoding: okay, these have to be close; okay, then if these are close and this is connected to this, then this has to be somewhat close. And so you decode this, and that becomes basically a pairwise distance matrix. And then from this rough pairwise distance matrix, you decode the actual potential structure.

16:09
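The decoding intuition in that turn, if this pair is close and that pair is close, then the third pair cannot be far, can be sketched as triangle-inequality propagation over distance upper bounds. This is a toy illustration of the idea, not the model's actual mechanism; the 8 Å contact threshold is an arbitrary assumption.

```python
import numpy as np

def propagate_bounds(n, contacts, contact_dist=8.0, far=1e9):
    """Tighten pairwise distance upper bounds from predicted contacts.

    Each contact (i, j) is treated as an upper bound of `contact_dist`
    on the residue-residue distance; a Floyd-Warshall pass then applies
    the triangle inequality d(i,k) <= d(i,j) + d(j,k) to derive bounds
    between residues that were never directly predicted to be in contact.
    """
    bounds = np.full((n, n), far)
    np.fill_diagonal(bounds, 0.0)
    for i, j in contacts:
        bounds[i, j] = bounds[j, i] = contact_dist
    for j in range(n):
        # broadcast: column j + row j gives all paths through residue j
        bounds = np.minimum(bounds, bounds[:, [j]] + bounds[[j], :])
    return bounds

# Contacts 0-1 and 1-2 imply residues 0 and 2 lie within 16 Å of each
# other, even with no direct 0-2 contact; residue 3 stays unconstrained.
b = propagate_bounds(4, [(0, 1), (1, 2)])
print(b[0, 2])  # 16.0
```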

Speaker B

Interesting. So there's kind of two different things going on in the kind of coarse grain, and then the fine grain optimization is interesting. Yeah, very cool.

16:44

Speaker C

Yeah, you mentioned AlphaFold 3, so maybe it's a good time to move on to that. So AlphaFold 2 came out and it was, I think, fairly groundbreaking for this field. Everyone got very excited. A few years later, AlphaFold 3 came out. And maybe for some more history, what were the advancements in AlphaFold 3? And then I think maybe after that we'll talk a bit about how it connects to Boltz.

16:53

Speaker D

But anyway, yeah, so after AlphaFold 2 came out, Jeremy and I got into the field, and with many others, the clear problem that was obvious after that was: okay, now we can do individual chains. Can we do interactions? Interactions between different proteins, proteins with small molecules, proteins with other molecules.

17:16

Speaker C

And so why are interactions important?

17:38

Speaker D

Interactions are important because, to some extent, that's the way that these machines, these proteins, have a function. The function comes from the way they interact with other proteins and other molecules. Actually, in the first place, the individual machines are often, as Jeremy was mentioning, not made of a single chain but of multiple chains. And then these multiple chains interact with other molecules to give them their function. And on the other hand, when we try to intervene in these interactions, think about a disease, think about a biosensor, or many other settings, we are trying to design molecules or proteins that interact in a particular way with what we would call a target protein, or target. After AlphaFold 2, this became clearly one of the biggest problems in the field to solve, and many groups, including ours and others, started making contributions to this problem of trying to model these interactions. And AlphaFold 3 was a significant advancement on the problem of modeling interactions, and one of the interesting things they were able to do: while some of the rest of the field really tried to model different interactions separately, how proteins interact with small molecules, how proteins interact with proteins, how RNA or DNA take their structure, they put everything together and trained very large models with a lot of advances, including changing some of the key architectural choices, and managed to get a single model that set a new state-of-the-art performance across all of these different modalities. Whether that was protein with small molecules, which is critical to developing new drugs, protein with protein, or understanding interactions of proteins with RNA and DNA, and so on.

17:41

Speaker B

To satisfy the AI engineers in the audience. What were some of the key architectural and data changes that made that possible?

19:41

Speaker D

Yeah, so one critical one, which was not necessarily unique to AlphaFold 3 (there were actually a few other teams in the field, including ours, that proposed this), was moving from modeling structure prediction as a regression problem, where there is a single answer and you're trying to shoot for that answer, to a generative modeling problem, where you have a posterior distribution of possible structures and you're trying to sample from this distribution. And this achieves two things. One, it starts to allow us to model more dynamic systems. As we said, some of these systems can actually take multiple structures, and so you can now model that by modeling the entire distribution. But on the second hand, for more core modeling questions, when you move from a regression problem to a generative modeling problem, you are really tackling the way that you think about uncertainty in the model in a different way. So if the model is undecided between different answers, what's going to happen in a regression model is that it's going to make an average of those different answers it had in mind. When you have a generative model, what you're going to do is sample all these different answers and then maybe use separate models to analyze those different answers and pick out the best. So that was one of the critical improvements. The other improvement is that they significantly simplified the architecture, especially of the final model that takes those pairwise representations and turns them into an actual structure; it now looks a lot more like a traditional transformer than the very specialized equivariant architecture that was in AlphaFold 2.

19:49
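A minimal numeric sketch of the regression-versus-generative point above: with a bimodal posterior, a mean-squared-error regressor averages the two modes into a physically meaningless answer, while sampling plus a separate ranking score lands on a real mode. The one-dimensional "conformation" and the stand-in energy function are assumptions for illustration only.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy setup: a "conformation" is a single number, and the true posterior
# is bimodal: two equally likely states at -1 and +1.
samples_from_posterior = rng.choice([-1.0, 1.0], size=1000)

# A regression model trained with MSE converges to the posterior mean,
# which sits at ~0: between the modes, matching neither real state.
regression_answer = samples_from_posterior.mean()

# A generative model instead draws candidates from the posterior, and a
# separate scoring model ranks them. Here the scorer is a stand-in
# "energy" (lower is better) with minima at the two true states.
def energy(x):
    return min((x - 1.0) ** 2, (x + 1.0) ** 2)

candidates = samples_from_posterior[:20]
best = min(candidates, key=energy)
print(best)  # lands exactly on one of the modes, -1.0 or +1.0
```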

Speaker B

So this is a bitter lesson a little bit.

21:42

Speaker D

There is some aspect of a bitter lesson, but the interesting thing is that it's very far from being a simple transformer. This field is one of the arguably very few fields in applied machine learning where we still have architectures that are very specialized. And there are many people that have tried to replace these architectures with simple transformers. And there is a lot of debate in the field, but I think most of the consensus is that the performance that we get from the specialized architectures is vastly superior to what we get from a simple transformer. Another interesting thing, staying on the modeling and machine learning side, which I think is somewhat counterintuitive coming from some of the other fields and applications, is that scaling hasn't really worked the same in this field. Now, models like AlphaFold 2 and AlphaFold 3 are still very large models, but at the same time, in terms of parameters, they're actually not very big. They are definitely below a billion parameters. If you hear these days in the LLM space about a model with less than a billion parameters, you'd think it can't do anything. But on the other hand, when you look at the computational cost of running these models, they are actually a lot more expensive to run than language models, because, as Jeremy was saying, we go from quadratic operations to cubic operations. And so it's interesting how right now in the field, and this is maybe related to having less data or needing more inductive biases, we have this ratio of amount of computation to parameters that is much, much higher than in other places.

21:45
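The compute-to-parameter point can be made with back-of-envelope arithmetic: per layer, token-level attention scales roughly quadratically in sequence length L, while operations on an L x L pairwise representation (like triangle updates) scale cubically, so the same parameter budget buys far more flops. The numbers below are illustrative scaling factors, not measured costs.

```python
# Rough per-layer cost models (leading terms only, constants dropped):
# token attention mixes L tokens pairwise, pairwise/triangle ops mix
# L*L entries each against L others.

def attention_cost(L, d):
    """~L^2 * d: every token attends to every other token."""
    return L * L * d

def triangle_cost(L, d):
    """~L^3 * d: every (i, j) pair is updated via all intermediates k."""
    return L * L * L * d

L, d = 512, 128  # illustrative sequence length and channel width
ratio = triangle_cost(L, d) / attention_cost(L, d)
print(ratio)  # 512.0: the pairwise stack costs L times more per layer
```

This is why a sub-billion-parameter structure model can be more expensive to run than a much larger language model.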

Speaker C

If I recall, AlphaFold2 is like what, 70 million parameters? Something like that.

23:36

Speaker A

Yeah, it's something like that. It's quite small, around 100 million or so. Yeah.

23:41

Speaker C

These decisions of triangle layers for AlphaFold 2, this interesting equivariant architecture, really were priors that baked in a lot of the physics of the system. And co-evolution data, I think people have argued, is almost like a database lookup of sorts. So that provides, in some sense, more parameters as well.

23:46

Speaker A

Yeah, I mean, definitely the amount of pure compute, the flops, is very high, and it's almost more reasoning-based, maybe, than just information extraction. I think part of the reason the LLMs are so large isn't just because of their reasoning capability, but also because of the sheer quantity of information that they store. I think here there's a little bit less of that, and it's more about decoding this input rather than memorizing as much of it.

24:09

Speaker B

So is there a loop in the architecture that allows it to compute more per parameter? How does that work?

24:39

Speaker D

Part of it is just this fact that instead of having operations that operate on the single chain, they operate on the pairwise representation. And so instead of having a quadratic number of interactions, you have a cubic number of interactions. And so that on its own leads you to have smaller representation sizes but more representations, which leads to more flops but fewer parameters. On the other hand, there is actually also this idea, somewhat similar to reasoning, where you recycle these operations, from AlphaFold 2 but also AlphaFold 3. They have this interesting framework where, as we were discussing, the input to the model is sort of this initial understanding of the interactions, either from the evolution of the multiple sequences, but also potentially from what we call templates, which are basically database lookups of similar structures. And so the way the model works is that it decodes these and tries to understand a good potential rough structure of the pairwise interactions. And then what you can do is basically do this recycling, where you feed this understanding back to the input of the model and then try to decode it again. And people do this three or four times, and in some cases have even tried to do it tens of times. And so you can see it as a very early version of reasoning.

24:46
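The recycling idea above can be sketched as control flow: run the trunk, feed its own output back in as part of the next input, and repeat a few times. The `trunk` below is a stand-in toy (it just moves the current estimate halfway toward a fixed target each pass), purely to show the iterative-refinement loop, not the real network.

```python
import numpy as np

# Pretend "true" pairwise distance map the toy trunk refines toward.
TARGET = np.array([[0.0, 6.0], [6.0, 0.0]])

def trunk(msa_hints, prev_pair):
    # A real trunk combines fresh inputs (MSA, templates) with the
    # recycled pairwise representation; this toy only refines the
    # recycled part, halving its error on each pass.
    return prev_pair + 0.5 * (TARGET - prev_pair)

def predict(msa_hints, n_recycles=4):
    pair = np.zeros_like(msa_hints)
    for _ in range(n_recycles):
        pair = trunk(msa_hints, pair)  # previous output re-enters as input
    return pair

hints = np.array([[0.0, 10.0], [10.0, 0.0]])
print(predict(hints, n_recycles=4)[0, 1])  # 5.625, approaching 6.0
```

Each extra recycle tightens the estimate, which is why running more recycles can be viewed as a primitive form of inference-time reasoning.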

Speaker C

Yeah, so AlphaFold 2, really cool. AlphaFold 3, really cool. But AlphaFold 3 came with a catch, and I think this catch was important for the development of Boltz and so on.

26:17

Speaker D

Yeah, the catch was that it was an amazing paper, a Nature paper, but unfortunately they decided not to release the model. AlphaFold 2 was open source and has since been used by, I think the reported number is, more than a million scientists. AlphaFold 3, for commercial reasons, DeepMind, which has since spun off Isomorphic Labs, which is now trying to become sort of a new pharmaceutical company, decided to keep internal and only use internally. And we were in the field and building on top of models like AlphaFold, and so now we no longer had the base starting point to build on top of. But even more importantly, everyone in both academic research and in industry no longer had access to these incredible models that were really useful for trying to understand biology, but also for trying to develop new therapeutics. We decided to take the matter into our own hands and try to obtain a model of similar accuracy. And so, largely using a lot of the information that was in the AlphaFold 3 manuscript, we went ahead and built Boltz-1, which was the first fully open-source model to approach the level of accuracy of AlphaFold 3. And along the way, and we can talk about it more, we realized that it was probably too ambitious to see this as an academic project, and there were a lot of things that were missing. And so we decided to also start a public benefit company to push this mission of democratizing access to these models that we started with Boltz-1.

26:30

Speaker C

Quick interjection. I mean, I remember this. It was actually shocking how fast you got Boltz-1 out. It was just like two or three months, right?

28:27

Speaker A

I think we started in late May and it came out in November, if I remember correctly, so slightly longer. But yeah, it was relatively quick. I mean, for what it's worth, we were working on some similar ideas at the time. I think, for example, this idea of having a diffusion model on top of this pairwise trunk was something that we were exploring independently. Now, when the paper came out, it was really clear, especially for example on the data pipelines, there was so much that we were not really doing. And so there was a lot to catch up on. But we were already in a place, I think, where we had some experience working with the data and working with these types of models. And that put us already in a good place to produce it quickly. And I would even say I think we could have done it quicker. The problem was for a while we didn't really have the compute, and so we couldn't really train the model. And actually we only trained the big model once. That's how much compute we had. We could only train it once. And so while the model was training, we were finding bugs left and right, a lot of them that I wrote. And I remember us sort of doing surgery in the middle, stopping the run, making the fix, relaunching. And yeah, we never actually went back to the start. We just kept training it with the bug fixes along the way, which was interesting.

28:36

Speaker B

Impossible to reproduce now.

30:02

Speaker A

Yeah, yeah, no, that model has gone through such a curriculum that, you know, it's learned some weird stuff. But somehow, by a miracle, it worked out.

30:04

Speaker D

The other funny thing is that we trained most of that model on a cluster from the Department of Energy, which is a shared cluster that many groups use. So we would basically train the model for two days, then go back into the queue and wait a week. It was pretty painful. Toward the end, I was talking with Devon, the CEO of Genesis, telling him a bit about the project and about this frustration with the compute. Luckily he offered to help, and so we got help from Genesis to finish up the model. Otherwise it probably would have taken another few weeks at least.

30:13

Speaker A

Yeah, yeah.

30:57

Speaker B

Boltz-1, how did that compare to AlphaFold 3? And then there's some progression from there.

30:58

Speaker D

Yeah, so I would say Boltz-1, and also the other set of models that came out around the same time, were a big leap from the previous open-source models, really approaching the level of AlphaFold 3. But I would still say that, even to this day, there are some specific instances where AlphaFold 3 works better. One common example is antibody-antigen prediction, where AlphaFold 3 still seems to have an edge in many situations. Obviously these are somewhat different models; you run them, you obtain different results. So it's not always the case that one model is better than the other, but in aggregate, especially at the time, AlphaFold 3 still had a bit of an edge.

31:07

Speaker B

We should talk about this more when we talk about BoltzGen. But how do you know one model is better than the other? I make a prediction, you make a prediction. How do you know?

32:03

Speaker D

Yeah. So that's the great thing about structure prediction; once we get into the design space of designing new small molecules and new proteins, this becomes a lot more complex. But the great thing about structure prediction is that, a bit like CASP was doing, the way you can evaluate models is to train on the structures that were released across the field up until a certain time. One of the things we haven't talked about that was really critical in all this development is the PDB, the Protein Data Bank. It's this common resource, a common database where every biologist publishes their structures. So we can train on all the structures that were deposited in the PDB until a certain date, and then look at recent structures and ask: which of these look pretty different from anything published before? Because we really want to understand generalization. And on those new structures, we evaluate all these different models.
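The temporal split described here can be sketched in a few lines. This is a hedged toy version, not Boltz's actual data pipeline: the entry fields, cutoff date, and "novelty" test (no shared sequence cluster with the training set) are all illustrative stand-ins.

```python
from datetime import date

def temporal_split(entries, cutoff, is_novel):
    """Split PDB-like entries into a train set (released before cutoff) and
    an eval set of post-cutoff structures unlike anything in train."""
    train = [e for e in entries if e["release_date"] < cutoff]
    recent = [e for e in entries if e["release_date"] >= cutoff]
    # Keep only recent structures dissimilar to everything in train,
    # so the benchmark measures generalization rather than memorization.
    test = [e for e in recent if is_novel(e, train)]
    return train, test

# Toy data: "novel" here means no shared sequence-cluster ID with train.
entries = [
    {"id": "1ABC", "release_date": date(2020, 1, 1), "cluster": 7},
    {"id": "2DEF", "release_date": date(2023, 6, 1), "cluster": 7},   # similar to train
    {"id": "3GHI", "release_date": date(2023, 6, 1), "cluster": 42},  # genuinely new
]

def no_shared_cluster(e, train):
    return all(e["cluster"] != t["cluster"] for t in train)

train, test = temporal_split(entries, date(2021, 9, 30), no_shared_cluster)
# train keeps 1ABC; 2DEF is recent but too similar; only 3GHI lands in test
```

The point of the novelty filter is exactly what's said above: without it, a recent structure that closely resembles training data would make memorization look like generalization.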

32:12

Speaker B

So you just need to know when AlphaFold 3 was trained, and you intentionally train to the same cutoff, or something like that.

33:17

Speaker D

Exactly.

33:23

Speaker B

Right, yeah.

33:23

Speaker D

And so this is the way you can somewhat easily compare these models. Obviously, that assumes that, you know, the training set...

33:24

Speaker C

You've always been very passionate about validation. I remember DiffDock, and then there was DiffDock-L and DockGen. You've thought very carefully about this in the past. Actually, I think DockGen is a really funny story. I don't know if you want to talk about that; it's interesting.

33:32

Speaker D

Yeah. I think one of the amazing things about putting things out open source is that we get a ton of feedback from the field. Sometimes we get great feedback of people really liking the model, but honestly, most of the time, and maybe this is also the most useful feedback, it's people sharing where it doesn't work. At the end of the day, that's critical. And this is true across other fields of machine learning: to make progress, you set clear benchmarks, and as you start making progress on certain benchmarks, you need to improve the benchmarks and make them harder and harder. That's how the field operates. The example of DockGen: we published this initial model called DiffDock in my first year of PhD, which was one of the early models to try to predict binding interactions between proteins and small molecules, about a year after AlphaFold 2 was published. On the one hand, on the benchmarks we were using at the time, DiffDock was doing really well, outperforming some of the traditional physics-based methods. But on the other hand, when we started giving these tools to biologists (one example was our collaboration with the group of Nick Polizzi at Harvard), we noticed a clear pattern: for proteins that were very different from the ones we trained on, the model was struggling. It seemed clear that this was where we should put our focus. So we first developed a new benchmark with Nick and his group, and then asked: okay, what can we change about the current architecture to improve this pattern of generalization? And that's the same thing we're still doing today. Where does the model not work? Once we have that benchmark, we try to throw every idea we have at the problem.

33:50

Speaker A

There's a lot of healthy skepticism in the field, which I think is great. And I think it's very clear that there's a ton of things the models don't really work well on. But I think one thing that's probably undeniable is just the pace of progress and how much better we're getting every year. And so I think if you assume any constant rate of progress moving forward, I think things are going to look pretty cool at some point in the future.

36:15

Speaker C

ChatGPT was only three years ago.

36:42

Speaker A

Yeah, I mean, it's wild, right? It's one of those things where, even being in the field, you don't see it coming. And hopefully we'll continue to have as much luck as we've had the past few years.

36:44

Speaker B

So this is maybe an aside, but I'm really curious. You get this great feedback from the community by being open source, right? My question is partly, okay, if you open source, then everyone can copy what you did. But it's also maybe about balancing priorities, right? The community says, I want this, there are all these problems with the model. And meanwhile, well, my customers don't care.

36:56

Speaker C

Right.

37:23

Speaker B

So how do you think about that?

37:23

Speaker D

Yeah, so I would say a couple of things. One is that part of our goal with Boltz, and this is also established as the mission of the public benefit company we started, is to democratize access to these tools. But one of the reasons we realized Boltz needed to be a company, that it couldn't just be an academic project, is that putting a model on GitHub is definitely not enough to get chemists and biologists across academia, biotech, and pharma to use your model in their therapeutic programs. A lot of what we think about at Boltz, beyond just the models, is all the layers that come on top of the models to get from those models to something that can really enable scientists in industry. That means building the right workflows that take in the data and directly answer the questions the chemists and biologists are asking, and also building the infrastructure. All this to say that even with models fully open, we see a ton of potential for products in the space. And a critical part of a product is that even with an open-source model, running the model is not free. As we were saying, these are pretty expensive models. And especially these days, and maybe we'll get into this, we're seeing pretty dramatic inference-time scaling of these models, where the more you run them, the better the results are. At that point, compute cost becomes a critical factor. So putting a lot of work into building the right infrastructure and optimizations really allows us to provide a much better service than the raw open-source models.

That said, even though with a product we can provide a much better service, I do still think, and we will continue to put a lot of our models out as open source, because the critical role of open-source models is helping the community make progress on the research, from which we all benefit. So on the one hand, we'll continue to open source some of our base models so the field can build on top of them, and as we discussed earlier, we learn a ton from the way the field uses and builds on our models. On the other hand, we'll try to build a product that gives the best possible experience to scientists, so that a chemist or a biologist doesn't need to spin up a GPU and set up our open-source model in a particular way. Even though I am a computer scientist, a machine learning scientist, I don't necessarily take an open-source LLM and spin it up myself; I just open the ChatGPT app or Claude Code and use it as an amazing product. We want to give the same experience to scientists.

37:25

Speaker B

I heard a good analogy yesterday: a surgeon doesn't want the hospital to design a scalpel, right? They just buy the scalpel.

40:40

Speaker A

You wouldn't believe the number of people, even in my short time between AlphaFold 3 coming out and the end of my PhD, who would reach out just for us to run AlphaFold 3 for them, or Boltz in our case, just because it's not that easy to do if you're not a computational person. Part of the goal here is that we obviously continue to build an interface for computational folks, but the models are also accessible to a larger, broader audience. And that comes from good interfaces and things like that.

40:50

Speaker C

I think one really interesting thing about Boltz is that with the release, you didn't just release a model, you created a community. And that community grew very quickly. Did that surprise you? What has the evolution of that community been, and how has it fed into Boltz?

41:27

Speaker A

If you look at its growth, it's very much that when we release a new model, there's a big jump. But yeah, it's been great. We have a Slack community with thousands of people in it, and it's actually self-sustaining now, which is the really nice part, because it's almost overwhelming to answer everyone's questions and help; it's really difficult with the few people we were. But it ended up that people would answer each other's questions and sort of help one another. So the Slack has been self-sustaining, and that's been really cool to see. That's the Slack part, but also on GitHub we've had a nice community. I think we aspire to be even more active on it than we've been in the past six months, which has been a bit challenging for us. But the community has been really great, and a lot of papers have also come out with new evolutions on top of Boltz. It surprised us to some degree, because there are a lot of models out there, and people converging on ours was really cool. I think it also speaks to the importance, when you put code out, of putting a lot of emphasis on making it as easy to use as possible. That's something we thought a lot about when we released the code base. It's far from perfect, but, you know.

41:45

Speaker B

Do you think that was one of the factors that caused your community to grow? Just the focus on ease of use, making it accessible?

43:07

Speaker A

I think so, yeah. And we've heard it from a few people over the years now. Some people still think it should be a lot nicer, and they're right. But yeah, I think at the time it was maybe a little bit easier than other things.

43:13

Speaker D

The other part that I think led to the community, and to some extent to the trust in what we put out, is the fact that it hasn't really been just one model, and maybe we'll talk about it. After Boltz-1, there were maybe another couple of models released or open sourced. Soon after, we continued that open-source journey and released Boltz-2, where we were not only improving structure prediction but also starting to do affinity prediction: understanding the strength of the interaction between these different molecules, which is a critical property you often want to optimize in discovery programs. And then, more recently, also a protein design model. So we've been building this suite of models that come together and interact with one another, where there is almost an expectation, which we take very much to heart, of always having the best, or among the best, models out there across the entire suite of tasks, so that our open-source tools can be the go-to models for everybody in the industry.

43:29

Speaker C

I really want to talk about BoltzGen, but before that, one last question in this direction. Was there anything about the community that surprised you? Was someone doing something where you thought, why would you do that, that's crazy? Or, that's actually genius, I never would have thought of that?

44:46

Speaker A

I mean, we've had many contributions. One of the interesting ones: we had this one individual who wrote a complex GPU kernel for part of the architecture. The funny thing is that piece of the architecture had been there since AlphaFold 2, and I don't know why it took Boltz for this person to decide to do it, but that was a really great contribution. We've had a bunch of others: people figuring out ways to hack the model to do cyclic peptides. I don't know if there's any other.

45:01

Speaker D

Interesting one, cool one. This was something initially proposed as a message in the Slack channel by Tim O'Donnell. There are some cases, for example the antibody-antigen interactions we discussed, where the models don't necessarily get the right answer. What he noticed is that the models were somewhat stuck predicting the antibody to interact with a part of the antigen that was incorrect. So he ran an experiment: in this model you can condition, you can give hints. He basically gave systematic hints to the model: okay, you should bind to the first residue, or you should bind to the 11th residue, or the 21st residue, and so on, every 10 residues. Residues are the amino acids, so the first amino acid, the 11th amino acid, and so on. It's like doing a scan across the entire antigen, conditioning the model on each hint, then looking at the confidence of the model in each of those cases and taking the top one. It's a very crude way of doing inference-time search, but surprisingly, for antibody-antigen prediction it actually helped quite a bit. There are some interesting ideas where, as the developer of the model, you say, wow, why would the model be so dumb? But it's very interesting, and it leads you to start thinking: okay, how can I do this not with brute force, but in a smarter way? And so we've also done a lot.
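The scanning trick described above is simple enough to sketch. This is a hedged illustration: `predict_with_hint` is a hypothetical stand-in for a Boltz-style conditional predictor that returns a structure and a confidence score, not a real API.

```python
def scan_epitopes(antigen_len, predict_with_hint, stride=10):
    """Condition the model on a binding hint at every `stride`-th antigen
    residue (1, 11, 21, ...), then keep the most confident prediction."""
    best = None
    for offset in range(0, antigen_len, stride):
        residue = offset + 1  # 1-indexed residue hint
        structure, confidence = predict_with_hint(binding_residue=residue)
        if best is None or confidence > best[2]:
            best = (residue, structure, confidence)
    return best  # (residue hint, structure, confidence) of the top run

# Toy predictor: pretend the true epitope is near residue 31, so hints
# close to it yield higher confidence.
def fake_predict(binding_residue):
    confidence = 1.0 - abs(binding_residue - 31) / 100
    return f"structure@{binding_residue}", confidence

hint, structure, conf = scan_epitopes(100, fake_predict)
# The scan recovers residue 31 as the most confident binding site
```

This is exactly the "crude inference-time search" being described: no gradient, no learned search policy, just enumerate conditionings and rank by the model's own confidence.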

45:40

Speaker A

Of work in that direction. That speaks to the power of scoring, which we're seeing a lot; I'm sure we'll talk about it more when we get to BoltzGen. Our ability to take a structure and determine that that structure is good, somewhat accurate, whether that's a single chain or an interaction, is a really powerful way of improving the models. If you can sample a ton, and you assume that if you sample enough you're likely to have the good structure in there, then it really just becomes a ranking problem. Part of the inference-time scaling that Gabriele was talking about is very much that: the more we sample, the more the ranking model ends up finding something it really likes. So I think our ability to get better at ranking is also what's going to enable the next big breakthroughs.
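The "sample a ton, then rank" framing can be captured in a minimal best-of-N sketch. Both pieces here are toy stand-ins: `samples` pretends to be draws from a diffusion sampler and `score` pretends to be a ranking or confidence model; neither reflects Boltz's actual interfaces.

```python
import random

def rank_candidates(samples, score):
    """Return candidate designs sorted best-first by the ranking model."""
    return sorted(samples, key=score, reverse=True)

random.seed(0)
# Pretend each sample is a structure whose quality the ranker can score.
samples = [random.uniform(0, 1) for _ in range(64)]
score = lambda s: s

# Inference-time scaling in one line: the best of 64 samples is at least
# as good as the best of the first 8, which is at least the first sample.
assert rank_candidates(samples[:64], score)[0] >= \
       rank_candidates(samples[:8], score)[0] >= samples[0]
```

The monotonicity in the final assertion is the whole argument: as long as the ranker is any good, more samples can only raise the quality of the top-ranked candidate, which is why ranking quality becomes the bottleneck.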

47:23

Speaker B

Interesting. But my understanding is there's a diffusion model, you generate some stuff, and then, I guess it's just what you said, you rank it using a score, and then you finally pick. Can you talk about those different parts?

48:17

Speaker D

Yeah. First of all, one of the critical beliefs we had when we started working on Boltz-1 was that structure prediction models are somewhat our field's version of foundation models: they learn how proteins and other molecules interact, and we can leverage that learning to do all sorts of other things. With Boltz-2, we leveraged that learning to do affinity prediction: understanding, if I give you this protein and this small molecule, how tight the interaction is. For BoltzGen, what we did was take the foundation model and fine-tune it to design entirely new proteins. The way that works is that, for the protein you're designing, instead of feeding in an actual sequence, you feed in a set of blank tokens, and you train the model to predict both the structure of that protein and, with the structure, what its different amino acids are. So the way BoltzGen operates is that you feed in a target, a protein (or DNA, or RNA) that you may want to bind to, and then you feed in a high-level design specification of what you want your new protein to be. For example, it could be an antibody with a particular framework, it could be a peptide, it could be many other things.

48:34

Speaker B

And is that with natural language, or...

50:10

Speaker D

That's basically prompting. We have this sort of spec that you specify, and you feed the spec to the model. The model translates it into a set of tokens, a set of conditioning inputs plus a set of blank tokens, and then, as part of the diffusion model, decodes a new structure and a new sequence for your protein. Then we take that and, as Jeremy was saying, try to score how good a binder it is to the original target.

50:12

Speaker B

That you give it. So you're basically using Boltz to predict the folding and the affinity to that molecule, and that gives you a score.

50:51

Speaker D

Exactly. So you use this model to predict the structure, and then you do two things. One is that you predict the structure with something like Boltz-2 and then compare that structure with what BoltzGen predicted. In the field this is called consistency: you want to make sure that the structure you're predicting is actually what you're trying to design, and that gives you much better confidence that it's a good design. So that's the first filter. The second filter in the released BoltzGen pipeline is that we look at the confidence the model has in the structure. Now, unfortunately, going to your question about predicting affinity, confidence is not a very good predictor of affinity. One of the things where we've actually made a ton of progress since we released BoltzGen, and we have some new results we're going to announce soon, is the ability to get much better hit rates when, instead of relying on the confidence of the model, we directly try to predict the affinity of the interaction.
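The two filters described above can be sketched as follows. This is an assumption-laden toy: `refold` stands in for re-predicting the designed sequence with a model like Boltz-2, and the RMSD and confidence cutoffs are illustrative values, not those of the released pipeline.

```python
import math

def rmsd(a, b):
    """Root-mean-square deviation between two matched 3D coordinate lists."""
    assert len(a) == len(b)
    sq = sum((ax - bx) ** 2 + (ay - by) ** 2 + (az - bz) ** 2
             for (ax, ay, az), (bx, by, bz) in zip(a, b))
    return math.sqrt(sq / len(a))

def passes_filters(design, refold, rmsd_cutoff=2.0, conf_cutoff=0.8):
    """Apply the two-stage filter: self-consistency, then model confidence."""
    refolded_coords, confidence = refold(design["sequence"])
    # 1) Consistency: re-predicting the designed sequence should recover
    #    (roughly) the structure the design model intended.
    consistent = rmsd(design["coords"], refolded_coords) < rmsd_cutoff
    # 2) Confidence: the predictor should itself be sure of that structure.
    return consistent and confidence > conf_cutoff

# Toy design with two atoms; the toy refold returns a near-identical,
# confidently predicted structure, so the design passes both filters.
design = {"sequence": "TOYSEQ", "coords": [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0)]}
def toy_refold(seq):
    return [(0.1, 0.0, 0.0), (1.4, 0.1, 0.0)], 0.9
```

Note the asymmetry the speakers point out: this pipeline can tell you a design folds as intended and is confidently predicted, but neither check says anything direct about binding affinity, which is why the follow-up work predicts affinity explicitly.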

51:03

Speaker B

Okay, just backing up a minute. So your diffusion model actually predicts not only the protein sequence, but also its folding?

52:23

Speaker D

Exactly. And one of the big things we did differently compared to other models in the space, and there were some papers that had done this before, but we really scaled it up, was to somewhat merge structure prediction and sequence prediction into almost the same task. The way BoltzGen works is that the only thing you're doing is predicting the structure, so the only supervision we give is supervision on the structure. But because the structure is atomic, and the different amino acids have different atomic compositions, from the way the model places the atoms we recover not only the structure but also the identity of the amino acid the model believed was there. So instead of having two supervision signals, one discrete and one continuous, that somewhat don't interact well together, we built an encoding of sequences in structures that lets us use exactly the same supervision signal we use for Boltz-2, which is largely similar to what AlphaFold 3 proposed and is very scalable, and use that to design new proteins.
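The core idea, that amino-acid identity can be read off from which side-chain atoms get placed, can be illustrated with a lookup. The per-residue heavy-atom lists below are standard chemistry for these four residues, but the decoding itself is a deliberate simplification of BoltzGen's actual all-atom encoding, which operates on placed coordinates rather than name sets.

```python
# Heavy side-chain atoms for a few residue types (PDB atom naming).
SIDE_CHAIN_ATOMS = {
    "GLY": (),            # glycine has no side-chain heavy atoms
    "ALA": ("CB",),
    "SER": ("CB", "OG"),  # serine: beta carbon + hydroxyl oxygen
    "CYS": ("CB", "SG"),  # cysteine: beta carbon + thiol sulfur
}

def identity_from_atoms(placed_atoms):
    """Recover residue identity from the set of side-chain atoms the model
    chose to place: the composition itself encodes the discrete sequence."""
    placed = tuple(placed_atoms)
    for name, atoms in SIDE_CHAIN_ATOMS.items():
        if atoms == placed:
            return name
    return None  # ambiguous or unknown composition

# Placing CB+OG is, by composition alone, a serine.
assert identity_from_atoms(("CB", "OG")) == "SER"
```

This is why a purely structural (continuous) loss can still supervise the (discrete) sequence: once atom placement determines composition, predicting atoms is predicting amino acids.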

52:31

Speaker B

Interesting.

53:58

Speaker A

Maybe a quick shout out to Hannes Stark on our team, who did all this work.

53:58

Speaker C

Yeah, that was a really cool idea. Looking at the paper, there's this encoding where you add a bunch of, I guess, generic atoms, which can be anything, and then they get sort of rearranged and basically placed on top of each other, and that encodes what the amino acid is. There's a unique way of doing this. It was such a cool, fun idea.

54:04

Speaker A

I think that idea had existed before. Yeah, there were a couple of papers.

54:29

Speaker D

That had proposed this, and Hannes really took it to the large scale in the paper.

54:34

Speaker B

A lot of the BoltzGen paper is dedicated to validation of the model. In my opinion, all the people we talk to feel that wet-lab, real-world validation is the whole problem, or not the whole problem, but a big giant part of it. So can you talk about some highlights from there? Because to me the results are impressive, both from the perspective of the model and from the sheer effort that went into the validation by a large team.

54:40

Speaker D

First of all, I should start by saying that both when we were at MIT, working with Tommi Jaakkola and in Regina Barzilay's lab, and at Boltz, we're not a bio lab, and we're not a therapeutics company. So to some extent we were forced to look outside our group, our team, for the experimental validation. One of the things Hannes on the team really pioneered was the idea: can we go not just to one specific group with one specific system, where you might overfit a bit to that system while validating, but instead test these models across a very wide variety of settings, so it's meaningful for anyone in the field? Protein design is such a wide task, with all sorts of applications from therapeutics to biosensors and many others. So can we get validation that spans many different tasks? He basically put together, I think, something like 25 different academic and industry labs that committed to testing some of the designs from the model, some of this testing is still ongoing, and to giving the results back to us, in exchange for hopefully getting some great new sequences for their task. He was able to coordinate this very wide set of scientists, and already in the paper we shared results from, I think, eight to ten different labs: designing peptides targeting ordered proteins, peptides targeting disordered proteins, proteins that bind to small molecules, and nanobodies, across a wide variety of targets. That gave the paper a lot of validation for the model, validation of real breadth.

55:18

Speaker B

And would those be therapeutics for those animals, or are they relevant to humans as well?

57:39

Speaker D

They're relevant to humans as well. Obviously you need to do some work to, quote unquote, humanize them, making sure they have the right characteristics so they're not toxic to humans and so on. There are some approved medicines on the market. There are...

57:44

Speaker A

There's a general pattern, I think, in trying to design things that are smaller: they're easier to manufacture. At the same time, that comes with other potential challenges, maybe a little less selectivity than something with a larger binding interface. But there's this big desire to design mini-proteins, nanobodies, and small peptides, which are just great drug modalities.

58:02

Speaker B

Okay, I think we left off talking about validation in the lab, and I was very excited about seeing all the diverse validations you've done. Can you go into more detail about some specific ones?

58:27

Speaker A

Yeah, the nanobody one, I think we did. What was it, 15 targets. Is that correct?

58:43

Speaker D

14.

58:49

Speaker A

14 targets tested. Typically the way this works is that we make a lot of designs, on the order of tens of thousands, then we rank them and pick the top N; in this case N was 15 for each target. Then we measure success rates both on how many targets we were able to get a binder for, and, more generally, out of all the binders we designed, how many actually proved to be good binders. Some of the other ones: we had a cool one where there was a small molecule and we designed a protein that binds to it, which has a lot of interesting applications, for example the biosensing that Gabriele mentioned. We had a disordered protein, I think you mentioned that also. And yeah, those were maybe some of the highlights.

58:49

Speaker D

Yeah. So the way we structured some of those validations was, on the one hand, validations across a whole set of problems that the biologists we were working with came to us with. For example, in some of the experiments we were designing peptides that would target RAX C, a target involved in metabolism. We had a number of other applications where we were designing peptides or other modalities against other therapeutically relevant targets, and we designed some proteins to bind small molecules. Then some of the other testing was really about getting a broader sense of how the model performs, especially under generalization. One of the things we found across the field was that a lot of validation, outside of validation done on specific problems, was done on targets that have many known interactions in the training data. So it's always a bit hard to understand how much these models are really just regurgitating, or imitating, what they've seen in the training data versus really being able to design new proteins. So one of the experiments we did was to take nine targets from the PDB, filtered down to cases where there is no known interaction in the PDB: the model has never seen this particular protein, or a similar protein, bound to another protein. So there is no way the model can just tweak something from its training set and imitate a known interaction. We took those nine proteins, worked with a CRO, Adaptive, and tested 15 mini-proteins and 15 nanobodies against each one. The very cool thing we saw was that on two thirds of those targets, from just those 15 designs, we got nanomolar binders.

Nanomolar, roughly speaking, is a measure of how strong the interaction is; a nanomolar binder has approximately the binding strength you need for a therapeutic.
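The two success metrics mentioned (per-target hit rate and per-design hit rate) are worth separating explicitly. The sketch below uses the campaign shape from the conversation (9 novel targets, 15 designs each), but the per-target hit counts are invented purely to illustrate the arithmetic; only the "two thirds of targets hit" figure echoes the transcript.

```python
def campaign_stats(results, designs_per_target=15):
    """results maps target -> number of tested designs that bound.
    Returns (fraction of targets with >=1 binder, fraction of all
    designs that bound)."""
    targets_hit = sum(1 for hits in results.values() if hits > 0)
    total_binders = sum(results.values())
    total_designs = designs_per_target * len(results)
    return targets_hit / len(results), total_binders / total_designs

# Hypothetical outcome for 9 novel targets: 6 of 9 (two thirds) yield
# at least one binder out of 15 designs each.
results = {f"T{i}": hits for i, hits in enumerate([3, 0, 1, 2, 0, 4, 1, 0, 2])}
target_rate, design_rate = campaign_stats(results)
# target_rate == 2/3; design_rate == 13/135 in this made-up example
```

The distinction matters because a campaign can have a low per-design hit rate while still solving most targets, and for a design tool, "can I get at least one good binder per target" is usually the metric that counts.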

59:43

Speaker C

Yeah. So maybe switching directions a bit: Boltz Lab was just announced this week, or was it last week? This is your first, I guess, product, if you want to call it that. Can you talk about what Boltz Lab is and what you hope people take away from it?

1:02:21

Speaker A

Yeah. As we mentioned at the very beginning, the goal with the product has been to address what the models don't do on their own. There are largely two categories there; actually, I'll split it into three. The first one: it's one thing to predict a single interaction, a single structure; it's another to very effectively search a design space to produce something of value. What we found building this product is that there are a lot of steps involved, and we need to accompany the user through them. One of those steps, for example, is the creation of the target itself: how do we make sure the model has a good enough understanding of the target so we can design something? And there are all sorts of tricks you can do to improve a particular structure prediction. So that's the first stage. Then there's the stage of designing and searching the space efficiently. For something like BoltzGen, you design many things and then rank them. For small molecules, the process is a little more complicated: we also need to make sure the molecules are synthesizable. The way we do that is with a generative model that learns to use appropriate building blocks, so that it designs within a space we know is synthesizable. So there's this whole pipeline of different models involved in being able to design a molecule. That's the first part. We call them agents; we have a protein design agent and a small molecule design agent, and that's really the core of what powers the Boltz Lab platform.

1:02:43

Speaker B

So these agents, are they a language model wrapper, or are they just your models, and you're calling them agents because they sort of perform a function on behalf of the user?

1:04:22

Speaker A

They're more like a recipe, if you wish. I think we use that term because of the complex pipelining and automation that goes into all this plumbing. So that's the first part of the product. The second part is the infrastructure. We need to be able to do this at very large scale. For any one group doing a design campaign, let's say you're designing 100,000 possible candidates to find the good one, that's a very large amount of compute. For small molecules, it's on the order of a few seconds per design. For proteins, it can be a bit longer. Ideally you want to do that in parallel, otherwise it's going to take you weeks. So we've put a lot of effort into our ability to have a GPU fleet that allows any one user to do this kind of large parallel search.
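The back-of-envelope math here is worth making concrete. Assuming the episode's figures (100,000 designs at a few seconds each) and perfect parallel scaling, which is an idealization:

```python
def screening_time_hours(n_designs, secs_per_design, n_gpus):
    """Wall-clock hours for an embarrassingly parallel screen, assuming perfect scaling."""
    return n_designs * secs_per_design / n_gpus / 3600

# 100,000 small-molecule designs at ~3 seconds each:
serial = screening_time_hours(100_000, 3, 1)       # about 83 hours on a single GPU
parallel = screening_time_hours(100_000, 3, 1000)  # about 5 minutes across 1,000 GPUs
```

Because each candidate is scored independently, the screen is embarrassingly parallel, which is exactly why a shared GPU fleet turns a multi-day job into minutes.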

1:04:33

Speaker B

So you're amortizing the cost over your users.

1:05:24

Speaker A

Exactly. And to some degree, using 10,000 GPUs for a minute costs the same as using one GPU for God knows how long. Right? So you might as well parallelize if you can. A lot of work has gone into that, making it very robust, so we can have a lot of people on the platform doing that at the same time. And the third one is the interface. The interface comes in two shapes. One is an API, and that's really suited for companies that want to integrate these pipelines, these agents, directly into existing workflows or user interfaces that they have. We're already partnering with a few distributors that are going to integrate our API. The second is a new user interface, and we've put a lot of thought into that too. This is what I mentioned earlier about broadening the audience; that's what the user interface is about. We've built a lot of interesting features into it, for example around collaboration. Eventually you have multiple medicinal chemists going through the results and trying to pick out which molecules to go and test in the lab, and it's powerful for them to each provide their own ranking and then do consensus building. So there are a lot of features around launching these large jobs, but also around collaborating on analyzing the results, that we try to solve with that part of the platform. Boltz Lab is a combination of these three objectives into one cohesive platform.
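The consensus-building step described here, several chemists each submitting a ranking and then merging them, is a rank-aggregation problem. A simple Borda-count sketch (illustrative only; the episode doesn't specify which aggregation method Boltz Lab uses):

```python
from collections import defaultdict

def borda_consensus(rankings):
    """Aggregate several reviewers' rankings into one consensus ordering.

    Each ranking is a list of molecule IDs, best first; a molecule earns
    more points the higher each reviewer places it.
    """
    scores = defaultdict(int)
    for ranking in rankings:
        n = len(ranking)
        for position, mol in enumerate(ranking):
            scores[mol] += n - position
    return sorted(scores, key=scores.get, reverse=True)

# Three chemists rank the same three candidates:
chemists = [
    ["mol_A", "mol_B", "mol_C"],
    ["mol_B", "mol_A", "mol_C"],
    ["mol_A", "mol_C", "mol_B"],
]
consensus = borda_consensus(chemists)  # mol_A wins the consensus
```

Borda counting is attractive here because it rewards candidates that every reviewer rates reasonably highly, rather than ones a single reviewer champions.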

1:05:26

Speaker B

Who is this accessible to?

1:06:53

Speaker A

Everyone. You do need to request access today; we're still ramping up usage, but anyone can request access. If you're an academic in particular, we provide a fair amount of free credit so you can play with the platform. If you're a startup or a biotech, you can also reach out, and we'll typically hop on a call just to understand what you're trying to do, and also provide a lot of free credit to get started. And of course, with larger companies we can deploy the platform in a more secure environment; those are more custom deals that we make with partners. That's at the ethos of Boltz, this idea of serving everyone and not just going after the really large enterprises. That starts with the open source, but it's also a key design principle of the product itself.

1:06:55

Speaker C

One thing I was thinking about with regard to infrastructure: in the LLM space, the cost of a token has gone down by, I think, a factor of 1,000 or so over the last three years. Right?

1:07:48

Speaker A

Yeah.

1:07:57

Speaker C

Is it possible that you can exploit economies of scale in infrastructure, so that you can make it cheaper to run these things yourself than for any one person to roll their own?

1:07:58

Speaker A

100%. Yeah. I mean, we're already there. Running Boltz on our platform, especially in a large screen, is considerably cheaper than it would likely cost anyone to take the open-source model and run it themselves. And on top of the infrastructure, one of the things we've been working on is accelerating the models. Our small molecule screening pipeline is 10x faster on Boltz Lab than it is in the open source. That's also part of building a product, something that scales really well. We really wanted to get to a point where we could keep prices very low, in a way that makes it a no-brainer to use Boltz through our platform.

1:08:07

Speaker C

How do you think about validation of your agentic systems? Because as you were saying earlier, AlphaFold-style models are really good at, let's say, monomeric proteins where you have coevolution data. But now suddenly the whole point is to design something that doesn't have coevolution data, something that's really novel. So you're basically leaving the domain that you know you're good at. How do you validate that?

1:08:52

Speaker A

Yeah, I'll let Gabri complete this, but there's obviously a ton of computational metrics that we rely on. Those only take you so far, though. You really have to go to the lab and test: okay, with method A and method B, how much better are we? How much better is my hit rate? How much stronger are my binders? It's not just about hit rate; it's also about how good the binders are, and there's really no way around that. I think we've really ramped up the amount of experimental validation that we do, so that we track progress in as scientifically sound a way as possible.

1:09:22

Speaker D

Yeah, I think one thing that's unique about us, and maybe companies like us, is that we're not working on a couple of therapeutic pipelines where our validation would be focused only on those. When we do an experimental validation, we try to test across tens of targets. On the one hand, that gives us much more sensitive, statistically significant results, and it really allows us to make progress on the methodological side without being steered by overfitting on any one particular system. And of course, we always try to choose targets and problems that are at the frontier of what's possible today. You don't want something too easy, and you don't want something too hard, otherwise you're not going to see progress. So this is a somewhat evolving set of targets. We talked earlier about the targets that we looked at with BoltzGen; now we're trying even harder targets, both for small molecules and proteins. We try to keep ourselves on the boundary of what's possible.
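The point about tens of targets giving "statistically significant results" can be made concrete with a standard two-proportion z-test. The numbers below are invented for illustration; the episode gives no specific hit counts:

```python
import math

def hit_rate_z(hits_a, n_a, hits_b, n_b):
    """Two-proportion z-statistic comparing the hit rates of methods A and B."""
    p_a, p_b = hits_a / n_a, hits_b / n_b
    pooled = (hits_a + hits_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    return (p_a - p_b) / se

# Hypothetical campaign: method A hits 12 of 30 targets, method B hits 5 of 30.
z = hit_rate_z(12, 30, 5, 30)  # z is about 2.0, borderline significant at the 5% level
```

With only a handful of targets, the same hit-rate difference would be well inside the noise, which is why validating across tens of diverse targets is what makes method comparisons trustworthy.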

1:10:00

Speaker C

So do you have your own lab infrastructure, or is it that you have a lot of different partnerships with academic labs, and you're just going to keep pushing on these?

1:11:07

Speaker D

We do this partially through academic labs. More and more, we do it through CROs, partly because we need replicability: often going after the same targets multiple times to see the progress from one month to the next.

1:11:15

Speaker A

And speed, speed of execution. Yeah.

1:11:33

Speaker C

So what happens if you start getting a bunch of really strong binders against therapeutic targets? What do you do, release them in the open?

1:11:37

Speaker A

Yeah, I mean, when we say we have no interest in making drugs, we're serious. When it was with the academic labs, basically they keep the results and do whatever they want with them. And with the CROs so far, yeah, we've been very open about releasing.

1:11:47

Speaker D

I will also say, and this has been a bit of an issue I have with some of the things said in the field: when we say that we design new proteins, or that we design new molecules that go and bind these particular targets, we should be very clear that these are not drugs. These are not things that are ready to be put into a human, and there's still a lot of development that goes with it. We see ourselves as building tools for scientists. At the end of the day, it really relies on the scientist having a great therapeutic hypothesis and then pushing through all the stages of development, and we try to build tools that can accompany them on that journey. It's not a magic box where you just turn a crank and get FDA-approved drugs.

1:12:03

Speaker B

Yeah, but that actually brings up an interesting question I've been wondering about: do you see yourselves staying in this, for lack of a better way of saying it, layer? Or do you think you'll start to look, in the physical sense, at different layers of the virtual cell, so to speak? Or also: there's the development process that goes design, preclinical, clinical, approval. Are you thinking about improving performance throughout that process based on the designs? Is that a direction you're pushing?

1:13:07

Speaker D

Yeah. So as Jeremy said, we are not a therapeutics company, and we want to stay that way, always at the service of all the different companies, including therapeutics companies, that we serve. To some extent, that does mean we need to go deeper and deeper in getting these models better and better. One of the things that we, like many others in the field, are doing, now that we're starting to be good at designing relatively tight binders, both small molecules and proteins, is looking at all these other properties, called developability or ADMET, that we care about when developing a drug, and asking: can we design for them from the get-go? The thing about those properties is that for some of them, you need to start having an understanding of the cell. So on the one hand, that's why we need that understanding. But the way we also think about complex diseases is that these models and tools we're building have a good understanding of biomolecular interactions. At the same time, every disease is often unique, and every therapeutic hypothesis is unique. Maybe you want something that needs to hit a particular target, let's say in a virus, in a particular way, but you don't know exactly what way. So maybe in the first set of designs you try to target different epitopes in different ways, then you test them in the lab, maybe directly in vivo, and you see which ones work and which ones don't. Then you need to bring those results back into the models, and the models can start to have a wider understanding, not just of the biophysics of the antibody interacting with that target, but of how that is shaped within the entire cell.
First of all, that means we need these loops, and that's partially how we designed the platform. But it also means we need to start understanding more and more higher-level things. I wouldn't say we're working on a virtual cell the way others are, but we're definitely thinking very deeply about how the way we target certain proteins interacts with the pathways that exist in the cell.

1:13:45

Speaker C

One question that has come up: you talk a lot about user interface and so on, and I think this is really important. But my experience dealing with medicinal chemists, when you give them machine learning models, is that they are the most superstitious, skeptical, pseudo-religious people I've ever talked to when it comes to doing science.

1:16:25

Speaker B

Sorry to the medicinal chemists listening.

1:16:45

Speaker C

Yeah, they're amazing, absolutely. I've worked with some spectacular medicinal chemists who pull magic out of their hat again and again, and I have no idea how they do it. But when you bring them a machine learning model, it's sometimes quite tricky to get them to engage with it. How has your interaction been with this, and how have you thought about building Boltz Lab to work with the skeptics?

1:16:47

Speaker D

One of the great value unlocks for us and for our product has been when we brought a medicinal chemist onto the team; his name is Jeffrey. On day one, he obviously had a lot of opinions on how we should change both the way the agents worked and the way the platform worked. But it's been really amazing, once we started shaping the platform with that feedback, to see how he went from fair skepticism to actually using more compute than any of the computational folks on the team. At times he's running all these hypotheses: okay, maybe I can hit this protein this particular way; actually, I can hit it another way; for this particular molecular space, let me try to optimize for these particular interactions. So he ends up running several screens in parallel, using hundreds of GPUs, on his own. It's been pretty incredible to see. The way I was thinking about the problem was, okay, you're just trying to design a binder, a small molecule, to a particular protein. He thinks about it much more deeply, trying all these different hypotheses. And once he gets the results from the model, he doesn't just take the top 15; he really looks them over and tries to understand the different designs. Then, when we select some designs to bring forward, we have something that both the models think is good and he does as well. That's also why we built the platform to be an interface for this kind of chemist, and a collaborative experience.

1:17:10

Speaker A

I think at the end of the day, for people to be convinced, you have to show them something they didn't think was possible. Until you have that aha moment, the skepticism will remain. But every once in a while there's a result that really surprises people, and then it's like, oh wow, okay, I can actually do something with this.

1:19:09

Speaker C

So you just get it in their hands, have them try it out, and they'll be convinced.

1:19:28

Speaker A

Yeah, or like maybe once the lab.

1:19:33

Speaker C

Results come back. Or their friend, yeah, or maybe one of their colleagues is convinced.

1:19:35

Speaker A

I think it takes going to the lab at some point. There's no avoiding that. As beautiful as the platform can be, as nice as the molecules might look that the model predicted. I think what really convinces people is hits.

1:19:40

Speaker C

Yeah, you see the results.

1:19:54

Speaker A

Exactly.

1:19:56

Speaker C

Cool. Thank you for taking the time to chat with us. Is there anything that you would like your audience to know?

1:19:57

Speaker D

First of all, we're just getting started and continuing to build the team. So we're definitely always looking for great folks, both on the software and machine learning side, but also scientists, to join the team and help us shape it.

1:20:05

Speaker C

On the infrastructure side too.

1:20:25

Speaker D

Indeed.

1:20:27

Speaker C

And if you want a new challenge: this is not just next-token prediction, this is really a new engineering challenge.

1:20:28

Speaker D

No matter how much experience you have with biology and chemistry, if you want to come help us shape what biology and chemistry will hopefully look like in five or ten years, we'd love to hear from you. Go to Boltz Bio and come join the team.

1:20:36

Speaker C

Cool. Thank you.

1:20:56

Speaker B

Awesome.

1:20:57

Speaker A

Thank you so much. Thank you.

1:20:58