Towards an AI co-scientist

📅 2025-02-26
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Low efficiency in scientific hypothesis generation hampers biomedical discovery. Method: We propose an asynchronous multi-agent AI system powered by Gemini 2.0, introducing the novel “generate–debate–evolve” collaborative paradigm and a tournament-based hypothesis evolution mechanism, enabling test-time computation-driven hypothesis self-optimization. Our approach integrates multi-agent architecture, test-time scaling, and domain-specific knowledge constraints to support diverse tasks—including drug repurposing (e.g., AML in vitro tumor suppression), novel target identification (epigenetic targets for liver fibrosis, validated via organ-on-a-chip), and bacterial antimicrobial resistance mechanism analysis. Contribution/Results: This work presents the first systematic validation of AI-driven hypothesis discovery in biomedicine. Multiple original hypotheses were experimentally confirmed; several align closely with unpublished experimental data. The framework establishes a reproducible, empirically verifiable paradigm for AI-augmented scientific discovery.

Technology Category

Application Category

📝 Abstract
Scientific discovery relies on scientists generating novel hypotheses that undergo rigorous experimental validation. To augment this process, we introduce an AI co-scientist, a multi-agent system built on Gemini 2.0. The AI co-scientist is intended to help uncover new, original knowledge and to formulate demonstrably novel research hypotheses and proposals, building upon prior evidence and aligned to scientist-provided research objectives and guidance. The system's design incorporates a generate, debate, and evolve approach to hypothesis generation, inspired by the scientific method and accelerated by scaling test-time compute. Key contributions include: (1) a multi-agent architecture with an asynchronous task execution framework for flexible compute scaling; (2) a tournament evolution process for self-improving hypotheses generation. Automated evaluations show continued benefits of test-time compute, improving hypothesis quality. While general purpose, we focus development and validation in three biomedical areas: drug repurposing, novel target discovery, and explaining mechanisms of bacterial evolution and anti-microbial resistance. For drug repurposing, the system proposes candidates with promising validation findings, including candidates for acute myeloid leukemia that show tumor inhibition in vitro at clinically applicable concentrations. For novel target discovery, the AI co-scientist proposed new epigenetic targets for liver fibrosis, validated by anti-fibrotic activity and liver cell regeneration in human hepatic organoids. Finally, the AI co-scientist recapitulated unpublished experimental results via a parallel in silico discovery of a novel gene transfer mechanism in bacterial evolution. These results, detailed in separate, co-timed reports, demonstrate the potential to augment biomedical and scientific discovery and usher an era of AI empowered scientists.
Problem

Research questions and friction points this paper is trying to address.

AI co-scientist aids scientific hypothesis generation.
Multi-agent system enhances biomedical research discovery.
Automated process improves drug target identification.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-agent system for hypothesis generation
Asynchronous task execution framework
Tournament evolution process for self-improvement
J
Juraj Gottweis
Google Cloud AI Research
Wei-Hung Weng
Wei-Hung Weng
Google DeepMind
artificial intelligencemachine learningnatural language processingmedical imaginghealthcare
A
Alexander Daryin
Google Cloud AI Research
Tao Tu
Tao Tu
Columbia University, Google
multi-modal neuroimagingmachine learningneural information processing
Anil Palepu
Anil Palepu
PhD Student, Harvard-MIT Health Science & Technology
P
Petar Sirkovic
Google Cloud AI Research
A
Artiom Myaskovsky
Google Cloud AI Research
Felix Weissenberger
Felix Weissenberger
PhD in Computer Science, ETH
NeuroinformaticsComputational NeuroscienceDiscrete MathematicsMachine Learning
K
Keran Rong
Google DeepMind
Ryutaro Tanno
Ryutaro Tanno
Research Scientist, Google DeepMind
Machine LearningDeep LearningHealthcareComputer Vision
K
Khaled Saab
Google DeepMind
D
Dan Popovici
Google Research
J
Jacob Blum
Stanford University School of Medicine
F
Fan Zhang
Google Research
Katherine Chou
Katherine Chou
Google
MLHealthGraphics
A
Avinatan Hassidim
Google Research
Burak Gokturk
Burak Gokturk
Google
Computer VisionMachine Learning
Amin Vahdat
Amin Vahdat
Google
SystemsNetworkingComputer NetworksDistributed SystemsOperating Systems
Pushmeet Kohli
Pushmeet Kohli
DeepMind
AI for ScienceMachine LearningAI SafetyComputer VisionProgram Synthesis
Yossi Matias
Yossi Matias
Google
Andrew Carroll
Andrew Carroll
Google
K
Kavita Kulkarni
Google Research
Nenad Tomasev
Nenad Tomasev
Google DeepMind
artificial intelligencemachine learningstochastic optimizationartificial lifebioinformatics
Y
Yuan Guan
Stanford University School of Medicine
V
Vikram Dhillon
Houston Methodist
E
Eeshit Dhaval Vaishnav
Sequome
B
Byron Lee
Sequome
T
Tiago R D Costa
Fleming Initiative and Imperial College London
J
José R Penadés
Fleming Initiative and Imperial College London
G
Gary Peltz
Stanford University School of Medicine
Y
Yunhan Xu
Google DeepMind
A
Annalisa Pawlosky
Google Cloud AI Research
Alan Karthikesalingam
Alan Karthikesalingam
Google Health
Artificial Intelligence in Healthcare
V
Vivek Natarajan
Google Research