🤖 AI Summary
This study addresses the problem of hypothesis-free, continuous exploratory discovery—termed “hypothesis hunting”—in large-scale scientific data. To this end, we propose AScience, a novel framework centered on ASCollab: an LLM-driven heterogeneous scientific agent system. ASCollab formalizes scientific discovery as a dynamic, collaborative process among agents, networks, and evaluation norms—where agents self-organize into evolving networks under shared assessment criteria, autonomously generate hypotheses, and simulate peer review to sustainably produce high-quality, highly novel hypotheses. The method integrates autonomous agent technology, distributed collaboration mechanisms, and self-organizing network paradigms. Experiments on cancer cohort data successfully recapitulated known biomarkers, extended established signaling pathways, and proposed experimentally testable therapeutic targets. These results empirically validate the efficacy and feasibility of agent-based social architectures for large-scale hypothesis hunting.
📝 Abstract
Large-scale scientific datasets -- spanning health biobanks, cell atlases, Earth reanalyses, and more -- create opportunities for exploratory discovery unconstrained by specific research questions. We term this process hypothesis hunting: the cumulative search for insight through sustained exploration across vast and complex hypothesis spaces. To support it, we introduce AScience, a framework modeling discovery as the interaction of agents, networks, and evaluation norms, and implement it as ASCollab, a distributed system of LLM-based research agents with heterogeneous behaviors. These agents self-organize into evolving networks, continually producing and peer-reviewing findings under shared standards of evaluation. Experiments show that such social dynamics enable the accumulation of expert-rated results along the diversity-quality-novelty frontier, including rediscoveries of established biomarkers, extensions of known pathways, and proposals of new therapeutic targets. While wet-lab validation remains indispensable, our experiments on cancer cohorts demonstrate that socially structured, agentic networks can sustain exploratory hypothesis hunting at scale.