đ¤ AI Summary
This paper investigates whether large language models (LLMs) can transcend information retrieval and instruction following to achieve genuine novel knowledge discovery. Method: Grounded in Peirceâs abductiveâdeductiveâinductive triadic logic, it establishes the first unified analytical framework for LLM-driven hypothesis generation toward AGI, systematically characterizing critical pathways and fundamental bottlenecks in generative knowledge discovery. It proposes a closed-loop âhypothesis generationâapplicationâvalidationâ technical architecture integrating prompt engineering, self-verifying reasoning, rule distillation, and empirical evaluation. Contribution/Results: Synthesizing over 100 state-of-the-art studies, the work identifies key advancesâincluding transferable hypothesis modeling, domain-adaptive validation, and enhanced causal interpretabilityâwhile revealing six persistent challenges: weak falsifiability, poor cross-domain generalization, among others. The framework provides both theoretical grounding and methodological foundations for evolving LLMs into scientific innovation engines.
đ Abstract
Since the advent of Large Language Models (LLMs), efforts have largely focused on improving their instruction-following and deductive reasoning abilities, leaving open the question of whether these models can truly discover new knowledge. In pursuit of artificial general intelligence (AGI), there is a growing need for models that not only execute commands or retrieve information but also learn, reason, and generate new knowledge by formulating novel hypotheses and theories that deepen our understanding of the world. Guided by Peirce's framework of abduction, deduction, and induction, this survey offers a structured lens to examine LLM-based hypothesis discovery. We synthesize existing work in hypothesis generation, application, and validation, identifying both key achievements and critical gaps. By unifying these threads, we illuminate how LLMs might evolve from mere ``information executors'' into engines of genuine innovation, potentially transforming research, science, and real-world problem solving.