🤖 AI Summary
This work addresses the challenge that current large language models struggle to generate research directions that are both logically coherent and non-obvious. The authors propose a novel paradigm that decomposes academic papers into conceptual units, clusters them to construct a vocabulary of “idea atoms,” and separately models coherence and cognitive accessibility—formalizing the latter for the first time as a quantitative measure of novelty in research directions. By sampling combinations with high coherence but low cognitive accessibility, the method generates “heterogeneous” research ideas. Experiments on approximately 7,500 top-tier conference papers on large language models demonstrate that the proposed approach significantly outperforms baseline methods, yielding suggestions that maintain logical coherence while enhancing both diversity and innovativeness.
📝 Abstract
Large language models are adept at synthesizing and recombining familiar material, yet they often fail at a specific kind of creativity that matters most in research: producing ideas that are both coherent and non-obvious to the current community. We formalize this gap through cognitive availability, the likelihood that a research direction would be naturally proposed by a typical researcher given what they have worked on. We introduce a pipeline that (i) decomposes papers into granular conceptual units, (ii) clusters recurring units into a shared vocabulary of idea atoms, and (iii) learns two complementary models: a coherence model that scores whether a set of atoms constitutes a viable direction, and an availability model that scores how likely that direction is to be generated by researchers drawn from the community. We then sample "alien" directions that score high on coherence but low on availability. On a corpus of $\sim$7,500 recent LLM papers from NeurIPS, ICLR and ICML, we validate that (a) conceptual units preserve paper content under reconstruction, (b) idea atoms generalize across papers rather than memorizing paper-specific phrasing, and (c) the Alien sampler produces research directions that are more diverse than LLM baselines while maintaining coherence.