๐ค AI Summary
This study addresses a critical limitation in existing metrics for scientific novelty, which typically rely on aggregating pairwise distances between knowledge units and thus fail to capture holistic innovation arising from knowledge integration at the paper level. To overcome this, the authors propose the Cognitive Traversal Distance (CTD), a structure-aware measure grounded in weighted knowledge networks that quantifies novelty by computing the shortest path required to connect all knowledge units within a paper. Departing from conventional aggregation paradigms based on means or quantiles, CTD introducesโ for the first timeโa graph-theoretic shortest-path approach. Evaluated on a network of 27 million biomedical papers constructed from OpenAlex and MeSH, CTD significantly outperforms current metrics, demonstrates robustness to the introduction of entirely new conceptual labels, effectively identifies innovation driven by knowledge recombination, and exhibits strong discriminative power on both F1000Prime- and Nobel Prize-winning papers.
๐ Abstract
Scientific novelty is a critical construct in bibliometrics and is commonly measured by aggregating pairwise distances between the knowledge units underlying a paper. While prior work has refined how such distances are computed, less attention has been paid to how dyadic relations are aggregated to characterize novelty at the paper level. We address this limitation by introducing a network-based indicator, Cognitive Traversal Distance (CTD). Conceptualizing the historical literature as a weighted knowledge network, CTD is defined as the length of the shortest path required to connect all knowledge units associated with a paper. CTD provides a paper-level novelty measure that reflects the minimal structural distance needed to integrate multiple knowledge units, moving beyond mean- or quantile-based aggregation of pairwise distances. Using 27 million biomedical publications indexed by OpenAlex and Medical Subject Headings (MeSH) as standardized knowledge units, we evaluate CTD against expert-based novelty benchmarks from F1000Prime-recommended papers and Nobel Prize-winning publications. CTD consistently outperforms conventional aggregation-based indicators. We further show that MeSH-based CTD is less sensitive to novelty driven by the emergence of entirely new conceptual labels, clarifying its scope relative to recent text-based measures.