🤖 AI Summary
Existing citation dynamics models are domain-specific and mechanistically fragmented, failing to explain cross-domain phenomena such as “delayed recognition” and the ubiquity of “sleeping beauties.”
Method: We propose the first unified model integrating three fundamental mechanisms—cumulative advantage, temporal decay, and structural embedding—and validate it via multi-source citation network mining, temporal modeling, and cross-domain comparative analysis across scientific publications, legal cases, and patents.
Contribution/Results: We empirically establish the cross-domain universality of skewed citation distributions—including sleeping beauties—across all three knowledge systems. Our model significantly outperforms state-of-the-art baselines in reproducing empirical citation evolution and achieves 12–28% higher accuracy in predicting high-impact nodes. This work reveals universal principles governing citation dynamics beyond disciplinary boundaries and provides a generalizable framework for modeling knowledge diffusion and impact accumulation.
📝 Abstract
Many human knowledge systems, such as science, law, and invention, are built on documents and the citations that link them. Citations, while serving multiple purposes, primarily function as a way to explicitly document the use of prior work and thus have become central to the study of knowledge systems. Analyzing citation dynamics has revealed statistical patterns that shed light on knowledge production, recognition, and formalization, and has helped identify key mechanisms driving these patterns. However, most quantitative findings are confined to scientific citations, raising the question of universality of these findings. Moreover, existing models of individual citation trajectories fail to explain phenomena such as delayed recognition, calling for a unifying framework. Here, we analyze a newly available corpus of U.S. case law, in addition to scientific and patent citation networks, to show that they share remarkably similar citation patterns, including a heavy-tailed distribution of sleeping beauties. We propose a holistic model that captures the three core mechanisms driving collective dynamics and replicates the elusive phenomenon of delayed recognition. We demonstrate that the model not only replicates observed citation patterns, but also better predicts future successes by considering the whole system. Our work offers insights into key mechanisms that govern large-scale patterns of collective human knowledge systems and may provide generalizable perspectives on discovery and innovation across domains.