Linking Global Science Funding to Research Publications

📅 2026-03-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of accurately linking global research funding agencies to their supported scholarly publications to enable systematic analysis of research funding flows. To this end, we propose a multi-stage hybrid disambiguation framework that integrates lexical normalization, similarity-based clustering, rule-based matching, named entity recognition, and manual validation to map ambiguous funding statements to unique organizational identifiers. Our approach successfully standardizes 1.9 million distinct funding strings, achieving high recall and precision as validated through document-level comparison and expert review. By publicly releasing annotations of match types alongside unresolved cases, our method significantly enhances transparency, reproducibility, and the capacity for large-scale analysis of the global research funding landscape.

Technology Category

Application Category

📝 Abstract
Funding acknowledgments in scholarly publications provide large-scale trace data on organizations that support scientific research. We present a dataset for linking global science funding organizations to research publications by systematically disambiguating unique funding acknowledgment strings extracted from publication metadata. Funder names are matched to standardized organizational identifiers using a multi-stage pipeline that combines lexical normalization, similarity-based clustering, rule-based matching, named entity recognition assistance, and manual validation. The resulting dataset links 1.9 million unique funder strings to canonical organization identifiers and records match types and unresolved cases to support transparency. Technical validation includes paper-level comparisons across bibliometric sources and manual verification against full-text acknowledgment sections, with reported recall and precision metrics. This dataset supports analyses of funding flows, institutional funding portfolios, regional representation, and concentration patterns in the global research system.
Problem

Research questions and friction points this paper is trying to address.

funding acknowledgment
research funding
organization disambiguation
bibliometric linkage
scientific publication
Innovation

Methods, ideas, or system contributions that make the work stand out.

funding acknowledgment disambiguation
organization identifier mapping
multi-stage matching pipeline
bibliometric validation
research funding dataset
🔎 Similar Papers
No similar papers found.
J
Jacob A. Dalsgaard
Center for Social Data Science, University of Copenhagen, Denmark
F
Filipi N Silva
Indiana University, USA
Jin Ai
Jin Ai
Rutgers University - Newark
Computational Social ScienceScience and Technology PolicyNonprofit and Philanthropy