RADx Data Hub: A Cloud Repository for FAIR, Harmonized COVID-19 Data

📅 2025-02-01
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
The COVID-19 pandemic revealed critical challenges in public health, including delayed integration of multi-source data, lack of semantic interoperability, and sluggish emergency response. To address these, we developed the first FAIR-compliant, cloud-native data hub specifically designed for pandemic response. Our approach leverages ontology-driven data mapping—compatible with standards such as OMOP and LOINC—automated metadata governance, and a cross-system semantic interoperability architecture to unify, standardize, and de-identify heterogeneous data from clinical records, diagnostic testing, wearable sensors, symptom reporting, and social determinants of health. The platform has supported dozens of NIH RADx initiatives, substantially reducing data acquisition and analysis timelines. It has been formally designated by the NIH as a national core infrastructure for reusable COVID-19 data sharing and demonstrates extensibility to other disease domains.

Technology Category

Application Category

📝 Abstract
The COVID-19 pandemic highlighted the urgent need for robust systems to enable rapid data collection, integration, and analysis for public health responses. Existing approaches often relied on disparate, non-interoperable systems, creating bottlenecks in comprehensive analyses and timely decision-making. To address these challenges, the U.S. National Institutes of Health (NIH) launched the Rapid Acceleration of Diagnostics (RADx) initiative in 2020, with the RADx Data Hub, a centralized repository for de-identified and curated COVID-19 data, as its cornerstone. The RADx Data Hub hosts diverse study data, including clinical data, testing results, smart sensor outputs, self-reported symptoms, and information on social determinants of health. Built on cloud infrastructure, the RADx Data Hub integrates metadata standards, interoperable formats, and ontology-based tools to adhere to the FAIR (Findable, Accessible, Interoperable, Reusable) principles for data sharing. Initially developed for COVID-19 research, its architecture and processes are adaptable to other scientific disciplines. This paper provides an overview of the data hosted by the RADx Data Hub and describes the platform's capabilities and architecture.
Problem

Research questions and friction points this paper is trying to address.

COVID-19
Data Collection
Public Health Decision Making
Innovation

Methods, ideas, or system contributions that make the work stand out.

RADx Data Center
Cloud-based Interoperable Solutions
COVID-19 Data Standardization
🔎 Similar Papers
No similar papers found.
Marcos Martínez-Romero
Marcos Martínez-Romero
Technical Director, Stanford University
Semantic technologyKnowledge managementSemantic WebOntologiesMetadata
M
M. Horridge
Stanford Center for Biomedical Informatics Research, Stanford University, Palo Alto, CA 94304, USA
N
Nilesh Mistry
Booz Allen Hamilton Inc., McLean, VA 22102, USA
A
Aubrie Weyhmiller
Renaissance Computing Institute (RENCI), University of North Carolina at Chapel Hill, Chapel Hill, NC 27517, USA
J
Jimmy K. Yu
Stanford Center for Biomedical Informatics Research, Stanford University, Palo Alto, CA 94304, USA
A
Alissa Fujimoto
Booz Allen Hamilton Inc., McLean, VA 22102, USA
A
Aria Henry
Renaissance Computing Institute (RENCI), University of North Carolina at Chapel Hill, Chapel Hill, NC 27517, USA
M
Martin J. O'Connor
Stanford Center for Biomedical Informatics Research, Stanford University, Palo Alto, CA 94304, USA
S
Stephanie Suber
Renaissance Computing Institute (RENCI), University of North Carolina at Chapel Hill, Chapel Hill, NC 27517, USA
M
M. U. Akdogan
Stanford Center for Biomedical Informatics Research, Stanford University, Palo Alto, CA 94304, USA
Y
Yan Cao
Stanford Center for Biomedical Informatics Research, Stanford University, Palo Alto, CA 94304, USA
S
Somu Valliappan
Booz Allen Hamilton Inc., McLean, VA 22102, USA
J
Joanna O. Mieczkowska
Renaissance Computing Institute (RENCI), University of North Carolina at Chapel Hill, Chapel Hill, NC 27517, USA
Ashok Krishnamurthy
Ashok Krishnamurthy
Renaissance Computing Institute (RENCI), University of North Carolina at Chapel Hill, Chapel Hill, NC 27517, USA
M
Michael A. Keller
Booz Allen Hamilton Inc., McLean, VA 22102, USA
M
M. Musen
Stanford Center for Biomedical Informatics Research, Stanford University, Palo Alto, CA 94304, USA