OC-Distill: Ontology-aware Contrastive Learning with Cross-Modal Distillation for ICU Risk Prediction

📅 2026-04-18
📈 Citations: 0
Influential: 0
📄 PDF

career value

180K/year
🤖 AI Summary
This work addresses limitations in existing ICU risk prediction methods, which often overlook the clinical semantic relationships among diagnoses during contrastive pretraining and struggle to effectively integrate multimodal information—such as clinical notes—during fine-tuning. To overcome these challenges, the authors propose OC-Distill, a two-stage framework that first leverages the hierarchical structure of ICD ontologies to construct a clinically aware contrastive learning objective, thereby modeling semantic similarity between patients. In the second stage, cross-modal knowledge distillation transfers supervisory signals from rich clinical notes into a lightweight model that relies solely on vital signs. Evaluated across multiple ICU prediction tasks on the MIMIC dataset, OC-Distill significantly outperforms state-of-the-art vital-signs-only baselines and achieves substantially improved label efficiency.

Technology Category

Application Category

📝 Abstract
Early prediction of severe clinical deterioration and remaining length of stay can enable timely intervention and better resource allocation in high-acuity settings such as the ICU. This has driven the development of machine learning models that leverage continuous streams of vital signs and other physiological signals for real-time risk prediction. Despite their promise, existing methods have important limitations. Contrastive pretraining treats all patients as equally strong negatives, failing to capture clinically meaningful similarity between patients with related diagnoses. Meanwhile, downstream fine-tuning typically ignores complementary modalities such as clinical notes, which provide rich contextual information unavailable in physiological signals alone. To address these challenges, we propose OC-Distill, a two-stage framework that leverages multimodal supervision during training while requiring only vital signs at inference. In the first stage, we introduce an ontology-aware contrastive objective that exploits the ICD hierarchy to quantify patient similarity and learn clinically grounded representations. In the second stage, we fine-tune the pretrained encoder via cross-modal knowledge distillation, transferring complementary information from clinical notes into the model. Across multiple ICU prediction tasks on MIMIC, OC-Distill demonstrates improved label efficiency and achieves state-of-the-art performance among methods that use only vital signs at inference.
Problem

Research questions and friction points this paper is trying to address.

ICU risk prediction
contrastive learning
cross-modal distillation
ontology-aware
clinical notes
Innovation

Methods, ideas, or system contributions that make the work stand out.

ontology-aware contrastive learning
cross-modal distillation
ICU risk prediction
knowledge distillation
multimodal supervision
🔎 Similar Papers
No similar papers found.
💼 Related Jobs
Postdoctoral Fellow – AI-Driven Multi-Omics Integration for Predictive Toxicology
Pfizer
The annual base salary for this position ranges from $64,600.00 to $107,600.00. In addition, this position is eligible for participation in Pfizer’s Global Performance Plan with a bonus target of 7.5% of the base salary. We offer comprehensive and generous benefits and programs to help our colleagues lead healthy lives and to support each of life’s moments. Benefits offered include a 401(k) plan with Pfizer Matching Contributions and an additional Pfizer Retirement Savings Contribution, paid vacation, holiday and personal days, paid caregiver/parental and medical leave, and health benefits to include medical, prescription drug, dental and vision coverage. Learn more at Pfizer Candidate Site – U.S. Benefits | (uscandidates.mypfizerbenefits.com). Pfizer compensation structures and benefit packages are aligned based on the location of hire. The United States salary range provided does not apply to Tampa, FL or any location outside of the United States. Relocation assistance may be available based on business needs and/or eligibility.
Hybrid
Zhongyuan Liang
Zhongyuan Liang
University of California, Berkeley
Machine LearningHealthcareCausal Inference
J
Junhyung Jo
Samsung Advanced Institute of Technology (SAIT)
H
Hyang-Jung Lee
Samsung Advanced Institute of Technology (SAIT)
S
Sang Kyu Kim
Samsung Advanced Institute of Technology (SAIT)
Irene Y. Chen
Irene Y. Chen
Assistant Professor, UC Berkeley and UC San Francisco
machine learninghealthcareequityprecision health