A Clinical Point Cloud Paradigm for In-Hospital Mortality Prediction from Multi-Level Incomplete Multimodal EHRs

๐Ÿ“… 2026-04-06
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This work addresses the multi-level incompleteness in multimodal electronic health records (EHRs)โ€”arising from irregular sampling, missing modalities, and sparse labelsโ€”by introducing a clinical point cloud paradigm. It unifies heterogeneous clinical events as points in a continuous 4D space defined by content, time, modality, and patient identity. A low-rank relational attention mechanism captures high-order dependencies among arbitrary events, enabling fine-grained self-supervised learning without rigid alignment through hierarchical interactions and an efficient sampling strategy. The proposed method substantially outperforms existing models on large-scale EHR-based risk prediction tasks, demonstrating exceptional performance and robustness, particularly under high rates of missing data.
๐Ÿ“ Abstract
Deep learning-based modeling of multimodal Electronic Health Records (EHRs) has become an important approach for clinical diagnosis and risk prediction. However, due to diverse clinical workflows and privacy constraints, raw EHRs are inherently multi-level incomplete, including irregular sampling, missing modalities, and sparse labels. These issues cause temporal misalignment, modality imbalance, and limited supervision. Most existing multimodal methods assume relatively complete data, and even methods designed for incompleteness usually address only one or two of these issues in isolation. As a result, they often rely on rigid temporal/modal alignment or discard incomplete data, which may distort raw clinical semantics. To address this problem, we propose HealthPoint (HP), a unified clinical point cloud paradigm for multi-level incomplete EHRs. HP represents heterogeneous clinical events as points in a continuous 4D space defined by content, time, modality, and case. To model interactions between arbitrary point pairs, we introduce a Low-Rank Relational Attention mechanism that efficiently captures high-order dependencies across these four dimensions. We further develop a hierarchical interaction and sampling strategy to balance fine-grained modeling and computational efficiency. Built on this framework, HP enables flexible event-level interaction and fine-grained self-supervision, supporting robust modality recovery and effective use of unlabeled data. Experiments on large-scale EHR datasets for risk prediction show that HP consistently achieves state-of-the-art performance and strong robustness under varying degrees of incompleteness.
Problem

Research questions and friction points this paper is trying to address.

incomplete multimodal EHRs
in-hospital mortality prediction
multi-level incompleteness
temporal misalignment
modality imbalance
Innovation

Methods, ideas, or system contributions that make the work stand out.

clinical point cloud
multi-level incomplete EHRs
low-rank relational attention
heterogeneous event modeling
self-supervised learning
๐Ÿ”Ž Similar Papers
No similar papers found.
B
Bohao Li
Beihang University
Tao Zou
Tao Zou
The Australian National University
Covariance regressionnetwork data modelingenvironmental statisticsfinancial statistics
J
Junchen Ye
Hong Kong Polytechnic University
Y
Yan Gong
Beihang University
Bowen Du
Bowen Du
Beihang University