CXR-LT 2024: A MICCAI challenge on long-tailed, multi-label, and zero-shot disease classification from chest X-ray

📅 2025-06-09
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses three critical clinical challenges in chest X-ray (CXR) analysis: long-tailed disease distribution, multi-label annotation noise, and zero-shot disease classification. To this end, we introduce the largest publicly available CXR benchmark to date—comprising 377,110 images and 45 disease classes, including 19 newly curated rare conditions—structured as a long-tailed, multi-label dataset. We propose the first unified framework integrating long-tailed learning, multi-label modeling, and vision-language zero-shot transfer. Key innovations include generative label denoising, uncertainty-aware inference, and a long-tail-robust loss function, evaluated under both noisy annotations and expert-verified ground truth. Experiments demonstrate a 12.3 percentage-point improvement in mean average precision (mAP) for long-tailed classification over strong baselines and an average top-1 accuracy of 41.7% on zero-shot tasks. This work significantly enhances coverage of rare diseases and clinical representativeness, establishing a foundational resource and a deployable paradigm for intelligent CXR diagnosis.

Technology Category

Application Category

📝 Abstract
The CXR-LT series is a community-driven initiative designed to enhance lung disease classification using chest X-rays (CXR). It tackles challenges in open long-tailed lung disease classification and enhances the measurability of state-of-the-art techniques. The first event, CXR-LT 2023, aimed to achieve these goals by providing high-quality benchmark CXR data for model development and conducting comprehensive evaluations to identify ongoing issues impacting lung disease classification performance. Building on the success of CXR-LT 2023, the CXR-LT 2024 expands the dataset to 377,110 chest X-rays (CXRs) and 45 disease labels, including 19 new rare disease findings. It also introduces a new focus on zero-shot learning to address limitations identified in the previous event. Specifically, CXR-LT 2024 features three tasks: (i) long-tailed classification on a large, noisy test set, (ii) long-tailed classification on a manually annotated"gold standard"subset, and (iii) zero-shot generalization to five previously unseen disease findings. This paper provides an overview of CXR-LT 2024, detailing the data curation process and consolidating state-of-the-art solutions, including the use of multimodal models for rare disease detection, advanced generative approaches to handle noisy labels, and zero-shot learning strategies for unseen diseases. Additionally, the expanded dataset enhances disease coverage to better represent real-world clinical settings, offering a valuable resource for future research. By synthesizing the insights and innovations of participating teams, we aim to advance the development of clinically realistic and generalizable diagnostic models for chest radiography.
Problem

Research questions and friction points this paper is trying to address.

Enhancing lung disease classification using chest X-rays
Addressing long-tailed and zero-shot learning challenges
Expanding dataset for rare diseases and noisy labels
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multimodal models for rare disease detection
Generative approaches to handle noisy labels
Zero-shot learning for unseen diseases
🔎 Similar Papers
No similar papers found.
Mingquan Lin
Mingquan Lin
Assistant Professor at University of Minnesota
Medical image analysisDeep learning
G
G. Holste
Department of Electrical and Computer Engineering, The University of Texas at Austin, Austin, USA
S
Song Wang
Department of Electrical and Computer Engineering, The University of Texas at Austin, Austin, USA
Yiliang Zhou
Yiliang Zhou
University of California, Irvine
NLPAI in healthcareLLM
Y
Yishu Wei
Department of Population Health Sciences, Weill Cornell Medicine, New York, USA
Imon Banerjee
Imon Banerjee
Mayo Clinic, AZ
Deep learningNatural language processingPredictive modeling3D characterizationMedical image annotation
P
Pengyi Chen
Cooperative Medianet Innovation Center, Shanghai Jiao Tong University, Shanghai, China
Tianjie Dai
Tianjie Dai
Shanghai Jiao Tong University
Multi-modal LearningRecommendation SystemEfficient Serving
Yuexi Du
Yuexi Du
PhD candidate @ Yale University
Computer VisionMedical Image AnalysisMulti-modal Learning
N
N. Dvornek
Department of Biomedical Engineering, Yale University, New Haven, USA, Department of Radiology & Biomedical Imaging, Yale University, New Haven, USA
Y
Yuyan Ge
Center for Innovation in Data Engineering and Science, Department of Computer and Information Science, University of Pennsylvania, Philadelphia, USA
Z
Zuowei Guo
School of Electrical, Computer and Energy Engineering, Arizona State University, Tempe, USA
S
Shouhei Hanaoka
Department of Radiology, The University of Tokyo Hospital, Tokyo, Japan
Dongkyun Kim
Dongkyun Kim
School of Computer Science, Carnegie Mellon University, Pittsburgh, USA
Pablo Messina
Pablo Messina
PhD Student, PUC Chile
computer visionnlpmedical imagingchest x-raysradiology reports
Y
Yang Lu
School of Informatics, Xiamen University, Xiamen, China
Denis Parra
Denis Parra
Pontificia Universidad Católica de Chile
Recommender SystemsIntelligent User InterfacesInformation VisualizationArtificial IntelligenceMedical AI
D
Donghyun Son
Seoul National University, Seoul, South Korea
'
'Alvaro Soto
Pontificia Universidad Católica de Chile, Santiago, Chile
A
A. Urooj
Department of Radiology, Mayo Clinic, Phoenix, USA
R
René Vidal
Center for Innovation in Data Engineering and Science, Departments of Electrical and Systems Engineering & Radiology, University of Pennsylvania, Philadelphia, USA
Y
Yosuke Yamagishi
Department of Radiology, The University of Tokyo Hospital, Tokyo, Japan
Zefan Yang
Zefan Yang
Rensselaer Polytechnic Institute
Medical Image AnalysisDeep LearningArtificial Intelligence
R
Ruichi Zhang
School of Informatics, Xiamen University, Xiamen, China
Y
Yang Zhou
Institute of High Performance Computing, A*STAR, Singapore
L
L. Celi
Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, USA, Division of Pulmonary, Critical Care and Sleep Medicine, Beth Israel Deaconess Medical Center, Boston, USA, Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, USA
R
Ronald M. Summers
Clinical Center, National Institutes of Health, Bethesda, USA
Zhiyong Lu
Zhiyong Lu
Senior Investigator, NLM; Adjunct Professor of CS, UIUC
BioNLPBiomedical InformaticsMedical AIArtificial Intelligence
H
Hao Chen
Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Hong Kong, China
A
Adam E. Flanders
Department of Radiology, Thomas Jefferson University Hospital, Philadelphia, USA
G
George Shih
Department of Population Health Sciences, Weill Cornell Medicine, New York, USA
Z
Zhangyang Wang
Department of Electrical and Computer Engineering, The University of Texas at Austin, Austin, USA
Y
Yifan Peng
Department of Population Health Sciences, Weill Cornell Medicine, New York, USA