Continual Multiple Instance Learning for Hematologic Disease Diagnosis

📅 2025-08-06
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Current continual learning (CL) methods lack support for multi-instance learning (MIL), leading to catastrophic forgetting in single-cell blood disease diagnosis under streaming clinical data. Method: We propose the first MIL-specific CL framework, which jointly leverages instance-level attention and class-center distance metrics to dynamically select highly discriminative representative instances for exemplar sets. It further integrates bag-level mean representations with a replay mechanism to preserve both historical task diversity and knowledge stability. Results: Evaluated on real-world monthly single-cell leukemia data, our framework significantly outperforms existing CL baselines, effectively mitigating performance degradation and enabling robust adaptive updates to evolving clinical data distributions. Contribution: This work pioneers the integration of MIL with continual learning, establishing a scalable, low-forgetting incremental modeling paradigm tailored to streaming medical diagnostics.

Technology Category

Application Category

📝 Abstract
The dynamic environment of laboratories and clinics, with streams of data arriving on a daily basis, requires regular updates of trained machine learning models for consistent performance. Continual learning is supposed to help train models without catastrophic forgetting. However, state-of-the-art methods are ineffective for multiple instance learning (MIL), which is often used in single-cell-based hematologic disease diagnosis (e.g., leukemia detection). Here, we propose the first continual learning method tailored specifically to MIL. Our method is rehearsal-based over a selection of single instances from various bags. We use a combination of the instance attention score and distance from the bag mean and class mean vectors to carefully select which samples and instances to store in exemplary sets from previous tasks, preserving the diversity of the data. Using the real-world input of one month of data from a leukemia laboratory, we study the effectiveness of our approach in a class incremental scenario, comparing it to well-known continual learning methods. We show that our method considerably outperforms state-of-the-art methods, providing the first continual learning approach for MIL. This enables the adaptation of models to shifting data distributions over time, such as those caused by changes in disease occurrence or underlying genetic alterations.
Problem

Research questions and friction points this paper is trying to address.

Develop continual learning for multiple instance disease diagnosis
Prevent catastrophic forgetting in dynamic medical data environments
Adapt models to shifting data distributions over time
Innovation

Methods, ideas, or system contributions that make the work stand out.

First continual learning method for MIL
Rehearsal-based instance selection strategy
Combines attention score and distance metrics
🔎 Similar Papers
No similar papers found.
Z
Zahra Ebrahimi
Institute of AI for Health, Helmholtz Munich, Neuherberg, Germany; TUM School of Computation, Information and Technology, Technical University Munich, Munich, Germany; Faculty of Mathematics, Computer Science and Statistics, Ludwig-Maximilians-Universität München (LMU), Munich, Germany
R
Raheleh Salehi
Institute of AI for Health, Helmholtz Munich, Neuherberg, Germany; Institute of Chemical Epigenetics, Faculty of Chemistry and Pharmacy, Ludwig-Maximilians-Universität München (LMU), Munich, Germany
Nassir Navab
Nassir Navab
Professor of Computer Science, Technische Universität München
Carsten Marr
Carsten Marr
Institute of AI for Health @ Helmholtz Munich & Clinics @ LMU München
AI for Biomed & Health
A
Ario Sadafi
Institute of AI for Health, Helmholtz Munich, Neuherberg, Germany; Computer Aided Medical Procedures, Technical University of Munich, Munich, Germany