🤖 AI Summary
Existing cell morphology quantification models are typically trained on single microscopy modalities, exhibiting poor generalizability across technical platforms (e.g., varying channel counts) and biological contexts. Method: We introduce CHAMMI-75—the first open, multi-channel pretraining dataset comprising 75 heterogeneous biological studies—enabling systematic integration of cross-platform, multi-channel, and multimodal microscopic images. We propose a channel-agnostic pretraining paradigm incorporating heterogeneous image registration, channel-wise normalization, and modality-aware multi-scale augmentation. Contribution/Results: Our approach significantly improves model robustness and generalization across diverse downstream tasks, empirically validating the critical role of modality diversity in cell morphology modeling. This work establishes a novel paradigm and provides essential resources for developing reusable, scalable foundation models in bioimage analysis.
📝 Abstract
Quantifying cell morphology using images and machine learning has proven to be a powerful tool to study the response of cells to treatments. However, models used to quantify cellular morphology are typically trained with a single microscopy imaging type. This results in specialized models that cannot be reused across biological studies because the technical specifications do not match (e.g., different number of channels), or because the target experimental conditions are out of distribution. Here, we present CHAMMI-75, an open access dataset of heterogeneous, multi-channel microscopy images from 75 diverse biological studies. We curated this resource from publicly available sources to investigate cellular morphology models that are channel-adaptive and can process any microscopy image type. Our experiments show that training with CHAMMI-75 can improve performance in multi-channel bioimaging tasks primarily because of its high diversity in microscopy modalities. This work paves the way to create the next generation of cellular morphology models for biological studies.