🤖 AI Summary
In drug safety assessment, identifying rare nephrotoxic histopathological alterations from large-scale whole-slide images (WSIs) remains challenging due to the high cost and low throughput of manual review. To address this, we propose the first self-supervised learning framework for large-scale nephrotoxicity anomaly detection, leveraging the UNI foundation model to extract tissue representations and implementing a pixel-level annotation-free k-nearest neighbors (k-NN) anomaly detection scheme. Evaluated on a WSI dataset comprising kidney sections from 158 compounds, our method achieves an AUC of 0.62 and a negative predictive value of 89%, enabling efficient filtering of normal slides. Our key contribution lies in overcoming the performance bottleneck of direct foundation model classification by enabling weakly supervised, high-throughput anomaly localization tailored for preclinical toxicological screening—thereby significantly improving both efficiency and scalability of pathological evaluation.
📝 Abstract
Kidney abnormality detection is required for all preclinical drug development. It involves a time-consuming and costly examination of hundreds to thousands of whole-slide images per drug safety study, most of which are normal, to detect any subtle changes indicating toxic effects. In this study, we present the first large-scale self-supervised abnormality detection model for kidney toxicologic pathology, spanning drug safety assessment studies from 158 compounds. We explore the complexity of kidney abnormality detection on this scale using features extracted from the UNI foundation model (FM) and show that a simple k-nearest neighbor classifier on these features performs at chance, demonstrating that the FM-generated features alone are insufficient for detecting abnormalities. We then demonstrate that a self-supervised method applied to the same features can achieve better-than-chance performance, with an area under the receiver operating characteristic curve of 0.62 and a negative predictive value of 89%. With further development, such a model can be used to rule out normal slides in drug safety assessment studies, reducing the costs and time associated with drug development.