🤖 AI Summary
Invasive lung adenocarcinoma (ILA) exhibits a high 5-year postoperative recurrence rate of up to 70%, yet current clinical and computational tools lack sufficient accuracy for identifying high-risk patients. To address this, we propose CellEcoNet—a spatially aware deep learning framework that, for the first time, models individual cells in hematoxylin-and-eosin-stained whole-slide images (H&E WSIs) as “words.” Leveraging a natural language processing–inspired paradigm, CellEcoNet automatically learns spatial semantic features—including cell types, neighborhood relationships, and tissue architecture—to decode recurrence-associated biological signals from the tumor microenvironment. Evaluated on a multicenter cohort of 456 ILA cases, CellEcoNet achieves an AUC of 77.8% and a hazard ratio of 9.54, significantly outperforming established clinical benchmarks (IASLC grading, AJCC staging) and state-of-the-art computational models. Moreover, it demonstrates robustness and fairness across diverse demographic and clinical subgroups.
📝 Abstract
Despite surgical resection, ~70% of invasive lung adenocarcinoma (ILA) patients recur within five years, and current tools fail to identify those needing adjuvant therapy. To address this unmet clinical need, we introduce CellEcoNet, a novel spatially aware deep learning framework that models whole slide images (WSIs) through natural language analogy, defining a "language of pathology," where cells act as words, cellular neighborhoods become phrases, and tissue architecture forms sentences. CellEcoNet learns these context-dependent meanings automatically, capturing how subtle variations and spatial interactions derive recurrence risk. On a dataset of 456 H&E-stained WSIs, CellEcoNet achieved superior predictive performance (AUC:77.8% HR:9.54), outperforming IASLC grading system (AUC:71.4% HR:2.36), AJCC Stage (AUC:64.0% HR:1.17) and state-of-the-art computational methods (AUCs:62.2-67.4%). CellEcoNet demonstrated fairness and consistent performance across diverse demographic and clinical subgroups. Beyond prognosis, CellEcoNet marks a paradigm shift by decoding the tumor microenvironment's cellular "language" to reveal how subtle cell variations encode recurrence risk.