From Slices to Sequences: Autoregressive Tracking Transformer for Cohesive and Consistent 3D Lymph Node Detection in CT Scans

📅 2025-03-11
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Addressing challenges in CT imaging—including scattered and low-contrast 3D lymph node (LN) detection, weak inter-slice consistency modeling, and heavy reliance on post-processing—this paper proposes LN-Tracker, the first end-to-end framework that formulates 3D LN detection as an autoregressive tracking task along the z-axis. Its core innovations include: (1) decoupling detection and tracking queries; (2) introducing mask-guided cross-slice attention; (3) designing an inter-slice similarity contrastive loss; and (4) integrating multi-scale features. Built upon an enhanced DETR architecture, LN-Tracker jointly optimizes detection and instance association. Evaluated on four major LN datasets, it achieves ≥2.7% average sensitivity gain. Moreover, it generalizes effectively to pulmonary nodule and prostate tumor detection, attaining state-of-the-art performance in both tasks. The code and a newly annotated, publicly available dataset are released.

Technology Category

Application Category

📝 Abstract
Lymph node (LN) assessment is an essential task in the routine radiology workflow, providing valuable insights for cancer staging, treatment planning and beyond. Identifying scatteredly-distributed and low-contrast LNs in 3D CT scans is highly challenging, even for experienced clinicians. Previous lesion and LN detection methods demonstrate effectiveness of 2.5D approaches (i.e, using 2D network with multi-slice inputs), leveraging pretrained 2D model weights and showing improved accuracy as compared to separate 2D or 3D detectors. However, slice-based 2.5D detectors do not explicitly model inter-slice consistency for LN as a 3D object, requiring heuristic post-merging steps to generate final 3D LN instances, which can involve tuning a set of parameters for each dataset. In this work, we formulate 3D LN detection as a tracking task and propose LN-Tracker, a novel LN tracking transformer, for joint end-to-end detection and 3D instance association. Built upon DETR-based detector, LN-Tracker decouples transformer decoder's query into the track and detection groups, where the track query autoregressively follows previously tracked LN instances along the z-axis of a CT scan. We design a new transformer decoder with masked attention module to align track query's content to the context of current slice, meanwhile preserving detection query's high accuracy in current slice. An inter-slice similarity loss is introduced to encourage cohesive LN association between slices. Extensive evaluation on four lymph node datasets shows LN-Tracker's superior performance, with at least 2.7% gain in average sensitivity when compared to other top 3D/2.5D detectors. Further validation on public lung nodule and prostate tumor detection tasks confirms the generalizability of LN-Tracker as it achieves top performance on both tasks. Datasets will be released upon acceptance.
Problem

Research questions and friction points this paper is trying to address.

Challenges in detecting scattered, low-contrast lymph nodes in 3D CT scans.
Limitations of 2.5D detectors in modeling inter-slice consistency for 3D lymph nodes.
Need for heuristic post-merging steps and dataset-specific parameter tuning in existing methods.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Autoregressive transformer for 3D lymph node tracking
Masked attention module for inter-slice consistency
End-to-end detection with inter-slice similarity loss
Qinji Yu
Qinji Yu
PhD student, Shanghai Jiao Tong University
Computer VisionMedical Image Analysis
Yirui Wang
Yirui Wang
Amazon
Object DetectionTrackingMedical Image AnalysisComputer-aided Diagnosis
K
K. Yan
DAMO Academy, Alibaba Group; Hupan Lab, 310023, Hangzhou, China
D
Dandan Zheng
The First Affiliated Hospital of Zhejiang University
D
Dashan Ai
Fudan University Shanghai Cancer Center
D
Dazhou Guo
DAMO Academy, Alibaba Group
Zhanghexuan Ji
Zhanghexuan Ji
Senior Algorithm Engineer, Alibaba Group (U.S.) - DAMO Academy
Multimodel LearningLabel Efficient LearningContinual LearningComputer VisionMedical Imaging
Yanzhou Su
Yanzhou Su
FZU, UESTC
medical image analysis
Y
Yun Bian
Changhai Hospital
N
Na Shen
Zhongshan Hospital, Fudan University
X
Xiaowei Ding
Shanghai Jiao Tong University
Le Lu
Le Lu
Ant Group, IEEE Fellow, MICCAI Board Member (2021-2025)
Computer VisionMedical Image AnalysisMedical Image ComputingBiomedical Image Analysis
X
Xianghua Ye
The First Affiliated Hospital of Zhejiang University
Dakai Jin
Dakai Jin
Alibaba DAMO Academy USA
Deep LearningMedical Image AnalysisAI for Healthcare