Depth-Enhanced YOLO-SAM2 Detection for Reliable Ballast Insufficiency Identification

📅 2026-02-21
📈 Citations: 0
Influential: 0
📄 PDF

career value

181K/year
🤖 AI Summary
This study addresses the low recall and insufficient reliability of existing RGB-based methods for detecting insufficient railway ballast. To overcome these limitations, we propose a highly robust detection framework that fuses RGB-D data. Our approach begins with a sleeper-aligned depth correction pipeline, which enhances depth accuracy through polynomial modeling, RANSAC, and temporal smoothing. Subsequently, we integrate YOLOv8 for object localization with SAM2 for instance segmentation, leveraging axis-aligned and rotated bounding box sampling along with geometric criteria to classify ballast conditions. Evaluated on real-world top-view RGB-D datasets, the proposed method significantly improves recall from 0.49 to 0.80 and achieves an F1 score exceeding 0.80, outperforming current state-of-the-art approaches.

Technology Category

Application Category

📝 Abstract
This paper presents a depth-enhanced YOLO-SAM2 framework for detecting ballast insufficiency in railway tracks using RGB-D data. Although YOLOv8 provides reliable localization, the RGB-only model shows limited safety performance, achieving high precision (0.99) but low recall (0.49) due to insufficient ballast, as it tends to over-predict the sufficient class. To improve reliability, we incorporate depth-based geometric analysis enabled by a sleeper-aligned depth-correction pipeline that compensates for RealSense spatial distortion using polynomial modeling, RANSAC, and temporal smoothing. SAM2 segmentation further refines region-of-interest masks, enabling accurate extraction of sleeper and ballast profiles for geometric classification. Experiments on field-collected top-down RGB-D data show that depth-enhanced configurations substantially improve the detection of insufficient ballast. Depending on bounding-box sampling (AABB or RBB) and geometric criteria, recall increases from 0.49 to as high as 0.80, and F1-score improves from 0.66 to over 0.80. These results demonstrate that integrating depth correction with YOLO-SAM2 yields a more robust and reliable approach for automated railway ballast inspection, particularly in visually ambiguous or safety-critical scenarios.
Problem

Research questions and friction points this paper is trying to address.

ballast insufficiency
railway inspection
RGB-D data
object detection
depth enhancement
Innovation

Methods, ideas, or system contributions that make the work stand out.

depth correction
YOLO-SAM2
ballast insufficiency detection
geometric analysis
RGB-D railway inspection
🔎 Similar Papers
No similar papers found.