🤖 AI Summary
Instance segmentation of fruits and obstacles in orchards faces challenges including severe occlusion, variable illumination, and high target density. Method: This study systematically benchmarks YOLOv8-seg against Mask R-CNN (ResNet-50-FPN) across accuracy, speed, and robustness. We propose the first empirical evaluation framework for YOLOv8-based instance segmentation tailored to dynamic agricultural environments; design a quantitative occlusion-robustness metric and a cross-model fair-comparison protocol; construct a multi-season RGB-D orchard dataset; and introduce occlusion-aware data augmentation and IoU-aware post-processing. Results: YOLOv8-seg achieves 42.3 FPS—substantially faster than Mask R-CNN’s 11.7 FPS—while incurring only a 2.1% mAP@0.5 drop overall. On heavily occluded subsets, however, Mask R-CNN retains a 3.8% mAP@0.5 advantage, confirming complementary applicability: YOLOv8-seg is optimal for real-time edge deployment, whereas Mask R-CNN better suits high-accuracy offline analysis.