Static Segmentation by Tracking: A Frustratingly Label-Efficient Approach to Fine-Grained Segmentation

📅 2025-01-12
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the heavy reliance on manual annotations for fine-grained trait and part segmentation in biological specimen images, this paper proposes a novel “static segmentation as tracking” paradigm: given only one annotated image per species, static segmentation is reformulated as a cross-image mask propagation task via pseudo-video construction. Our method builds upon SAM2 and integrates three key components—pseudo-video generation, cross-image mask propagation, and cycle-consistent self-supervised fine-tuning—enabling efficient single-image adaptation. To our knowledge, this is the first approach achieving fine-grained segmentation under a one-shot (one annotated image per species) setting, attaining high accuracy on tasks such as butterfly wing pattern and beetle body segment segmentation. We further extend it to one-shot instance segmentation on field-captured images and trait-driven image retrieval. The framework significantly reduces annotation cost while enhancing the efficiency and scalability of biological trait analysis.

Technology Category

Application Category

📝 Abstract
We study image segmentation in the biological domain, particularly trait and part segmentation from specimen images (e.g., butterfly wing stripes or beetle body parts). This is a crucial, fine-grained task that aids in understanding the biology of organisms. The conventional approach involves hand-labeling masks, often for hundreds of images per species, and training a segmentation model to generalize these labels to other images, which can be exceedingly laborious. We present a label-efficient method named Static Segmentation by Tracking (SST). SST is built upon the insight: while specimens of the same species have inherent variations, the traits and parts we aim to segment show up consistently. This motivates us to concatenate specimen images into a ``pseudo-video'' and reframe trait and part segmentation as a tracking problem. Concretely, SST generates masks for unlabeled images by propagating annotated or predicted masks from the ``pseudo-preceding'' images. Powered by Segment Anything Model 2 (SAM~2) initially developed for video segmentation, we show that SST can achieve high-quality trait and part segmentation with merely one labeled image per species -- a breakthrough for analyzing specimen images. We further develop a cycle-consistent loss to fine-tune the model, again using one labeled image. Additionally, we highlight the broader potential of SST, including one-shot instance segmentation on images taken in the wild and trait-based image retrieval.
Problem

Research questions and friction points this paper is trying to address.

Biological Image Segmentation
Manual Annotation
Subtle Feature Differentiation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Static Segmentation Tracking (SST)
Segment Anything Model 2 (SAM~2)
Cyclic Consistency Adjustment
🔎 Similar Papers
No similar papers found.
Zhenyang Feng
Zhenyang Feng
University of California, Irvine
Computer Vision
Z
Zihe Wang
The Ohio State University
S
Saul Ibaven Bueno
The Ohio State University
T
Tomasz Frelek
The Ohio State University
Advikaa Ramesh
Advikaa Ramesh
Student, The Ohio State University
Computational SciencesComputer VisionMachine Learning
J
Jingyan Bai
The Ohio State University
Lemeng Wang
Lemeng Wang
Undergraduate Student, The Ohio State University
Computer VisionMachine Learning
Zanming Huang
Zanming Huang
The Ohio State University
Machine LearningComputer Vision
Jianyang Gu
Jianyang Gu
The Ohio State University
ImageomicsDataset DistillationData-centric AI
Jinsu Yoo
Jinsu Yoo
The Ohio State University
computer visionmachine learningautonomous driving
T
Tai-Yu Pan
The Ohio State University
A
A. Chowdhury
The Ohio State University
M
Michelle Ramirez
The Ohio State University
E
Elizabeth G. Campolongo
The Ohio State University
M
Matthew J. Thompson
The Ohio State University
C
Christopher G. Lawrence
Princeton University
Sydne Record
Sydne Record
Professor, University of Maine
BiogeographyCommunity Ecology
N
Neil Rosser
University of Miami
A
A. Karpatne
Virginia Tech
D
Daniel Rubenstein
Princeton University
H
H. Lapp
Duke University
Charles V. Stewart
Charles V. Stewart
Professor of Computer Science, Rensselaer Polytechnic Institute
Computer VisionMachine LearningApplications in Wildlife Ecology
T
Tanya Y. Berger-Wolf
The Ohio State University
Y
Yu Su
The Ohio State University
W
Wei-Lun Chao
The Ohio State University