On Efficient Variants of Segment Anything Model: A Survey

📅 2024-10-07
🏛️ arXiv.org
📈 Citations: 7
Influential: 0
📄 PDF
🤖 AI Summary
While the Segment Anything Model (SAM) exhibits strong generalization capability, its substantial computational overhead hinders deployment on resource-constrained edge devices. This work presents a systematic survey of efficient SAM variants tailored for edge deployment. We introduce the first unified evaluation framework spanning diverse hardware platforms—including CPU, GPU, and Edge TPU—and conduct joint accuracy–latency–memory benchmarking on COCO and SA-1B. Our analysis categorizes acceleration techniques along six technical axes: model pruning, knowledge distillation, lightweight attention mechanisms, quantization, module substitution, and hardware-aware compilation—characterizing their Pareto-optimal trade-offs. The core contributions are: (1) an open-source, fully reproducible edge-SAM benchmark; and (2) empirical insights into the applicability domains and fundamental accuracy-efficiency trade-offs of each acceleration strategy—providing both theoretical foundations and practical guidelines for designing lightweight vision foundation models.

Technology Category

Application Category

📝 Abstract
The Segment Anything Model (SAM) is a foundational model for image segmentation tasks, known for its strong generalization across diverse applications. However, its impressive performance comes with significant computational and resource demands, making it challenging to deploy in resource-limited environments such as edge devices. To address this, a variety of SAM variants have been proposed to enhance efficiency while keeping accuracy. This survey provides the first comprehensive review of these efficient SAM variants. We begin by exploring the motivations driving this research. We then present core techniques used in SAM and model acceleration. This is followed by a detailed exploration of SAM acceleration strategies, categorized by approach, and a discussion of several future research directions. Finally, we offer a unified and extensive evaluation of these methods across various hardware, assessing their efficiency and accuracy on representative benchmarks, and providing a clear comparison of their overall performance.
Problem

Research questions and friction points this paper is trying to address.

Addressing high computational demands of Segment Anything Model
Enhancing SAM efficiency for resource-limited environments
Surveying acceleration techniques for SAM variants
Innovation

Methods, ideas, or system contributions that make the work stand out.

Efficient SAM variants for resource-limited deployment
Comprehensive review of model acceleration techniques
Unified evaluation across hardware for performance comparison
🔎 Similar Papers
No similar papers found.
Xiaorui Sun
Xiaorui Sun
University of Illinois at Chicago
Theoretical Computer Science
J
Jun Liu
School of Computing and Communications, Lancaster University
H
Hengtao Shen
School of Computer Science and Engineering, UESTC; School of Computer Science and Technology, Tongji University
X
Xiaofeng Zhu
School of Computer Science and Engineering, UESTC
Ping Hu
Ping Hu
UESTC
Computer VisionDeep LearningImage/Video Processing