AffordanceSAM: Segment Anything Once More in Affordance Grounding

📅 2025-04-22

📈 Citations: 0

✨ Influential: 0

career value

213K/year

🤖 AI Summary

To address the limited generalization of affordance grounding models to unseen objects and novel affordances in embodied intelligence, this paper introduces the first SAM-based extension framework for functional region identification. Methodologically, it designs an affordance-adaptive module and establishes a coarse-to-fine multi-stage supervised training paradigm, enabling end-to-end generation of affordance heatmaps and facilitating the transfer of foundational segmentation models into functional semantic space. Crucially, it aligns vision foundation models with functional semantics, supporting zero-shot affordance localization. Evaluated on the AGD20K benchmark, the approach significantly outperforms state-of-the-art methods and demonstrates strong generalization to both unseen objects and novel affordances. This work provides a scalable, function-aware perception foundation for open-world interaction in embodied AI systems.

Technology Category

Application Category

📝 Abstract

Improving the generalization ability of an affordance grounding model to recognize regions for unseen objects and affordance functions is crucial for real-world application. However, current models are still far away from such standards. To address this problem, we introduce AffordanceSAM, an effective approach that extends SAM's generalization capacity to the domain of affordance grounding. For the purpose of thoroughly transferring SAM's robust performance in segmentation to affordance, we initially propose an affordance-adaption module in order to help modify SAM's segmentation output to be adapted to the specific functional regions required for affordance grounding. We concurrently make a coarse-to-fine training recipe to make SAM first be aware of affordance objects and actions coarsely, and then be able to generate affordance heatmaps finely. Both quantitative and qualitative experiments show the strong generalization capacity of our AffordanceSAM, which not only surpasses previous methods under AGD20K benchmark but also shows evidence to handle the task with novel objects and affordance functions.

Problem

Research questions and friction points this paper is trying to address.

Improving generalization for unseen object affordance recognition

Extending SAM's segmentation to affordance grounding tasks

Enhancing model performance with novel objects and functions

Innovation

Methods, ideas, or system contributions that make the work stand out.

Extends SAM to affordance grounding

Uses affordance-adaption module

Employs coarse-to-fine training

🔎 Similar Papers

SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation