AffordanceGrasp-R1:Leveraging Reasoning-Based Affordance Segmentation with Reinforcement Learning for Robotic Grasping

📅 2026-02-03
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of insufficient understanding of object manipulable regions by robots under complex linguistic instructions. We propose a reasoning-driven manipulability segmentation framework that, for the first time, integrates a Chain-of-Thought cold-start strategy with reinforcement learning. The method generates grasp candidates from global point clouds and performs context-aware filtering using language-conditioned manipulability masks. By innovatively unifying spatial reasoning, semantic alignment, and grasp decision-making, our approach outperforms state-of-the-art methods across multiple benchmark datasets and demonstrates exceptional robustness and generalization in real-world robotic experiments.

Technology Category

Application Category

📝 Abstract
We introduce AffordanceGrasp-R1, a reasoning-driven affordance segmentation framework for robotic grasping that combines a chain-of-thought (CoT) cold-start strategy with reinforcement learning to enhance deduction and spatial grounding. In addition, we redesign the grasping pipeline to be more context-aware by generating grasp candidates from the global scene point cloud and subsequently filtering them using instruction-conditioned affordance masks. Extensive experiments demonstrate that AffordanceGrasp-R1 consistently outperforms state-of-the-art (SOTA) methods on benchmark datasets, and real-world robotic grasping evaluations further validate its robustness and generalization under complex language-conditioned manipulation scenarios.
Problem

Research questions and friction points this paper is trying to address.

affordance segmentation
robotic grasping
language-conditioned manipulation
spatial grounding
grasp generalization
Innovation

Methods, ideas, or system contributions that make the work stand out.

reasoning-based affordance segmentation
chain-of-thought cold-start
reinforcement learning
instruction-conditioned grasping
global scene point cloud
🔎 Similar Papers
No similar papers found.
D
Dingyi Zhou
Chair of Robotics, Artificial Intelligence and Real-time Systems, Technical University of Munich, Munich, Germany
Mu He
Mu He
Verizon
Programmable Data PlaneP4Software Defined NetworkingOptimization
Z
Zhuowei Fang
Chair of Robotics, Artificial Intelligence and Real-time Systems, Technical University of Munich, Munich, Germany
Xiangtong Yao
Xiangtong Yao
Ph.D. Student, Technische Universität München
Robot LearningRobotics
Y
Yinlong Liu
Faculty of Data Science, City University of Macau, Macau, China
A
Alois C. Knoll
Chair of Robotics, Artificial Intelligence and Real-time Systems, Technical University of Munich, Munich, Germany
H
Hu Cao
School of Automation, Southeast University, Nanjing, China