UniManip: General-Purpose Zero-Shot Robotic Manipulation with Agentic Operational Graph

📅 2026-02-13
📈 Citations: 0
Influential: 0
📄 PDF

Technology Category

Application Category

📝 Abstract
Achieving general-purpose robotic manipulation requires robots to seamlessly bridge high-level semantic intent with low-level physical interaction in unstructured environments. However, existing approaches falter in zero-shot generalization: end-to-end Vision-Language-Action (VLA) models often lack the precision required for long-horizon tasks, while traditional hierarchical planners suffer from semantic rigidity when facing open-world variations. To address this, we present UniManip, a framework grounded in a Bi-level Agentic Operational Graph (AOG) that unifies semantic reasoning and physical grounding. By coupling a high-level Agentic Layer for task orchestration with a low-level Scene Layer for dynamic state representation, the system continuously aligns abstract planning with geometric constraints, enabling robust zero-shot execution. Unlike static pipelines, UniManip operates as a dynamic agentic loop: it actively instantiates object-centric scene graphs from unstructured perception, parameterizes these representations into collision-free trajectories via a safety-aware local planner, and exploits structured memory to autonomously diagnose and recover from execution failures. Extensive experiments validate the system's robust zero-shot capability on unseen objects and tasks, demonstrating a 22.5% and 25.0% higher success rate compared to state-of-the-art VLA and hierarchical baselines, respectively. Notably, the system enables direct zero-shot transfer from fixed-base setups to mobile manipulation without fine-tuning or reconfiguration. Our open-source project page can be found at https://henryhcliu.github.io/unimanip.
Problem

Research questions and friction points this paper is trying to address.

zero-shot manipulation
general-purpose robotics
semantic-to-physical grounding
unstructured environments
open-world generalization
Innovation

Methods, ideas, or system contributions that make the work stand out.

Zero-Shot Manipulation
Agentic Operational Graph
Dynamic Scene Graph
Hierarchical Planning
Mobile Manipulation
H
Haichao Liu
School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore
Y
Yuanjiang Xue
School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore
Y
Yuheng Zhou
School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore
Haoyuan Deng
Haoyuan Deng
Nanyang Technological University
RoboticsImitation LearningReinforcement Learning
Y
Yinan Liang
Department of Automation, Tsinghua University, China
Lihua Xie
Lihua Xie
Professor of Electrical Engineering, Nanyang Technological University
Robust controlNetworked ControlMult-agent Systems
Ziwei Wang
Ziwei Wang
School of Electrical and Electronic Engineering, Nanyang Technological University
embodied AIroboticscomputer vision