🤖 AI Summary
Autoregressive action prediction underperforms holistic generative methods in robotic manipulation due to limited contextual modeling and sequential bottlenecks.
Method: This paper proposes a bidirectionally expanded autoregressive learning paradigm, employing a lightweight encoder-only architecture that leverages bidirectional context modeling and coarse-to-fine iterative decoding to expand a single-frame initial state into a full action sequence in logarithmic time.
Contribution/Results: It establishes the first dense autoregressive policy framework with efficient inference, overcoming the inherent limitations of conventional unidirectional autoregression. Evaluated on multi-task robotic manipulation benchmarks, the method achieves state-of-the-art sample efficiency, cross-task generalization, and inference speed, significantly outperforming mainstream generative policies.
📝 Abstract
Mainstream visuomotor policies predominantly rely on generative models for holistic action prediction, while current autoregressive policies, which predict the next token or chunk, have shown suboptimal results. This motivates a search for more effective learning methods to unleash the potential of autoregressive policies for robotic manipulation. This paper introduces a bidirectionally expanded learning approach, termed Dense Policy, to establish a new paradigm for autoregressive policies in action prediction. It employs a lightweight encoder-only architecture to iteratively unfold the action sequence from an initial single frame into the target sequence in a coarse-to-fine manner with logarithmic-time inference. Extensive experiments validate that our Dense Policy has superior autoregressive learning capabilities and can surpass existing holistic generative policies. Our policy, example data, and training code will be publicly available upon publication. Project page: https://selen-suyue.github.io/DspNet/.
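The coarse-to-fine expansion can be sketched as follows. This is a minimal illustration of the logarithmic decoding schedule only, not the authors' implementation: the hypothetical `refine` callable stands in for the encoder-only model, which attends over the entire current sequence (bidirectional context) at each pass.

```python
import math

def dense_decode(initial_action, horizon, refine):
    # Coarse-to-fine expansion sketch: the sequence doubles in length
    # every pass, so reaching a horizon of T action frames takes only
    # ceil(log2(T)) refinement passes instead of T sequential steps.
    # `refine` is a placeholder for the bidirectional encoder, which
    # sees the whole (coarse) sequence at once.
    seq = [initial_action]
    passes = 0
    while len(seq) < horizon:
        # Upsample: duplicate each frame as a coarse guess at the
        # finer temporal resolution, then refine the full sequence.
        seq = [a for a in seq for _ in range(2)]
        seq = refine(seq)
        passes += 1
    return seq[:horizon], passes

# With an identity "model", this exposes the schedule alone:
actions, passes = dense_decode(0.0, horizon=16, refine=lambda s: s)
```

With a horizon of 16 frames, the loop runs 4 passes (1 → 2 → 4 → 8 → 16), matching the claimed logarithmic inference cost.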