PegasusFlow: Parallel Rolling-Denoising Score Sampling for Robot Diffusion Planner Flow Matching

📅 2025-09-10
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Diffusion models for robotic trajectory planning suffer from reliance on expert demonstrations, low data efficiency, and theoretical suboptimality. This paper introduces PegasusFlow: a hierarchical, receding-horizon denoising framework that operates without expert demonstrations, enabling end-to-end trajectory optimization via parallel, environment-interaction-driven sampling of trajectory gradients. Its key contributions are: (1) the Weighted Basis Function Optimization (WBFO) algorithm, which integrates spline-based trajectory parameterization with asynchronous parallel simulation to significantly improve sampling efficiency and convergence speed; and (2) a novel diffusion strategy unifying flow matching and score-based sampling, incorporating an MPPI-inspired replacement mechanism and RL-based warm-start initialization. Evaluated on navigation and obstacle traversal tasks, PegasusFlow achieves 100% success rate—outperforming the best prior method by 18% in runtime—and supports large-scale parallel rollouts in complex terrains.

Technology Category

Application Category

📝 Abstract
Diffusion models offer powerful generative capabilities for robot trajectory planning, yet their practical deployment on robots is hindered by a critical bottleneck: a reliance on imitation learning from expert demonstrations. This paradigm is often impractical for specialized robots where data is scarce and creates an inefficient, theoretically suboptimal training pipeline. To overcome this, we introduce PegasusFlow, a hierarchical rolling-denoising framework that enables direct and parallel sampling of trajectory score gradients from environmental interaction, completely bypassing the need for expert data. Our core innovation is a novel sampling algorithm, Weighted Basis Function Optimization (WBFO), which leverages spline basis representations to achieve superior sample efficiency and faster convergence compared to traditional methods like MPPI. The framework is embedded within a scalable, asynchronous parallel simulation architecture that supports massively parallel rollouts for efficient data collection. Extensive experiments on trajectory optimization and robotic navigation tasks demonstrate that our approach, particularly Action-Value WBFO (AVWBFO) combined with a reinforcement learning warm-start, significantly outperforms baselines. In a challenging barrier-crossing task, our method achieved a 100% success rate and was 18% faster than the next-best method, validating its effectiveness for complex terrain locomotion planning. https://masteryip.github.io/pegasusflow.github.io/
Problem

Research questions and friction points this paper is trying to address.

Bypasses expert data need for robot trajectory planning
Enables parallel sampling from environmental interaction
Improves sample efficiency and convergence over traditional methods
Innovation

Methods, ideas, or system contributions that make the work stand out.

Parallel rolling-denoising score sampling
Weighted Basis Function Optimization algorithm
Asynchronous parallel simulation architecture
🔎 Similar Papers
No similar papers found.
L
Lei Ye
State Key Laboratory of Robotics and Systems, Harbin Institute of Technology, Harbin 150001, China
H
Haibo Gao
State Key Laboratory of Robotics and Systems, Harbin Institute of Technology, Harbin 150001, China
P
Peng Xu
State Key Laboratory of Robotics and Systems, Harbin Institute of Technology, Harbin 150001, China
Z
Zhelin Zhang
State Key Laboratory of Robotics and Systems, Harbin Institute of Technology, Harbin 150001, China
J
Junqi Shan
State Key Laboratory of Robotics and Systems, Harbin Institute of Technology, Harbin 150001, China
Ao Zhang
Ao Zhang
Northwestern Polytechnical University
keyword spottingAutomatic Speech Recognition
W
Wei Zhang
State Key Laboratory of Robotics and Systems, Harbin Institute of Technology, Harbin 150001, China
Ruyi Zhou
Ruyi Zhou
State Key Laboratory of Robotics and System, Harbin Institute of Technology
RoboticsSpace roboticsWheeled mobile robotsScene physical understanding
Z
Zongquan Deng
State Key Laboratory of Robotics and Systems, Harbin Institute of Technology, Harbin 150001, China
L
Liang Ding
State Key Laboratory of Robotics and Systems, Harbin Institute of Technology, Harbin 150001, China