DiffAero: A GPU-Accelerated Differentiable Simulation Framework for Efficient Quadrotor Policy Learning

📅 2025-09-12
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Low learning efficiency and poor sim-to-real transfer hinder quadrotor control policy development. Method: This paper proposes a lightweight, GPU-accelerated differentiable simulation framework enabling environment-level and agent-level parallelism. It supports fully differentiable, GPU-native modeling of multiple dynamics models (e.g., rigid-body and aerodynamic coupling) and configurable sensor stacks (IMU, depth camera, LiDAR), unifying physics simulation with neural rendering. Contribution/Results: Its end-to-end differentiable architecture boosts simulation throughput by over two orders of magnitude versus conventional CPU-based simulators. Robust flight policies can be trained within hours on consumer-grade GPUs; deployed policies achieve zero-shot or lightweight fine-tuned transfer to real hardware, reducing trajectory tracking error by 42% in physical experiments. The framework demonstrates strong practicality for hybrid reinforcement learning and rapid policy deployment.

Technology Category

Application Category

📝 Abstract
This letter introduces DiffAero, a lightweight, GPU-accelerated, and fully differentiable simulation framework designed for efficient quadrotor control policy learning. DiffAero supports both environment-level and agent-level parallelism and integrates multiple dynamics models, customizable sensor stacks (IMU, depth camera, and LiDAR), and diverse flight tasks within a unified, GPU-native training interface. By fully parallelizing both physics and rendering on the GPU, DiffAero eliminates CPU-GPU data transfer bottlenecks and delivers orders-of-magnitude improvements in simulation throughput. In contrast to existing simulators, DiffAero not only provides high-performance simulation but also serves as a research platform for exploring differentiable and hybrid learning algorithms. Extensive benchmarks and real-world flight experiments demonstrate that DiffAero and hybrid learning algorithms combined can learn robust flight policies in hours on consumer-grade hardware. The code is available at https://github.com/flyingbitac/diffaero.
Problem

Research questions and friction points this paper is trying to address.

Efficient quadrotor control policy learning
GPU-accelerated differentiable simulation framework
Eliminating CPU-GPU data transfer bottlenecks
Innovation

Methods, ideas, or system contributions that make the work stand out.

GPU-accelerated differentiable simulation framework
Fully parallelized physics and GPU rendering
Integrated dynamics models and sensor stacks
🔎 Similar Papers
No similar papers found.
X
Xinhong Zhang
State Key Lab of Autonomous Intelligent Unmanned Systems, Beijing Institute of Technology, Beijing 100081, China
R
Runqing Wang
State Key Lab of Autonomous Intelligent Unmanned Systems, Beijing Institute of Technology, Beijing 100081, China
Yunfan Ren
Yunfan Ren
Postdoctoral Fellow, Robotics and Perception Group, UZH
Robotics
J
Jian Sun
State Key Lab of Autonomous Intelligent Unmanned Systems, Beijing Institute of Technology, Beijing 100081, China
H
Hao Fang
State Key Lab of Autonomous Intelligent Unmanned Systems, Beijing Institute of Technology, Beijing 100081, China
J
Jie Chen
Harbin Institute of Technology, and State Key Lab of Autonomous Intelligent Unmanned Systems, Beijing Institute of Technology, Beijing 100081, China
G
Gang Wang
State Key Lab of Autonomous Intelligent Unmanned Systems, Beijing Institute of Technology, Beijing 100081, China