AcroRL: Learning Aggressive Quadrotor Inversion using Bidirectional Thrust

📅 2026-05-22

📈 Citations: 0

✨ Influential: 0

career value

172K/year

🤖 AI Summary

This work addresses the limitations of traditional geometric control in inverted flight of bi-directional thrust quadrotors, where actuator saturation and motor reversal delays necessitate heuristic tuning. To overcome these challenges, the authors propose a unified reinforcement learning–based control framework trained in JAX, integrating a bi-directional thrust dynamics model with Hopf fibration–based attitude representation. For the first time, separate policies are employed to explicitly learn transitions from upright-to-inverted and inverted-to-upright flight modes. The approach eliminates manual parameter tuning while remaining compatible with standard trajectory generation and tracking pipelines. In simulation, it achieves a 32% reduction in position RMSE and a 57% decrease in settling time. Real-world experiments demonstrate stable inverted flight across multiple yaw configurations (position RMSE < 0.35 m) and successful dual-mode circular trajectories, significantly enhancing dynamic performance and system compatibility.

📝 Abstract

Bidirectional thrust grants quadrotors a second equilibrium condition and increased control authority, expanding the envelope of possible aggressive maneuvers and enabling inverted flight, perching, and sensing. Prior geometric control approaches extend differential flatness through Hopf fibration-based attitude representations to support bidirectional thrust, but struggle with actuator saturation and motor reversal delay during inversions, requiring heuristic thrust posture scheduling and waypoint tuning. We propose a learning-based framework that modulates a constant reference trajectory to perform compact, position-constrained quadrotor inversions while remaining compatible with traditional trajectory generation and tracking across flight regimes. Separate policies are trained via reinforcement learning for nominal-to-inverted and inverted-to-nominal transitions. In JAX-based simulation, the proposed method achieves the lowest position deviation and settling time across all evaluated baselines, reducing position root mean square error (RMSE) by 32% and settling time by 57% relative to the strongest optimization-based baseline. Hardware experiments demonstrate successful inversion across multiple yaw configurations with position RMSE below 0.35m, and compatibility with downstream trajectory generation and control through circular flight in both regimes. Additionally, we provide an open-source implementation of the proposed framework.

Problem

Research questions and friction points this paper is trying to address.

quadrotor inversion

bidirectional thrust

actuator saturation

motor reversal delay

position-constrained maneuver

Innovation

Methods, ideas, or system contributions that make the work stand out.

bidirectional thrust

reinforcement learning

quadrotor inversion