AcroRL: Learning Aggressive Quadrotor Inversion using Bidirectional Thrust

📅 2026-05-22
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the limitations of traditional geometric control in inverted flight of bi-directional thrust quadrotors, where actuator saturation and motor reversal delays necessitate heuristic tuning. To overcome these challenges, the authors propose a unified reinforcement learning–based control framework trained in JAX, integrating a bi-directional thrust dynamics model with Hopf fibration–based attitude representation. For the first time, separate policies are employed to explicitly learn transitions from upright-to-inverted and inverted-to-upright flight modes. The approach eliminates manual parameter tuning while remaining compatible with standard trajectory generation and tracking pipelines. In simulation, it achieves a 32% reduction in position RMSE and a 57% decrease in settling time. Real-world experiments demonstrate stable inverted flight across multiple yaw configurations (position RMSE < 0.35 m) and successful dual-mode circular trajectories, significantly enhancing dynamic performance and system compatibility.
📝 Abstract
Bidirectional thrust grants quadrotors a second equilibrium condition and increased control authority, expanding the envelope of possible aggressive maneuvers and enabling inverted flight, perching, and sensing. Prior geometric control approaches extend differential flatness through Hopf fibration-based attitude representations to support bidirectional thrust, but struggle with actuator saturation and motor reversal delay during inversions, requiring heuristic thrust posture scheduling and waypoint tuning. We propose a learning-based framework that modulates a constant reference trajectory to perform compact, position-constrained quadrotor inversions while remaining compatible with traditional trajectory generation and tracking across flight regimes. Separate policies are trained via reinforcement learning for nominal-to-inverted and inverted-to-nominal transitions. In JAX-based simulation, the proposed method achieves the lowest position deviation and settling time across all evaluated baselines, reducing position root mean square error (RMSE) by 32% and settling time by 57% relative to the strongest optimization-based baseline. Hardware experiments demonstrate successful inversion across multiple yaw configurations with position RMSE below 0.35m, and compatibility with downstream trajectory generation and control through circular flight in both regimes. Additionally, we provide an open-source implementation of the proposed framework.
Problem

Research questions and friction points this paper is trying to address.

quadrotor inversion
bidirectional thrust
actuator saturation
motor reversal delay
position-constrained maneuver
Innovation

Methods, ideas, or system contributions that make the work stand out.

bidirectional thrust
reinforcement learning
quadrotor inversion
trajectory modulation
geometric control
🔎 Similar Papers
2022-09-20IEEE Robotics and Automation LettersCitations: 0
G
Gabriel Rodriguez
Carnegie Mellon University
H
Henri Sayag
Carnegie Mellon University
Abhishek Rathod
Abhishek Rathod
National Robotics Engineering Center
Autonomous Aerial VehiclesModel Predictive ControlMotion Planning
J
John Stecklein
Carnegie Mellon University
S
Siddharth Saha
Shield AI
C
Christopher Barngrover
Shield AI
Wennie Tabib
Wennie Tabib
Carnegie Mellon University
RoboticsActive PerceptionSLAM