Multitask Reinforcement Learning for Quadcopter Attitude Stabilization and Tracking using Graph Policy

📅 2025-03-11
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the coupled yet fundamentally distinct dual-control problem of quadrotor attitude tracking and strong robust stabilization under arbitrary initial conditions. We propose the first multi-task Soft Actor-Critic (SAC) framework leveraging Graph Convolutional Network (GCN)-based policies. Our key innovation lies in embedding GCNs into the shared multi-task policy network to enable unified modeling of heterogeneous control objectives. The resulting policy is highly compact—only 24 neurons per layer—enabling real-time onboard deployment at 400 Hz. Training is conducted in parallel within IsaacGym, and the learned policy is rigorously validated on a Pixhawk-based physical platform. Compared to single-task baselines, our approach achieves a 3.2× improvement in sample efficiency and a 2.8× acceleration in training convergence, while delivering millisecond-level real-time response with zero additional computational overhead.

Technology Category

Application Category

📝 Abstract
Quadcopter attitude control involves two tasks: smooth attitude tracking and aggressive stabilization from arbitrary states. Although both can be formulated as tracking problems, their distinct state spaces and control strategies complicate a unified reward function. We propose a multitask deep reinforcement learning framework that leverages parallel simulation with IsaacGym and a Graph Convolutional Network (GCN) policy to address both tasks effectively. Our multitask Soft Actor-Critic (SAC) approach achieves faster, more reliable learning and higher sample efficiency than single-task methods. We validate its real-world applicability by deploying the learned policy - a compact two-layer network with 24 neurons per layer - on a Pixhawk flight controller, achieving 400 Hz control without extra computational resources. We provide our code at https://github.com/robot-perception-group/GraphMTSAC_UAV/.
Problem

Research questions and friction points this paper is trying to address.

Quadcopter attitude stabilization and tracking challenges
Unified reward function for distinct control tasks
Real-world deployment on Pixhawk flight controller
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multitask deep reinforcement learning for quadcopter control
Graph Convolutional Network policy for unified task handling
Real-time deployment on Pixhawk at 400 Hz
🔎 Similar Papers
No similar papers found.