Residual Control for Fast Recovery from Dynamics Shifts

📅 2026-03-08
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the significant performance degradation and slow recovery of robotic systems in real-world environments caused by unobserved dynamic shifts—such as actuator degradation or abrupt contact changes. To enable rapid adaptation during inference without retraining or prior knowledge of disturbances, the authors propose a stability-aligned residual control architecture that operates with a frozen policy and introduces a bounded additive residual channel. The core innovation is the Stability-Aligned Gating (SAG) mechanism, which enforces magnitude constraints, directional consistency, performance-conditioned activation, and adaptive gain modulation. This approach preserves the nominal controller’s structure while ensuring closed-loop stability and accelerating compensation. Evaluated on Go1, Cassie, H1, and Scout platforms, the method reduces average recovery time by 87%, 48%, 30%, and 20% respectively compared to a frozen SAC policy, achieving near-nominal steady-state performance.

Technology Category

Application Category

📝 Abstract
Robotic systems operating in real-world environments inevitably encounter unobserved dynamics shifts during continuous execution, including changes in actuation, mass distribution, or contact conditions. When such shifts occur mid-episode, even locally stabilizing learned policies can experience substantial transient performance degradation. While input-to-state stability guarantees bounded state deviation, it does not ensure rapid restoration of task-level performance. We address inference-time recovery under frozen policy parameters by casting adaptation as constrained disturbance shaping around a nominal stabilizing controller. We propose a stability-aligned residual control architecture in which a reinforcement learning policy trained under nominal dynamics remains fixed at deployment, and adaptation occurs exclusively through a bounded additive residual channel. A Stability Alignment Gate (SAG) regulates corrective authority through magnitude constraints, directional coherence with the nominal action, performance-conditioned activation, and adaptive gain modulation. These mechanisms preserve the nominal closed-loop structure while enabling rapid compensation for unobserved dynamics shifts without retraining or privileged disturbance information. Across mid-episode perturbations including actuator degradation, mass variation, and contact changes, the proposed method consistently reduces recovery time relative to frozen and online-adaptation baselines while maintaining near-nominal steady-state performance. Recovery time is reduced by \textbf{87\%} on the Go1 quadruped, \textbf{48\%} on the Cassie biped, \textbf{30\%} on the H1 humanoid, and \textbf{20\%} on the Scout wheeled platform on average across evaluated conditions relative to a frozen SAC policy.
Problem

Research questions and friction points this paper is trying to address.

dynamics shifts
fast recovery
robotic control
performance degradation
disturbance adaptation
Innovation

Methods, ideas, or system contributions that make the work stand out.

residual control
dynamics shift adaptation
stability alignment
reinforcement learning
online adaptation
🔎 Similar Papers
No similar papers found.
N
Nethmi Jayasinghe
Department of Electrical and Computer Engineering, University of Illinois at Chicago, Chicago, IL, USA
D
Diana Gontero
Department of Electrical and Computer Engineering, University of Illinois at Chicago, Chicago, IL, USA
F
Francesco Migliarba
Department of Electrical and Computer Engineering, University of Illinois at Chicago, Chicago, IL, USA
S
Spencer T. Brown
Department of Neurobiology, Northwestern University, Evanston, IL, USA
Vinod K. Sangwan
Vinod K. Sangwan
Northwestern University; University of Maryland College Park; Indian Institute of Technology Bombay
Condensed matter physicsNeuromorphic computingQuantum materialsEnergy conversion and storage
M
Mark C. Hersam
Department of Materials Science and Engineering, Northwestern University, Evanston, IL, USA; Department of Electrical and Computer Engineering, Northwestern University, Evanston, IL, USA; Department of Chemistry, Northwestern University, Evanston, IL, USA
Amit Ranjan Trivedi
Amit Ranjan Trivedi
University of Illinois at Chicago
Machine learningedge roboticshardware security