Residual Reinforcement Learning for Robot Teleoperation under Stochastic Delays

πŸ“… 2026-05-14
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF

career value

220K/year
πŸ€– AI Summary
This work addresses the challenge of discontinuous teleoperation signals caused by stochastic communication delays, which induce control instability and high-frequency jitterβ€”issues poorly handled by conventional reinforcement learning approaches. To overcome this, the authors propose a delay-robust hybrid control framework that uniquely integrates an LSTM-based state estimator with a residual reinforcement learning policy. The LSTM module reconstructs continuous system states from delayed observations, while the residual policy learns compensatory torques that jointly optimize trajectory tracking accuracy and joint velocity smoothness. Evaluated on a Franka Panda robotic platform, the proposed method demonstrates superior performance over existing techniques, maintaining stable and smooth teleoperation even under high-variance random delays.
πŸ“ Abstract
Stochastic communication delays in teleoperation introduce signal discontinuities that undermine control stability and degrade control performance. Consequently, the conventional reinforcement learning (RL) methods struggle with the delayed observations due to the delay-induced observations, leading to high-frequency chattering. To address this, we propose a hybrid control framework, delay-resilient RL, integrating a state estimator utilizing Long Short-Term Memory (LSTM) with a residual RL policy, which is resilient to stochastic delays. The LSTM reconstructs smooth, continuous state estimates from delayed observations, enabling the RL agent to learn a residual torque compensation policy that balances tracking accuracy with velocity smoothness. Experimental validation on Franka Panda robots demonstrates that our approach significantly outperforms the state-of-the-art baselines, ensuring robust and stable teleoperation even under high-variance stochastic delays.
Problem

Research questions and friction points this paper is trying to address.

teleoperation
stochastic delays
reinforcement learning
control stability
signal discontinuities
Innovation

Methods, ideas, or system contributions that make the work stand out.

residual reinforcement learning
stochastic delays
LSTM-based state estimation
teleoperation
delay-resilient control
πŸ”Ž Similar Papers
No similar papers found.