Robotic Table Tennis: A Case Study into a High Speed Learning System

📅 2023-07-10
🏛️ Robotics: Science and Systems
📈 Citations: 11
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenges of real-time perception, precise landing-point control, and zero-shot simulation-to-reality transfer in high-speed human–robot table tennis rallies. We propose an end-to-end, low-latency closed-loop learning system. Methodologically, we integrate a lightweight deep vision model with hardware-in-the-loop latency modeling to build a ROS-based real-time motion control architecture; design a hybrid MuJoCo/PandaGym simulation environment featuring reinforcement learning policy optimization and fully automated physical environment reset; and introduce the first zero-shot sim-to-real transfer paradigm enabling direct cross-domain policy deployment. Experiments demonstrate that the physical robot sustains over 120 consecutive rallies, achieves landing-point errors <8 cm, incurs end-to-end latency per cycle <35 ms, and attains >92% zero-shot deployment success rate—marking the first demonstration of centurion-scale, high-precision, fully autonomous human–robot table tennis closed-loop learning.
📝 Abstract
We present a deep-dive into a real-world robotic learning system that, in previous work, was shown to be capable of hundreds of table tennis rallies with a human and has the ability to precisely return the ball to desired targets. This system puts together a highly optimized perception subsystem, a high-speed low-latency robot controller, a simulation paradigm that can prevent damage in the real world and also train policies for zero-shot transfer, and automated real world environment resets that enable autonomous training and evaluation on physical robots. We complement a complete system description, including numerous design decisions that are typically not widely disseminated, with a collection of studies that clarify the importance of mitigating various sources of latency, accounting for training and deployment distribution shifts, robustness of the perception system, sensitivity to policy hyper-parameters, and choice of action space. A video demonstrating the components of the system and details of experimental results can be found at https://youtu.be/uFcnWjB42I0.
Problem

Research questions and friction points this paper is trying to address.

high-speed robotic learning system
zero-shot policy transfer training
latency reduction in robot control
Innovation

Methods, ideas, or system contributions that make the work stand out.

Optimized perception subsystem
High-speed low-latency controller
Simulation for zero-shot transfer
🔎 Similar Papers
David B. D'Ambrosio
David B. D'Ambrosio
Lila Sciences
Artificial IntelligenceNeural NetworksNeuroevolutionRoboticsMultiagent Learning
J
Jonathan Abelian
Work done at Google DeepMind via FS Studio
Saminda Abeyruwan
Saminda Abeyruwan
University of Miami
Artificial IntelligenceAutonomous LearningMachine LearningReinforcement LearningSemantic Web
M
Michael Ahn
Google DeepMind
A
A. Bewley
Google DeepMind
J
Justin Boyd
Work done at Google DeepMind via FS Studio
K
K. Choromanski
Google DeepMind
O
O. Cortes
Work done at Google DeepMind via FS Studio
E
Erwin Coumans
Google DeepMind
T
Tianli Ding
Google DeepMind
W
Wenbo Gao
Google DeepMind
L
L. Graesser
Google DeepMind
Atil Iscen
Atil Iscen
Google
RoboticsReinforcement LearningEvolutionary AlgorithmsMulti-agent Learning
N
N. Jaitly
Google DeepMind
Deepali Jain
Deepali Jain
Google Deepmind
Artificial IntelligenceRoboticsReinforcement Learning
J
Juhana Kangaspunta
Google DeepMind
S
S. Kataoka
Google DeepMind
G
Gus Kouretas
Work done at Google DeepMind via Relentless Adrenaline
Y
Yuheng Kuang
Google DeepMind
N
N. Lazic
Google DeepMind
C
Corey Lynch
Google DeepMind
R
R. Mahjourian
Google DeepMind
S
Sherry Moore
Google DeepMind
T
T. Nguyen
Work done at Google DeepMind via FS Studio
K
Kenneth Oslund
Google DeepMind
B
Barney J Reed
Work done at Google DeepMind via Stickman Skills Center LLC
K
Krista Reymann
Google DeepMind
R
R. Sanketi
Google DeepMind
A
A. Shankar
Google DeepMind
P
P. Sermanet
Google DeepMind
Vikas Sindhwani
Vikas Sindhwani
Google DeepMind Robotics
AIRoboticsAI SafetyMachine LearningOptimization
Avi Singh
Avi Singh
Google Brain
Machine LearningRoboticsComputer Vision
Vincent Vanhoucke
Vincent Vanhoucke
Distinguished Engineer, Waymo
Robot LearningRoboticsMachine LearningComputer VisionArtificial Intelligence
Grace Vesom
Grace Vesom
Google DeepMind
Computer VisionMachine LearningShape Representation
P
Peng Xu
Google DeepMind