SkillMimic: Learning Basketball Interaction Skills from Demonstrations

📅 2024-08-12
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Traditional HOI reinforcement learning relies on hand-crafted skill rewards, suffering from poor generalization and limited scalability. This paper proposes the first unified data-driven HOI imitation reward framework, eliminating explicit reward engineering and enabling end-to-end learning of diverse basketball interaction skills directly from demonstrations. Methodologically, we integrate behavioral cloning with hierarchical imitation learning, curating a 35-minute high-fidelity basketball motion dataset and employing a hierarchical control architecture to learn multiple skills within a single policy with seamless switching. Key contributions include: (1) the first HOI-aware imitation reward mechanism; (2) automatic improvement in skill diversity and cross-task generalization as dataset scale increases; and (3) successful acquisition of multi-style skills—including dribbling, layups, and shooting—and their composition into long-horizon complex tasks such as continuous scoring sequences.

Technology Category

Application Category

📝 Abstract
Traditional reinforcement learning methods for human-object interaction (HOI) rely on labor-intensive, manually designed skill rewards that do not generalize well across different interactions. We introduce SkillMimic, a unified data-driven framework that fundamentally changes how agents learn interaction skills by eliminating the need for skill-specific rewards. Our key insight is that a unified HOI imitation reward can effectively capture the essence of diverse interaction patterns from HOI datasets. This enables SkillMimic to learn a single policy that not only masters multiple interaction skills but also facilitates skill transitions, with both diversity and generalization improving as the HOI dataset grows. For evaluation, we collect and introduce two basketball datasets containing approximately 35 minutes of diverse basketball skills. Extensive experiments show that SkillMimic successfully masters a wide range of basketball skills including stylistic variations in dribbling, layup, and shooting. Moreover, these learned skills can be effectively composed by a high-level controller to accomplish complex and long-horizon tasks such as consecutive scoring, opening new possibilities for scalable and generalizable interaction skill learning. Project page: https://ingrid789.github.io/SkillMimic/
Problem

Research questions and friction points this paper is trying to address.

Eliminates need for manual skill-specific rewards in HOI learning
Learns diverse basketball skills from unified imitation rewards
Enables complex task composition via high-level controller
Innovation

Methods, ideas, or system contributions that make the work stand out.

Unified data-driven framework eliminates skill-specific rewards
Single policy learns diverse basketball interaction skills
High-level controller composes skills for complex tasks
🔎 Similar Papers
No similar papers found.
Yinhuai Wang
Yinhuai Wang
Hong Kong University of Science and Technology
Computer VisionRoboticsEmbodied AI
Q
Qihan Zhao
The Hong Kong University of Science and Technology, China
R
Runyi Yu
The Hong Kong University of Science and Technology, China and Unitree Robotics, China
H
Hok Wai Tsui
The Hong Kong University of Science and Technology, China
Ailing Zeng
Ailing Zeng
Anuttacon
Deep LearningComputer VisionVirtual HumansVideo Generation
J
Jing Lin
Tsinghua Shenzhen International Graduate School, China
Z
Zhengyi Luo
Carnegie Mellon University, USA
Jiwen Yu
Jiwen Yu
University of Hong Kong
Video Generation ModelEmbodied AI
Xiu Li
Xiu Li
Bytedance Seed
Computer VisionComputer Graphics3D Vision
Qifeng Chen
Qifeng Chen
HKUST
Computational PhotographyImage SynthesisGenerative AIAutonomous DrivingEmbodied AI
J
Jian Zhang
Peking University Shenzhen Graduate School, China
L
Lei Zhang
International Digital Economy Academy, China
Ping Tan
Ping Tan
Hong Kong University of Science and Technology (HKUST)
Computer VisionComputer Graphics