🤖 AI Summary
This paper addresses the core challenge of synthesizing human interactions -- with other people, objects, and environments -- in digital systems. It systematically surveys advances in generating human-human, human-object, and human-scene interactions. Methodologically, it introduces the first unified taxonomy integrating foundational concepts, modeling paradigms, multimodal datasets (e.g., AMASS, PROX), and evaluation metrics; conducts an in-depth analysis of key technical approaches, including deep generative models, physics-based simulation, cross-modal alignment, and motion-capture data augmentation; and constructs a structured knowledge graph to map technological evolution and critical bottlenecks. The contributions clarify the applicability boundaries and limitations of current methods, identify open challenges, and chart future research directions. The work provides theoretical foundations and practical guidance for embodied AI in robotics, natural VR interaction, and intelligent animation generation.
📝 Abstract
Humans inhabit a world defined by interactions -- with other humans, objects, and environments. These interactive movements not only convey our relationships with our surroundings but also demonstrate how we perceive and communicate with the real world. Therefore, replicating these interaction behaviors in digital systems has emerged as an important topic for applications in robotics, virtual reality, and animation. While recent advances in deep generative models and new datasets have accelerated progress in this field, significant challenges remain in modeling the intricate human dynamics and their interactions with entities in the external world. In this survey, we present, for the first time, a comprehensive overview of the literature in human interaction motion generation. We begin by establishing foundational concepts essential for understanding the research background. We then systematically review existing solutions and datasets across three primary interaction tasks -- human-human, human-object, and human-scene interactions -- followed by evaluation metrics. Finally, we discuss open research directions and future opportunities.