Haiyang Liu

Google Scholar ID: U_9vNgsAAAAJ
The University of Tokyo
Research interests: Human Video Generation, Motion Generation, Multi-Modal Understanding and Generation
Citations & Impact (all-time)
  • Citations: 413
  • H-index: 5
  • i10-index: 4
  • Publications: 9
  • Co-authors: 8
Academic Achievements
  • Selected publications include 'Livatar-1: Real-Time Talking Heads Generation with Tailored Flow Matching', 'Video Motion Graphs', 'TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation', 'EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Mask Audio Gesture Modeling', 'BEAT: A Large-Scale Semantic and Emotional Multi-Modal Dataset for Conversational Gesture Synthesis', and 'DisCo: Disentangled Implicit Content and Rhythm Learning for Diverse Co-Speech Gesture Synthesis'.
Research Experience
  • PhD internships: Hedra Research (real-time human video generation); Adobe Research, Video AI Lab (multi-modal human video generation; mentor: Yang Zhou); CyberAgent AI Lab, Computer Graphics Group (co-speech video generation; mentor: Takafumi Taketomi); and Huawei Research Tokyo, Digital Human Lab (co-speech gesture generation; mentor: Naoya Iwamoto).
Education
  • Received an M.E. from Waseda University in September 2020 and a B.E. from Southeast University in September 2019.
Background
  • Currently a final-year PhD student in Information Science and Technology at The University of Tokyo, focusing on human video generation and motion generation using multi-modal conditions such as speech, text scripts, keypoints, and images. Interested in impact-driven research problems and simple yet effective ideas.
Miscellany
  • Seeking full-time positions starting in 2025.