Scholar
Yuhang He
Google Scholar ID: H1p3ve8AAAAJ
Microsoft Research
Multimodal Learning
Machine Learning
World Model
Computer Vision
Spatial Audio
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
516
H-index
12
i10-index
15
Publications
20
Co-authors
9
list available
Contact
No contact links provided.
Publications
27 items
Beyond World-Frame Action Heads: Motion-Centric Action Frames for Vision-Language-Action Models
2026
Cited
0
ReVision: Scaling Computer-Use Agents via Temporal Visual Redundancy Reduction
2026
Cited
0
Hypergraph and Latent ODE Learning for Multimodal Root Cause Localization in Microservices
2026
Cited
0
Rethinking Token-Level Credit Assignment in RLVR: A Polarity-Entropy Analysis
2026
Cited
0
Training-free Spatially Grounded Geometric Shape Encoding (Technical Report)
2026
Cited
0
Learning Like Humans: Analogical Concept Learning for Generalized Category Discovery
2026
Cited
0
GFPL: Generative Federated Prototype Learning for Resource-Constrained and Data-Imbalanced Vision Task
2026
Cited
0
GOAL: Geometrically Optimal Alignment for Continual Generalized Category Discovery
2026
Cited
0
Load more
Resume (English only)
Co-authors
9 total
Long Chen
Waytous,Chinese Academy of Sciences
Andrew Markham
Department of Computer Science, University of Oxford
Niki Trigoni
University of Oxford
Anoop Cherian
Mitsubishi Electric Research Labs (MERL), Adj. Assoc. Prof. Australian National University
Sangyun Shin
University of Oxford
Jia-Xing Zhong
University of Oxford <- Peking University
Chen Feng (冯晨)
New York University
Co-author 8
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up