Scholar
Zuhao Yang
Google Scholar ID: TlBhP8EAAAAJ
Nanyang Technological University
video understanding
video generation
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
47
H-index
3
i10-index
2
Publications
9
Co-authors
6
list available
Contact
Email
yang0756@e.ntu.edu.sg
CV
Open ↗
Twitter
Open ↗
GitHub
Open ↗
LinkedIn
Open ↗
Publications
7 items
SVAgent: Storyline-Guided Long Video Understanding via Cross-Modal Multi-Agent Collaboration
2026
Cited
0
LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling
2025
Cited
0
OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe
2025
Cited
0
A Comprehensive Study on Visual Token Redundancy for Discrete Diffusion-based Multimodal Large Language Models
2025
Cited
0
TimeExpert: An Expert-Guided Video LLM for Video Temporal Grounding
2025
Cited
0
Versatile Transition Generation with Image-to-Video Diffusion
2025
Cited
0
ToDRE: Visual Token Pruning via Diversity and Task Awareness for Efficient Large Vision-Language Models
2025
Cited
0
Resume (English only)
Academic Achievements
- Publications:
* SIGGRAPH Asia 2025
* EMNLP 2025
* ICCV 2025 (two papers)
* ACL 2025 (two papers)
* NeurIPS 2023
- Patents:
* Method, Device, and Medium for Video Temporal Grounding with Mixture-of-Experts, US Patent, 2025
* Method, Device, and Medium for Generating Transition Videos with Diffusion Model, SG Patent, 2024
* Method, Device, and Medium for Automatic Question-Answering, CN Patent, 2022
- Awards:
* Outstanding Graduate, University of Alberta, 2021
* Dean’s Honor Roll Award, University of Alberta, 2018-2020
* International Student Scholarship, University of Alberta, 2017-2019
Research Experience
- 2025.04 - Present: AI Scientist Intern, Shanda AI Research Institute & MiroMind.ai, Singapore
- 2023.11 - 2025.03: AI Research Intern, ByteDance Inc. & TikTok, Singapore
- 2021.05 - 2022.06: NLP Algorithm Engineer, TMI Robotics Technology, Shanghai
Education
- Doctor of Philosophy: 2024.01 - Present, Nanyang Technological University, College of Computing and Data Science, Supervisor: Prof. Shijian Lu
- Master in Artificial Intelligence: 2022.08 - 2024.01, Nanyang Technological University, College of Computing and Data Science
- Bachelor in Computing Science: 2017.09 - 2021.06, University of Alberta, Department of Computing Science
Background
- Research Interests: Video-centric multimodal intelligence, including controllable generation, temporal reasoning, agentic tool use, and long-term memory
- Professional Field: Computer Vision, Artificial Intelligence
- Brief Introduction: Ph.D. student at Nanyang Technological University, focusing on video-centric multimodal intelligence
Miscellany
- Personal Interests: Collaboration, communication
Co-authors
6 total
Shijian Lu
College of Computing and Data Science, NTU
Yang Xu
Southern University of Science and Technology
Wei Pang
Professor, Department of Computer Science, Heriot-Watt University
Dongyan Zhao
Peking University
Song Bai - 柏松
MiroMind, Singapore
Ling Shao, Fellow of IEEE/IAPR
Terminus; Founder of IIAI/MBZUAI
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up