Scholar
Cong Wei
Google Scholar ID: y1d5C5YAAAAJ
University of Waterloo
Reasoning
Diffusion
Efficiency
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
2,044
H-index
10
i10-index
11
Publications
14
Co-authors
10
list available
Contact
Email
congwei1230@gmail.com
Twitter
Open ↗
GitHub
Open ↗
LinkedIn
Open ↗
Publications
9 items
Context Forcing: Consistent Autoregressive Video Generation with Long Context
2026
Cited
0
Visual-Aware CoT: Achieving High-Fidelity Visual Consistency in Unified Models
2025
Cited
0
UFVideo: Towards Unified Fine-Grained Video Cooperative Understanding with Large Language Models
2025
Cited
0
UniVideo: Unified Understanding, Generation, and Editing for Videos
2025
Cited
0
Advancing Visual Large Language Model for Multi-granular Versatile Perception
2025
Cited
0
MoCha: Towards Movie-Grade Talking Character Synthesis
2025
Cited
0
Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers
2025
Cited
0
A Survey on Data-Centric AI: Tabular Learning from Reinforcement Learning and Generative AI Perspective
2025
Cited
0
Load more
Resume (English only)
Academic Achievements
- MoCha: Towards Movie-Grade Talking Character Synthesis, NeurIPS 2025 (Spotlight Presentation)
- UniVideo: Unified Understanding, Generation, and Editing for Videos, Arxiv 2025
- OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision, ICLR 2025
- Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient Vision Transformers, CVPR 2023
- UniIR: Training and Benchmarking Universal Multimodal Information Retrievers, ECCV 2024 (Oral Presentation)
- AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks, TMLR 2024 (TMLR Reproducibility Certification)
- MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI, CVPR 2024 (Oral Presentation, Best Paper Finalist)
Research Experience
- Kuaishou Technology KlingAI, May 2025 - Present, Research Scientist Intern
- Meta GenAI, US, Oct 2024 - Apr 2025, Research Scientist Intern
- ModiFace, Canada, May 2022 - Nov 2022, Machine Learning Researcher Intern
- Vector Institute, Canada, Sep 2020 - Sep 2021, Undergraduate Researcher
Education
- University of Waterloo, Canada
- PhD in Computer Science, May 2023 - Present, Advisor: Wenhu Chen
- University of Toronto, Canada
- Master of Science in Applied Computing, Sep 2021 - Jun 2023, Advisor: Florian Shkurti
- Honours Bachelor of Science, Sep 2017 - May 2021, Majors: Computer Science, Statistics, Minor: Mathematics, Advisors: David Duvenaud
- Vector Institute, Undergraduate Researcher, Advisors: David Duvenaud and Gennady Pekhimenko, Sep 2020 - Sep 2021
Background
- Research Interests: Video generation and multi-modal models
- Field: Computer Science
- Brief Introduction: Building unified models to scale up data usage. Previously, did research on sparse attention.
Co-authors
10 total
Wenhu Chen
Assistant Professor at University of Waterloo
Ge Zhang
M-A-P, Bytedance, University of Waterloo
Xiang Yue
Carnegie Mellon University
Co-author 4
Yang Chen
Research Scientist, NVIDIA
Alan Ritter
Georgia Institute of Technology
Jie Fu
Shanghai AI Lab
Co-author 8
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up