Scholar
Yin Cui
Google Scholar ID: iP5m52IAAAAJ
Research Scientist, NVIDIA
Computer Vision
Machine Learning
Citations & Impact
All-time
Citations
15,733
H-index
30
i10-index
43
Publications
20
Co-authors
176
list available
Contact
Email
richardaecn@gmail.com
Publications
9 items
SAGE: Scalable Agentic 3D Scene Generation for Embodied AI
2026
Cited
0
DuoGen: Towards General Purpose Interleaved Multimodal Generation
2026
Cited
0
World Simulation with Video Foundation Models for Physical AI
2025
Cited
0
ArtiScene: Language-Driven Artistic 3D Scene Generation Through Image Intermediary
2025
Cited
0
Bridging Supervised Learning and Reinforcement Learning in Math Reasoning
2025
Cited
0
Describe Anything: Detailed Localized Image and Video Captioning
2025
Cited
0
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning
2025
Cited
0
Cosmos World Foundation Model Platform for Physical AI
2025
Cited
0
Resume (English only)
Academic Achievements
PAMI Mark Everingham Prize (2023) - COCO dataset
VideoGLUE: Video General Understanding Evaluation of Foundation Models (TMLR 2024)
Why Fine-grained Labels in Pretraining Benefit Generalization? (TMLR 2024)
Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation (CVPR 2024)
Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception (NeurIPS 2023)
DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model (NeurIPS 2023)
A Simple Zero-shot Prompt Weighting Technique to Improve Prompt Ensembling in Text-Image Models (ICML 2023)
F-VLM: Open-Vocabulary Object Detection upon Frozen Vision and Language Models (ICLR 2023)
Scaling Open-Vocabulary Image Segmentation with Image-Level Labels (ECCV 2022)
Open-vocabulary Object Detection via Vision and Language Knowledge Distillation (ICLR 2022)
Surrogate Gap Minimization Improves Sharpness-Aware Training (ICLR 2022)
Education
Ph.D. in Computer Science from Cornell University and Cornell Tech in 2019, advised by Professor Serge Belongie.
Background
Currently a research scientist at NVIDIA; previously a research scientist at Google. Main research interests are generative AI and multimodal learning.
Co-authors
176 total
Serge Belongie
University of Copenhagen
Tsung-Yi Lin
Research Scientist, NVIDIA
Hartwig Adam
Sr. Director of Research, Google DeepMind
Boqing Gong
Boston University, Google
Rui Qian
Research Scientist, Apple
Quoc V. Le
Research Scientist, Google
Ming-Hsuan Yang
University of California at Merced; Google DeepMind