Scholar
Yale Song
Google Scholar ID: dNHNpxoAAAAJ
Google
Computer Vision
Multimodal Learning
Representation Learning
Citations & Impact (All-time)
Citations: 5,822
H-index: 34
i10-index: 59
Publications: 20
Co-authors: 131 (list available)
Publications
8 items
PaperOrchestra: A Multi-Agent Framework for Automated AI Research Paper Writing. 2026. Cited: 0
GUIDE: A Benchmark for Understanding and Assisting Users in Open-Ended GUI Tasks. 2026. Cited: 0
VQQA: An Agentic Approach for Video Evaluation and Quality Improvement. 2026. Cited: 0
PaperBanana: Automating Academic Illustration for AI Scientists. 2026. Cited: 0
Enrich and Detect: Video Temporal Grounding with Multimodal LLMs. 2025. Cited: 0
Enhancing Visual Planning with Auxiliary Tasks and Multi-token Prediction. 2025. Cited: 0
PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding. 2025. Cited: 0
VITED: Video Temporal Evidence Distillation. 2025. Cited: 0
Resume (English only)
Background
Research interests: Machine Learning, Data Science, and Artificial Intelligence. Passionate about using technology to solve real-world problems.
Miscellany
Hobbies include hiking, reading, and playing chess.
Co-authors
131 total
Alejandro Jaimes
Chief AI Officer at Dataminr
Daniel McDuff
Google and University of Washington
Gunhee Kim
Professor, Seoul National University
Shuang Ma
Apple AI/ML
YoungJae Yu
Seoul National University
Yunseok Jang
University of Michigan, Ann Arbor