- Pixel Motion as Universal Representation for Robot Control (arXiv:2505.07817)
- LLaRA: Supercharging Robot Learning Data for Vision-Language Policy (ICLR 2025)
- Understanding Long Videos in One Multimodal Language Model Pass (ICLR 2025)
- xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs (arXiv:2410.16267)
- Diffusion Illusions: Hiding Images in Plain Sight (SIGGRAPH 2024)
- Mirasol3B: A Multimodal Autoregressive Model for Time-aligned and Contextual Modalities (CVPR 2024)
- RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control (CoRL 2023)
- Active Vision Reinforcement Learning under Limited Visual Observability (NeurIPS 2023)
- Token Turing Machines (CVPR 2023)
Awards
- SARA-RT: Scaling up Robotics Transformers with Self-Adaptive Robust Attention received the Best Paper Award in Robot Manipulation at ICRA 2024
- Diffusion Illusions: Hiding Images in Plain Sight received CVPR 2023 Outstanding Demo Award
Research Experience
Currently an associate professor in the Department of Computer Science at Stony Brook University; former assistant professor at Indiana University Bloomington; former staff researcher within the Robotics Section of NASA's Jet Propulsion Laboratory (JPL).
Education
Ph.D. from the University of Texas at Austin in 2008; B.S. from Korea Advanced Institute of Science and Technology (KAIST) in 2004.
Background
Research interests include robotics, computer vision, and artificial intelligence. Worked with the AI research team at Salesforce, and previously spent 5.5 years with the robotics team at Google DeepMind (formerly Google Brain).