2025: Paper 'Data Scaling Laws in Imitation Learning for Robotic Manipulation' accepted at ICLR (Oral Presentation); received Best Paper Award at CoRL 2024 Workshop on X-Embodiment Robot Learning
2025: Contributed to GR-3, a large-scale vision-language-action (VLA) model (ByteDance Seed Technical Report)
2025: Proposed OneTwoVLA, a unified VLA model with adaptive reasoning and acting capabilities (Preprint)
2024: Presented ViLa at ICRA 2024 workshop, leveraging GPT-4V for long-horizon robotic vision-language planning
2023: Paper 'A Universal Semantic-Geometric Representation for Robotic Manipulation' accepted at CoRL 2023
2023: Paper 'For Pre-Trained Vision Models in Motor Control, Not All Policy Learning Methods are Created Equal' accepted at ICML 2023