Published several papers, including 'CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models' (2024) and 'HumanPlus: Humanoid Shadowing and Imitation from Humans' (2024, Best Paper Award Finalist). Also contributed to NVIDIA's Cosmos World Foundation Model Platform project (2025).
Research Experience
Involved in multiple research projects, including CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models; Cosmos World Foundation Model Platform for Physical AI; and HumanPlus: Humanoid Shadowing and Imitation from Humans.
Education
Pursuing a Ph.D. in Electrical Engineering at the Stanford Computational Imaging Lab, advised by Prof. Gordon Wetzstein.
Background
A final-year Ph.D. student in Electrical Engineering at the Stanford Computational Imaging Lab, with research interests in foundation models for perception, control, and modeling.