Published multiple papers such as [Paper Review] Pi0, Pi0.5, Pi0-FAST - Tracing the Path of Physical Intelligence (PI), [Paper Review] OpenVLA: An Open-Source Vision-Language-Action Model, [Paper Review] Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs, etc.
Research Experience
Research areas include Robotics, Embodied AI, Vision-Language Model, Large Language Model, 3D Generation, Diffusion, and Reinforcement Learning.
Background
Machine Learning Researcher. Focused on building general-purpose embodied AI by combining machine learning with real-world interaction.
Miscellany
Personal blog covers research insights and paper reviews in robotics and embodied AI.