- Diffusion-VLA: Scaling Robot Foundation Models via Unified Diffusion and Autoregression (arXiv preprint arXiv:2412.03293)
- TinyVLA: Towards Fast, Data-Efficient Vision-Language-Action Models for Robotic Manipulation (RA-L 2025)
- Scaling Diffusion Policy in Transformer to 1 Billion Parameters for Robotic Manipulation (ICRA 2025)
- Discrete Policy: Learning Disentangled Action Space for Multi-Task Robotic Manipulation (ICRA 2025)
- Language-Conditioned Robotic Manipulation with Fast and Slow Thinking (ICRA 2024)
- Object-Centric Instruction Augmentation for Robotic Manipulation (ICRA 2024)
- A Survey on Robotics with Foundation Models: toward Embodied AI (arXiv preprint arXiv:2402.02385)
Research Experience
Embodied Intelligence Algorithm Intern at Midea Group.
Education
Master's Student, East China Normal University (ECNU), Advisor: Chaomin Shen; Bachelor's Degree, Southwest Jiao Tong University (SWJTU), Software Engineering.
Background
Research Interests: robot learning, embodied intelligence, vision language action models; Professional Field: Software Engineering.