- Paper: 'Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning', arXiv 2025, Under Review
- Paper: 'RoboTwin 2.0: A Scalable Data Generator and Benchmark with Strong Domain Randomization for Robust Bimanual Robotic Manipulation', arXiv 2025, Under Review
- Paper: 'EduHome: Leveraging LLMs for Human Behavioural Insights and Strategy Development through Parent–Child Homework Conversations', Under Review
- Paper: 'The Homework Wars: Exploring Emotions, Behaviours, and Conflicts in Parent-Child Homework Interactions', ACM IMWUT/UbiComp 2025
- Paper: 'HyCodePolicy: Hybrid Language Controllers for Multimodal Monitoring and Decision in Embodied Agents', ICCV 2025 Workshop on Multi-Modal Reasoning for Agentic Intelligence
- Paper: 'Self-Guide: A LLM Reasoning Enhancement Method Based on Self-Guided Planning', CCL 2024 / Journal of Chinese Information Processing
Research Experience
Research Experiences:
- University of North Carolina at Chapel Hill – Research Intern
- Advisor: Prof. Mingyu Ding
- Duration: June 2025 – Present, Remote
- Research Focus: Enhancing multimodal language models
Education
Degree: Bachelor's; University: Northeastern University; Advisors: Prof. Mingyu Ding (UNC-Chapel Hill), Prof. Yao (Mark) Mu (Shanghai Jiao Tong University); Time: Currently enrolled; Major: Artificial Intelligence.
Background
Research Interests: Language Grounding, Multimodal Reasoning and Planning, Human-Robot Interaction. Field: Artificial Intelligence. Brief Introduction: Senior undergraduate student at Northeastern University, China, focusing on developing foundation models that ground language and perception in real-world physical understanding, enabling robots to reason, plan, and act effectively in complex environments.
Miscellany
Personal Interests: Live, travel, adventure, bless, and don't be sorry. 🌍✨
Looking for a Ph.D. position starting in 2026 Fall. Please feel free to reach out!