Over 6,000 citations on Google Scholar; published numerous high-impact papers at top-tier conferences including CVPR, NeurIPS, and ICLR; “Learning to Compare: Relation Network for Few-Shot Learning” has achieved over 5,575 citations; maintaining the widely acclaimed “Deep-Learning-Papers-Reading-Roadmap” with over 39.1k GitHub stars.
Research Experience
Serving as a Member of Technical Staff and RL Lead at Moonshot AI; leading research on Long Chain-of-Thought reasoning and building foundation models for artificial general intelligence; advancing sample-efficient RL algorithms, meta-RL, and their integration with large language models.
Background
AI Researcher & RL Specialist, currently serving as a Member of Technical Staff at Moonshot AI, leading reinforcement learning research towards AGI. Pioneer in few-shot learning, meta-learning, and the innovative Long Chain-of-Thought reasoning.
Miscellany
Passionate open-source contributor; actively shares insights through blog posts, academic conferences, and technical talks; recognized as an 'AGI, Metaverse, and Robotics Evangelist'.