Publications include: 'Enhancing Personalized Multi-Turn Dialogue with Curiosity Reward' (NeurIPS 2025); 'Infer Human's Intentions Before Following Natural Language Instructions' (AAAI 2025); 'Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning' (NeurIPS 2024); 'Toward a More Complete OMR Solution' (ISMIR 2024); 'HandMeThat: Human-Robot Communication in Physical and Social Environments' (NeurIPS Datasets and Benchmarks Track 2022).
Research Experience
No specific research or work experience mentioned.
Education
PhD: Computer Science, University of Washington, advised by Natasha Jaques; BEng: Computer Science and Technology, Yao Class, Tsinghua University.
Background
Research Interests: Artificial Intelligence, Social Reinforcement Learning, Large Language Models. About Me: Second-year PhD student in Computer Science and Engineering at the University of Washington, focused on building AI models that better understand human intentions or preferences and interact with humans in the real world.
Miscellany
No personal interests, hobbies, or other information explicitly mentioned.