Browse publications on Google Scholar (top-right) ↗
Resume (English only)
Academic Achievements
Publications: ICLR 2025 'Beyond correlation: The impact of human uncertainty in measuring the effectiveness of automatic evaluation and LLM-as-a-judge'; AAAI 2025 (oral) 'CriSPO: Multi-Aspect Critique-Suggestion-guided Automatic Prompt Optimization for Text Generation'; EMNLP 2024 Industry Track 'Salient Information Prompting to Steer Content in Prompt-based Abstractive Summarization'; ACL 2024 main conference 'ConSiDERS-The-Human Evaluation Framework: Rethinking Human Evaluation for Generative Large Language Models'.
Research Experience
Applied Scientist at Amazon AWS. Research projects include robustness of text classifiers and synthetic tabular generation.
Education
Ph.D. in Computer Science from MIT in 2023; B.Eng. from Tsinghua University.
Background
Research interests include natural language processing and machine learning. Recent work focuses on the application of large language models in the medical domain. Long-term objective is to pave the way for robust and deployable LLM applications, addressing challenges such as improving data quality, optimizing task-specific model performance, and developing reliable evaluation methods.