Published over 50 peer-reviewed papers in top-tier NLP venues
Developed notable resources and tools, including:
- Chinese Word Vectors (over 12k stars on GitHub)
- CCA, CLRA, and L2C-Cohesion for text complexity analysis
- L2C-rater for automated essay scoring
- AI Taiyan: an LLM for translation and annotation of Classical Chinese
Published in high-impact venues such as Nature Human Behaviour, Language Learning, Behavior Research Methods, Studies in Second Language Acquisition, and EMNLP