Paper 'XAMPLER: Learning to Retrieve Cross-Lingual In-Context Examples' accepted to NAACL Findings 2025
Paper 'A Recipe of Parallel Corpora Exploitation for Multilingual Large Language Models' accepted to NAACL Findings 2025
Paper 'Emma-500: Enhancing Massively Multilingual Adaptation of Large Language Models' published on arXiv 2024
Paper 'MaLA-500: Massive Language Adaptation of Large Language Models' published on arXiv 2024
Paper 'mPLM-Sim: Unveiling Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models' accepted to EACL Findings 2024
Paper 'Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages' accepted to ACL 2023 and received Area Chair Award
Served as Program Committee member or reviewer for ACL Rolling Review, ACL, EMNLP, ICLR, NeurIPS, AAAI, SocialNLP, etc.
Journal reviewer for TALLIP
Background
Originally from Haimen, China
Currently an ELLIS Ph.D. student at the Center for Information and Language Processing (CIS), LMU Munich, and SARDINE Group, Instituto Superior Técnico
Supervised by Hinrich Schütze and André F. T. Martins
Research focuses on multilingualism and cross-lingual NLP