* Entity Enhancement for Implicit Discourse Relation Classification in the Biomedical Domain (ACL-IJCNLP 2021)
* Next Sentence Prediction helps Implicit Discourse Relation Classification within and across Domains (EMNLP-IJCNLP 2019)
* A Hybrid Model for Globally Coherent Story Generation (StoryNLP@ACL 2019)
* Acquiring Annotated Data with Cross-lingual Explicitation for Implicit Discourse Relation Classification (DISRPT@NAACL 2019)
* Learning to Explicitate Connectives with Seq2Seq Network for Implicit Discourse Relation Classification (IWCS 2019)
* Using Explicit Discourse Relation Connectives in Translation for Implicit Discourse Relation Classification (IJCNLP 2017)
* On the Need of Cross Validation for Discourse Relation Classification (EACL 2017)
* Attention-based Bidirectional Long Short-term Memory Networks for Relation Classification (ACL 2016)
- Professional Activities: PC member of IEEE TCSS, CoNLL 2018-2019, NAACL-HLT 2019, ACL 2019-2020, AACL-IJCNLP 2020
Research Experience
- 2024-01: Left MiniMax Inc., but still working on LLMs
- 2022-06: Left Alibaba and joined MiniMax Inc., a startup company focusing on LLM and Multimodality models
- 2021: Joined DAMO Academy, Alibaba Group as a Senior Algorithm Engineer
- During his Ph.D., he worked at the Department of Language Science and Technology and Collaborative Research Center SFB-1102 of Saarland University, Germany, researching Natural Language Processing
Education
- 2016 – 2020: Ph.D. in Computational Linguistics, Saarland University, Germany, Advisor: Prof. Dr. Vera Demberg
- 2013 – 2016: M.Sc. in Computer Science, Institute of Automation, Chinese Academy of Sciences, Beijing, China, Advisors: Prof. Dr. Hongwei Hao and Prof. Dr. Bo Xu
- 2008 – 2012: B.Eng. in Automation, Wuhan University, Hubei, China
Background
- Research Interests: Large Language Models, Multimodal AI, Discourse Relation Parsing, Sentiment Analysis, Text Generation, Relation Extraction, Deep Learning, Natural Language Understanding