Shaoxiong Ji

Google Scholar ID: t3ZA0WsAAAAJ
Technical University of Darmstadt
Machine Learning, Natural Language Processing, Health Informatics
Citations & Impact (all-time)
  • Citations: 6,887
  • H-index: 26
  • i10-index: 37
  • Publications: 20
  • Co-authors: 85
Resume (English only)
Academic Achievements
  • Released the EMMA-500 Llama 3/3.1 models and the MaLA bilingual translation corpus covering 2,500+ language pairs
  • Released a series of CPT models that study data mixing in continual pre-training
  • Released a preview of GlotEval: A Test Suite for Massively Multilingual Evaluation of Large Language Models
  • One paper on multilingual instruction fine-tuning accepted at COLING 2025
  • One paper on LM vs. MT accepted at EMNLP 2024
Research Experience
  • Principal Investigator at ELLIS Institute Finland
  • Assistant Professor at the Department of Computer Science, University of Turku, Finland
  • Independent Research Group Leader at Technical University of Darmstadt
  • Postdoctoral Researcher at the University of Helsinki, working on high-performance language technology
  • Research assistant or visiting scholar at University of Technology Sydney (UTS), The University of Queensland (UQ), Nanyang Technological University (NTU), Finnish Institute for Health and Welfare (THL), University of Munich (LMU), and Shanghai AI Lab
Education
  • Doctor of Science (Technology): Aalto University, Finland
  • Master of Philosophy: The University of Queensland, Australia
  • Bachelor of Engineering: Dalian University of Technology, China
Background
  • Shaoxiong Ji is a principal investigator at ELLIS Institute Finland and an assistant professor at the Department of Computer Science, University of Turku, Finland. His research interests include Machine Learning, Natural Language Processing, and AI for Health.
Miscellany
  • Welcomes Master's students seeking thesis opportunities, as well as visiting students and researchers in NLP and related fields, to work with him.