Published multiple papers at top international conferences, including NeurIPS, EMNLP, ACL, ICML, ICLR, COLING, ICASSP, and AAAI, between 2020 and 2025.
Research Experience
Worked at the Natural Language Processing Lab; interned with the Natural Language Computing (NLC) group at Microsoft Research Asia (MSRA) from May 2022 to May 2023; began a new internship with the Machine Learning (ML) group in December 2023.
Education
Received a Bachelor's degree in Computer Science and Technology from Northeastern University in 2017; a Master's degree in Computer Software and Theory from Northeastern University in 2020; and a Ph.D. from the Department of Computer Science and Technology at Northeastern University, supervised by Prof. Tong Xiao and Prof. Jingbo Zhu.
Background
Research interests include complex architecture modeling, deep Transformers, multimodal modeling, and machine learning. Current focus is on large language models, with work spanning prompt engineering via deliberation (DTG), evolutionary-algorithm-based prompt search (EvoPrompt), foundation models (PCformer and its follow-up work), and improvements to DPO (temporal-decay-based DPO). The primary research domain is sequence generation tasks, including machine translation and abstractive summarization.