
Lingwei Meng

Google Scholar ID: Vtirkf4AAAAJ

ByteDance; The Chinese University of Hong Kong

Speech and Language Processing, Speech Recognition, Speech Synthesis


Citations & Impact

Citations (all-time): 840



Publications

15 items

Looped World Models

2026


Agentic Cognitive Profiling: Realigning Automated Alzheimer's Disease Detection with Clinical Construct Validity

2026


StreamMel: Real-Time Zero-shot Text-to-Speech via Interleaved Continuous Autoregressive Modeling

2025


Towards One-bit ASR: Extremely Low-bit Conformer Quantization Using Co-training and Stochastic Precision

2025


Zero-Shot Streaming Text to Speech Synthesis with Transducer and Auto-Regressive Modeling

2025


Dynamic Uncertainty Learning with Noisy Correspondence for Text-Based Person Search

2025


Pseudo-Autoregressive Neural Codec Language Models for Efficient Zero-Shot Text-to-Speech Synthesis

2025


$C^2$AV-TSE: Context and Confidence-aware Audio Visual Target Speaker Extraction

2025


Resume

Academic Achievements

Published multiple papers at international conferences, including ACL, ICASSP, and INTERSPEECH. Received the 2025 IEEE Ganesh N. Ramaswamy Memorial Student Grant. Co-authored a survey on Next Token Prediction Towards Multimodal Intelligence.

Research Experience

Currently a Research Scientist at ByteDance Seed; previously a Research Intern at Microsoft Research Asia, working on language modeling for speech synthesis and integrating speech with large language models.

Education

Ph.D., The Chinese University of Hong Kong, Human-Computer Communications Laboratory (HCCL), supervised by Prof. Helen Meng; M.Phil., Institute of Automation, Chinese Academy of Sciences (CASIA), supervised by Prof. Jie Tian; B.Sc., Harbin Institute of Technology (HIT).

Background

Research interests include language modeling for speech synthesis and the integration of speech with large language models, along with speech processing and speech recognition.

Co-authors

20 total