Scholar
Shimin Li
Google Scholar ID: 0xxkGjMAAAAJ
Fudan University
Large Language Model
Speech Language Model
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
4,388
H-index
12
i10-index
14
Publications
19
Co-authors
0
Contact
No contact links provided.
Publications
13 items
MOSS-VoiceGenerator: Create Realistic Voices with Natural Language Descriptions
2026
Cited
0
MOSS-TTSD: Text to Spoken Dialogue Generation
2026
Cited
0
MOSS-TTS Technical Report
2026
Cited
0
MOSS-Audio-Tokenizer: Scaling Audio Tokenizers for Future Audio Foundation Models
2026
Cited
0
WESR: Scaling and Evaluating Word-level Event-Speech Recognition
arXiv.org · 2026
Cited
0
MOSS Transcribe Diarize Technical Report
arXiv.org · 2026
Cited
0
MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance
2025
Cited
0
VStyle: A Benchmark for Voice Style Adaptation with Spoken Instructions
2025
Cited
0
Load more
Resume (English only)
Co-authors
0 total
Co-authors: 0 (list not available)
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up