Scholar

Shimin Li

Google Scholar ID: 0xxkGjMAAAAJ

Fudan University

Large Language ModelSpeech Language Model

Google Scholar↗

Citations & Impact

All-time

Citations

4,388

H-index

12

i10-index

14

Publications

19

Co-authors

0

Contact

No contact links provided.

Publications

14 items

MOSS-Audio Technical Report

2026

Cited

0

MOSS-VoiceGenerator: Create Realistic Voices with Natural Language Descriptions

2026

Cited

0

MOSS-TTSD: Text to Spoken Dialogue Generation

2026

Cited

0

MOSS-TTS Technical Report

2026

Cited

0

MOSS-Audio-Tokenizer: Scaling Audio Tokenizers for Future Audio Foundation Models

2026

Cited

0

WESR: Scaling and Evaluating Word-level Event-Speech Recognition

arXiv.org · 2026

Cited

0

MOSS Transcribe Diarize Technical Report

arXiv.org · 2026

Cited

0

MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance

2025

Cited

0

Resume (English only)

Co-authors

0 total

Co-authors: 0 (list not available)