Scholar
Shiyin Kang
Google Scholar ID: mnCHk8EAAAAJ
Sensetime Inc.
Speech Synthesis
Voice Conversion
Speech Recognition
Machine Learning
High Performance Computing
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
2,914
H-index
24
i10-index
43
Publications
20
Co-authors
49
list available
Contact
No contact links provided.
Publications
5 items
How Should LLMs Listen While Speaking? A Study of User-Stream Routing in Full-Duplex Spoken Dialogue
2026
Cited
0
Towards Streaming Target Speaker Extraction via Chunk-wise Interleaved Splicing of Autoregressive Language Model
2026
Cited
0
InteractiveOmni: A Unified Omni-modal Model for Audio-Visual Multi-turn Dialogue
2025
Cited
0
AdaMesh: Personalized Facial Expressions and Head Poses for Adaptive Speech-Driven 3D Facial Animation
IEEE transactions on multimedia · 2023
Cited
1
Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-Trained BERT
Interspeech · 2019
Cited
24
Resume (English only)
Co-authors
49 total
Zhiyong WU (吴志勇)
Associate Professor, Tsinghua University
Dong Yu (俞栋)
Distinguished Scientist @ Tencent AI Lab, ACM/IEEE/ISCA Fellow
Dan Su
Tencent AI Lab
Co-author 4
Kun Li
SpeechX Limited
Xunying Liu
Chinese University of Hong Kong
Shun Lei
PhD student, Tsinghua University
Songxiang Liu
Meituan multi-modal team, PhD (The Chinese University of Hong Kong)
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up