Yifan Yang
Scholar

Yifan Yang

Google Scholar ID: slhAlQ0AAAAJ
Shanghai Jiao Tong University, Tencent, Microsoft, Xiaomi
Spoken Language Processing
Citations & Impact
All-time
Citations
576
 
H-index
10
 
i10-index
11
 
Publications
20
 
Co-authors
14
list available
Resume (English only)
Academic Achievements
  • 1. August 2025, one paper accepted by IEEE Signal Processing Letters
  • 2. July 2025, funded by the CIE-Tencent Doctoral Research Incentive Project
  • 3. July 2025, three papers accepted by ACMMM 2025
  • 4. May 2025, two papers accepted by Interspeech 2025
  • 5. May 2025, three papers accepted by ACL 2025 (2 Main, 1 Findings)
  • 6. March 2025, one paper accepted by ICME 2025
  • 7. December 2024, one paper accepted by ICASSP 2025
  • 8. December 2024, one paper accepted by AAAI 2025
  • 9. June 2024, three papers accepted by Interspeech 2024
  • 10. January 2024, Zipformer accepted for oral presentation by ICLR 2024
  • 11. December 2023, three papers accepted by ICASSP 2024
  • 12. May 2023, two papers accepted by Interspeech 2023
Research Experience
  • 1. Research Intern, Hunyuan Team, Tencent Technology and Engineering Group (TEG), 2025.08.20-Present, Advised by Dr. Long Zhou, Led by Dr. Xu Tan
  • 2. Research Intern, VALL-E Team & CoreAI Speech, Microsoft, 2024.03.05-2025.08.10, Co-advised by Dr. Shujie Liu and Dr. Jinyu Li
  • 3. Machine Learning Engineer Intern, Next-gen Kaldi Team, Xiaomi AI Lab, 2022.11.01-2023.08.28, Advised by Dr. Daniel Povey
Education
  • 1. Ph.D., Computer Science and Technology, Shanghai Jiao Tong University, 2023.09-Present
  • 2. B.E., Computer Science and Technology, Tianjin University, 2019.09-2023.07, GPA: 3.91/4.0, Rank: 1/139
Background
  • Ph.D. student at Shanghai Jiao Tong University, a member of the Cross Media (X-)Language Intelligence Lab (X-LANCE) in the Department of Computer Science and Engineering. Research interests include Speech Large Language Models, Text-to-Speech Synthesis, Speech Representation Learning / Speech Tokenization (Continuous and Discrete), Multilingual Speech Recognition.