Scholar
Qingkai Fang
Google Scholar ID: n2lRntoAAAAJ
Institute of Computing Technology, Chinese Academy of Sciences
Large Language Models
Speech Language Models
Multimodal LLMs
Speech Translation
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
667
H-index
12
i10-index
13
Publications
20
Co-authors
12
list available
Contact
No contact links provided.
Publications
5 items
FastLongSpeech: Enhancing Large Speech-Language Models for Efficient Long-Speech Processing
2025
Cited
0
Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model
2025
Cited
0
LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis
2025
Cited
0
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token
2025
Cited
0
LLaMA-Omni: Seamless Speech Interaction with Large Language Models
arXiv.org · 2024
Cited
18
Resume (English only)
Co-authors
12 total
Shaolei Zhang
Institute of Computing Technology, Chinese Academy of Sciences (ICT/CAS)
Zhengrui Ma
Institute of Computing Technology, Chinese Academy of Sciences
Yan Zhou
Institute of Computing Technology, Chinese Academy of Sciences (ICT/CAS)
shoutao guo
Institute of Computing Technology, Chinese Academy of Sciences (ICT/CAS)
Langlin Huang
Washington University in St. Louis
Zhuocheng Zhang
Institute of Computing Technology, Chinese Academy of Science
Min Zhang
Professor of Computer Science, Soochow University
Rong Ye
ByteDance
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up