First author of "Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets", published at IEEE 14th International Conference on Speech and Language Processing (2024)
First author of "OSUM: Advancing Open Speech Understanding Models with Limited Resources in Academia", arXiv preprint arXiv:2501.13306 (2025)
First author of "Domain-Specific Prompts for LLM-based ASR: An Empirical Study" (to be published)
Co-author of "Enhancing Non-Core Language Instruction-Following in Speech LLMs via Semi-Implicit Cross-Lingual CoT Reasoning", published at ACM International Conference on Multimedia (ACM MM, 2025)
Co-author of "Steering Language Model to Stable Speech Emotion Recognition via Contextual Perception and Chain of Thought" (to be published)
Co-author of "Ideal-LLM: Integrating Dual Encoders and Language-Adapted LLM for Multilingual Speech-to-Text" (to be published)
Co-author of "Selective Invocation for Multilingual ASR: A Cost-effective Approach Adapting to Speech Recognition Difficulty", published at Interspeech 2025