AgoraResearch hub
ExploreLibraryProfile
Account
Jian Luan
Scholar

Jian Luan

Google Scholar ID: 6Z8RUi4AAAAJ
Toshiba, Microsoft, Xiaomi
LLMVLMTTSSinging Synthesis
Google Scholar↗
Citations & Impact
All-time
Citations
1,354
 
H-index
18
 
i10-index
27
 
Publications
20
 
Co-authors
6
list available
Contact
No contact links provided.
Publications
66 items
Q-Mask: Query-driven Causal Masks for Text Anchoring in OCR-Oriented Vision-Language Models
2026
Cited
0
Iterate to Differentiate: Enhancing Discriminability and Reliability in Zero-Shot TTS Evaluation
2026
Cited
0
ACAVCaps: Enabling large-scale training for fine-grained and diverse audio understanding
2026
Cited
0
The Interspeech 2026 Audio Encoder Capability Challenge for Large Audio Language Models
2026
Cited
0
Borderless Long Speech Synthesis
2026
Cited
0
Video Streaming Thinking: VideoLLMs Can Watch and Think Simultaneously
2026
Cited
0
IMTBench: A Multi-Scenario Cross-Modal Collaborative Evaluation Benchmark for In-Image Machine Translation
2026
Cited
0
From Ideal to Real: Stable Video Object Removal under Imperfect Conditions
2026
Cited
0
Resume (English only)
Co-authors
6 total
Xu Tan
Xu Tan
Principal Researcher and Research Manager, Microsoft
Co-author 2
Co-author 2
Zhiyong WU (吴志勇)
Zhiyong WU (吴志勇)
Associate Professor, Tsinghua University
Co-author 4
Co-author 4
Shengchen Li
Shengchen Li
Xi'an Jiaotong-Liverpool University
YUANBO HOU
YUANBO HOU
University of Oxford

Welcome back

Sign in to Agora

Welcome back! Please sign in to continue.

Do not have an account?