AgoraResearch hub
ExploreLibraryProfile
Account
Zhenbo Luo
Scholar

Zhenbo Luo

Google Scholar ID: Sh6y-_EAAAAJ
XiaoMi
Vision Language ModelComputer Vision
Google Scholar↗
Citations & Impact
All-time
Citations
3,360
 
H-index
16
 
i10-index
20
 
Publications
20
 
Co-authors
13
list available
Contact
No contact links provided.
Publications
29 items
OmniJigsaw: Enhancing Omni-Modal Reasoning via Modality-Orchestrated Reordering
2026
Cited
0
Q-Mask: Query-driven Causal Masks for Text Anchoring in OCR-Oriented Vision-Language Models
2026
Cited
0
Video Streaming Thinking: VideoLLMs Can Watch and Think Simultaneously
2026
Cited
0
IMTBench: A Multi-Scenario Cross-Modal Collaborative Evaluation Benchmark for In-Image Machine Translation
2026
Cited
0
PatchCue: Enhancing Vision-Language Model Reasoning with Patch-Based Visual Cues
2026
Cited
0
EMO-R3: Reflective Reinforcement Learning for Emotional Reasoning in Multimodal Large Language Models
2026
Cited
0
MSJoE: Jointly Evolving MLLM and Sampler for Efficient Long-Form Video Understanding
2026
Cited
0
ThinkOmni: Lifting Textual Reasoning to Omni-modal Scenarios via Guidance Decoding
2026
Cited
0
Resume (English only)
Co-authors
13 total
Co-author 1
Co-author 1
Co-author 2
Co-author 2
Co-author 3
Co-author 3
Pei Fu (付培)
Pei Fu (付培)
Xiaomi
Co-author 5
Co-author 5
Zengchang Qin
Zengchang Qin
Beihang University
Fei Yin
Fei Yin
NLPR, CASIA
Cheng-Lin Liu
Cheng-Lin Liu
Institute of Automation, Chinese Academy of Sciences

Welcome back

Sign in to Agora

Welcome back! Please sign in to continue.

Do not have an account?