- 'ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding', ICLR2025 Oral (1.8%).
- 'Aria: An Open Multimodal Native Mixture-of-Experts Model', Technical Report.
- 'Exploring AIGC Video Quality: A Focus on Visual Harmony, Video-Text Consistency and Domain Distribution Gap', CVPRW2024.
- 'Bringing Textual Prompt to AI-Generated Image Quality Assessment', ICME2024.
News
- 2025.02: ChartMoE selected for an Oral presentation at ICLR2025.
- 2025.01: ChartMoE accepted by ICLR2025.
- 2024.10: Released Aria, a native LMM excelling in text, code, image, video, PDF, and more.
- 2024.09: Released ChartMoE, an MLLM with MoE connector for advanced chart understanding, replot, editing, highlighting, and transformation.
Research Experience
Participated in several MLLM research projects:
- 2024.05 - 2024.12: 01.ai & Rhymes.ai, Multimodal Team, supervised by Junnan Li, working closely with Dongxu Li and Haoning Wu.
- 2024.02 - 2024.07: IDEA Research, working closely with Zhengzhuo Xu, Yiyan Qi, and Chengjin Xu.
Education
An MPhil candidate at the School of Electronic and Computer Engineering (SECE), Peking University (PKU) since Fall 2022. Previously received an Honours B.E. degree from the School of Electronic Information and Communications (EIC), Huazhong University of Science and Technology (HUST) in June 2022.
Background
Research interests include Vision-Language Models and MLLM Reasoning.
Miscellany
Enjoys open-sourcing. The logo of his website is his pet cat, Baka.