Resume (English only)
Academic Achievements
Published multiple papers, including "Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length"; received the Outstanding Paper Award at EMNLP 2023.
Research Experience
Since Fall 2020, Research Lead at the Information Sciences Institute, University of Southern California. Research focuses on efficient unified neural architectures and learning algorithms for learning a universal semantic space across data modalities, as well as efficient and robust architectures and methods for modeling long-range dependencies in LLMs.
Education
Since Fall 2020, Research Assistant Professor in the Department of Computer Science, University of Southern California. Ph.D. from the Language Technologies Institute, Carnegie Mellon University, advised by Prof. Eduard Hovy. Master's degree from the Center for Brain-like Computing and Machine Intelligence, Shanghai Jiao Tong University, China. Bachelor's degree in Computer Science from Shanghai Jiao Tong University, as a member of the ACM Class, now part of Zhiyuan College at SJTU.
Background
Research interests center on deep-learning-based representation learning, with the goal of improving its effectiveness, efficiency, interpretability, and robustness. Particular focus areas include the efficiency of multi-modal large language models (LLMs), efficient and robust long-context modeling in LLMs, and applications and evaluation methods for multi-modal LLMs on long sequential data.