Scholar

Jun Rao

Google Scholar ID: gbhq6EoAAAAJ

Harbin Institute of Technology (Shenzhen)

LLMsEfficient Post-trainingKnowledge DistillationMultimodal

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

293

H-index

i10-index

Publications

Co-authors

list available

Contact

Emailrao7jun@gmail.com GitHubOpen ↗

Publications

7 items

MathAgent: Adversarial Evolution of Constraint Graphs for Mathematical Reasoning Data Synthesis

2026

Cited

PACE: Defying the Scaling Hypothesis of Exploration in Iterative Alignment for Mathematical Reasoning

2026

Cited

SeaPO: Strategic Error Amplification for Robust Preference Optimization of Large Language Models

2025

Cited

AQuilt: Weaving Logic and Self-Inspection into Low-Cost, High-Relevance Data Synthesis for Specialist LLMs

2025

Cited

APT: Improving Specialist LLM Performance with Weakness Case Acquisition and Iterative Preference Training

2025

Cited

REA-RL: Reflection-Aware Online Reinforcement Learning for Efficient Large Reasoning Models

2025

Cited

Dynamic Sampling that Adapts: Iterative DPO for Self-Aware Mathematical Reasoning

2025

Cited

Resume (English only)

Academic Achievements

Published over 20 papers in top conferences and international journals including ACL, EMNLP, SIGIR, MM, CIKM, TALLIP, IPM, ToMM, and TMM. Specific publications include 'SeaPO: Strategic Error Amplification for Robust Preference Optimization of Large Language Models' (EMNLP 2025), 'APT: Improving Specialist LLM Performance with Weakness Case Acquisition and Iterative Preference Training' (ACL 2025), 'CommonIT: Commonality-Aware Instruction Tuning for Large Language Models via Data Partitions' (EMNLP 2024), etc.

Research Experience

Involved in multiple research projects, published numerous papers in areas such as multimodal learning, knowledge distillation, and image-text retrieval.

Education

Third-year Ph.D. candidate at the School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen), co-advised by Min Zhang and Xuebo Liu.

Background

Research interests include NLP, large language models (LLMs), and knowledge distillation (KD). Long-term research goal is to build socially intelligent embodied agents with the ability to perceive and engage in multimodal human communication. Research focuses on: 1) fundamentals of multimodal learning, specifically the representation, translation, fusion, and alignment of heterogeneous data sources; 2) human-centered language and vision and their applications; 3) real-world deployment efficiency of LLMs.

Miscellany