Scholar
Qiaozhi He
Google Scholar ID: 4bkFuLsAAAAJ
ByteDance
LLM
Natural Language Processing
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
31
H-index
3
i10-index
1
Publications
12
Co-authors
0
Contact
No contact links provided.
Publications
11 items
MSRL: Scaling Generative Multimodal Reward Modeling via Multi-Stage Reinforcement Learning
2026
Cited
0
DaPT: A Dual-Path Framework for Multilingual Multi-hop Question Answering
2026
Cited
0
When Scaling Fails: Mitigating Audio Perception Decay of LALMs via Multi-Step Perception-Aware Reasoning
2026
Cited
0
APR: Penalizing Structural Redundancy in Large Reasoning Models via Anchor-based Process Rewards
2026
Cited
0
SERM: Self-Evolving Relevance Model with Agent-Driven Learning from Massive Query Streams
2026
Cited
0
Probing Preference Representations: A Multi-Dimensional Evaluation and Analysis Method for Reward Models
2025
Cited
0
GRAM: A Generative Foundation Reward Model for Reward Generalization
2025
Cited
0
StickMotion: Generating 3D Human Motions by Drawing a Stickman
2025
Cited
0
Load more
Resume (English only)
Co-authors
0 total
Co-authors: 0 (list not available)
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up