Scholar
Jiajun Chai
Google Scholar ID: yDdfap0AAAAJ
Meituan Inc.
Reinforcement Learning
LLMs
Agentic Learning
Citations & Impact
All-time
Citations: 209
H-index: 6
i10-index: 5
Publications: 20
Co-authors: 5 (list available)
Contact
No contact links provided.
Publications
26 items
Joint Training of Multi-Token Prediction in Reinforcement Learning via Optimal Coefficient Calibration
2026
Cited
0
ZipRL: Adaptive Multi-Turn Context Compression with Hindsight Response Replay
2026
Cited
0
When Self-Belief Misleads: Active Label Acquisition for Reinforcement Learning with Verifiable Rewards
2026
Cited
0
AMR-SD: Asymmetric Meta-Reflective Self-Distillation for Token-Level Credit Assignment
2026
Cited
0
Implicit Hierarchical GRPO: Decoupling Tool Invocation from Execution for Tool-Integrated Mathematical Reasoning
2026
Cited
0
RecRM-Bench: Benchmarking Multidimensional Reward Modeling for Agentic Recommender Systems
2026
Cited
0
ResRL: Boosting LLM Reasoning via Negative Sample Projection Residual Reinforcement Learning
2026
Cited
0
AutoSearch: Adaptive Search Depth for Efficient Agentic RAG via Reinforcement Learning
2026
Cited
0
Co-authors
5 total
Yuanheng Zhu
Institute of Automation, Chinese Academy of Sciences
Dongbin Zhao
Institute of Automation, Chinese Academy of Sciences
Yuqian Fu
Institute of Automation, Chinese Academy of Sciences
Guojun Yin
Meituan, University of Science and Technology of China
Mingrui Yu (于铭瑞)
PhD student, Department of Automation, Tsinghua University