Scholar
Songjun Tu
Google Scholar ID: _5Ir0soAAAAJ
Institute of Automation, Chinese Academy of Sciences; Pengcheng Laboratory
Large Language Models
Reinforecement Learning
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
56
H-index
5
i10-index
2
Publications
11
Co-authors
2
list available
Contact
No contact links provided.
Publications
12 items
One LR Doesn't Fit All: Heavy-Tail Guided Layerwise Learning Rates for LLMs
2026
Cited
0
STRIDE: A Self-Reflective Agent Framework for Reliable Automatic Equation Discovery
2026
Cited
0
AutoSearch: Adaptive Search Depth for Efficient Agentic RAG via Reinforcement Learning
2026
Cited
0
$π$-Play: Multi-Agent Self-Play via Privileged Self-Distillation without External Data
2026
Cited
0
Saliency-Guided Representation with Consistency Policy Learning for Visual Unsupervised Reinforcement Learning
2026
Cited
0
Dynamic Dual-Granularity Skill Bank for Agentic RL
2026
Cited
0
Perception-Consistency Multimodal Large Language Models Reasoning via Caption-Regularized Policy Optimization
2025
Cited
0
SRFT: A Single-Stage Method with Supervised and Reinforcement Fine-Tuning for Reasoning
2025
Cited
0
Load more
Resume (English only)
Co-authors
2 total
Qichao Zhang
中国科学院自动化研究所
Dongbin Zhao
Institute of Automation, Chinese Academy of Sciences
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up