AgoraResearch hub
ExploreLibraryProfile
Account
Songjun Tu
Scholar

Songjun Tu

Google Scholar ID: _5Ir0soAAAAJ
Institute of Automation, Chinese Academy of Sciences; Pengcheng Laboratory
Large Language ModelsReinforecement Learning
Google Scholar↗
Citations & Impact
All-time
Citations
56
 
H-index
5
 
i10-index
2
 
Publications
11
 
Co-authors
2
list available
Contact
No contact links provided.
Publications
12 items
One LR Doesn't Fit All: Heavy-Tail Guided Layerwise Learning Rates for LLMs
2026
Cited
0
STRIDE: A Self-Reflective Agent Framework for Reliable Automatic Equation Discovery
2026
Cited
0
AutoSearch: Adaptive Search Depth for Efficient Agentic RAG via Reinforcement Learning
2026
Cited
0
$π$-Play: Multi-Agent Self-Play via Privileged Self-Distillation without External Data
2026
Cited
0
Saliency-Guided Representation with Consistency Policy Learning for Visual Unsupervised Reinforcement Learning
2026
Cited
0
Dynamic Dual-Granularity Skill Bank for Agentic RL
2026
Cited
0
Perception-Consistency Multimodal Large Language Models Reasoning via Caption-Regularized Policy Optimization
2025
Cited
0
SRFT: A Single-Stage Method with Supervised and Reinforcement Fine-Tuning for Reasoning
2025
Cited
0
Resume (English only)
Co-authors
2 total
Qichao Zhang
Qichao Zhang
中国科学院自动化研究所
Dongbin Zhao
Dongbin Zhao
Institute of Automation, Chinese Academy of Sciences

Welcome back

Sign in to Agora

Welcome back! Please sign in to continue.

Do not have an account?