Scholar
Jiazheng Zhang
Google Scholar ID: qw4vwfAAAAAJ
Fudan University
Large Language Model
Natural Language Processing
Data Mining
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
24
H-index
3
i10-index
1
Publications
10
Co-authors
7
list available
Contact
No contact links provided.
Publications
13 items
Can RL Improve Generalization of LLM Agents? An Empirical Study
2026
Cited
0
SciAgentGym: Benchmarking Multi-Step Scientific Tool-use in LLM Agents
2026
Cited
0
DFPO: Scaling Value Modeling via Distributional Flow towards Robust and Generalizable LLM Post-Training
2026
Cited
0
DVPO: Distributional Value Modeling-based Policy Optimization for LLM Post-Training
2025
Cited
0
AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning
2025
Cited
0
Understanding and Mitigating Errors of LLM-Generated RTL Code
2025
Cited
0
VRPO: Rethinking Value Modeling for Robust RL Training under Noisy Supervision
2025
Cited
0
Mitigating Attention Hacking in Preference-Based Reward Modeling via Interaction Distillation
2025
Cited
0
Load more
Resume (English only)
Co-authors
7 total
Co-author 1
Zhiheng Xi
Fudan University
Tao Gui (桂韬)
复旦大学
Huang Xuanjing (黄萱菁)
Professor of Computer Science, Fudan University
Shihan Dou
Fudan University
Qi Zhang (张奇)
Professor of Computer Science, Fudan University
Xipeng Qiu(邱锡鹏)
Professor of Computer Science, Fudan University
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up