Scholar

Aviral Kumar

Google Scholar ID: zBUwaGkAAAAJ

Carnegie Mellon University

AIReinforcement Learning

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

25,605

H-index

50

i10-index

70

Publications

20

Co-authors

8

list available

Contact

No contact links provided.

Publications

33 items

Addressing Over-Refusal in LLMs with Competing Rewards

2026

Cited

0

ExpRL: Exploratory RL for LLM Mid-Training

2026

Cited

0

AsyncWebRL: Efficient Multi-Step RL for Visual Web Agents

2026

Cited

0

Recursive Agent Optimization

2026

Cited

0

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

2026

Cited

0

IsoCompute Playbook: Optimally Scaling Sampling Compute for LLM RL

2026

Cited

0

What Does Flow Matching Bring To TD Learning?

2026

Cited

0

BPP: Long-Context Robot Imitation Learning by Focusing on Key History Frames

2026

Cited

0

Resume (English only)

Co-authors

8 total

UC Berkeley, Physical Intelligence

Google DeepMind

Stanford University, Physical Intelligence

Rishabh Agarwal

Meta, ex DeepMind, Google Brain

Stanford University

Google DeepMind

Google DeepMind