NeurIPS 2025: DINGO: Constrained Inference for Diffusion LLMs (co-first author) – introduced the first dynamic-programming-based decoding strategy for diffusion LLMs that provably satisfies regex constraints
ICML 2025: CRANE: Reasoning with Constrained LLM Generation (co-first author) – theoretically analyzed and mitigated reasoning degradation under output constraints, boosting accuracy by up to 10% on symbolic reasoning tasks
ICLR 2025: IterGen: Iterative Semantic-aware Structured LLM Generation with Backtracking – developed a library enabling grammar-symbol-level backtracking to repair semantic violations, improving SQL accuracy by 18% and preventing privacy leakage
TMLR 2025: SynCode: LLM Generation with Grammar Augmentation – proposed a sound and complete grammar-guided LLM generation framework, reducing syntax errors by 96–100% and accelerating inference by 1.5–10×
TMLR 2025: Two-Step Offline Preference-Based Reinforcement Learning with Constrained Actions – co-developed the PRC method, which constrains the action space to improve stability in offline preference-based RL
ARLET @ NeurIPS 2025: Learning a Pessimistic Reward Model in RLHF – contributed to PET, a pessimistic reward-model fine-tuning approach that is robust to reward hacking in offline RLHF