Yuxuan Zhu

Google Scholar ID: VxMpvw0AAAAJ

PhD student, University of Illinois Urbana-Champaign

Data systems, AI evaluation

Citations & Impact

Citations (all-time): 138

Contact

Email: yxx404@g.illinois.edu

Publications

26 items

MM-OptBench: A Solver-Grounded Benchmark for Multimodal Optimization Modeling (2026)
MeasHalu: Mitigation of Scientific Measurement Hallucinations for Large Language Models with Enhanced Reasoning (2026)
ZoomR: Memory Efficient Reasoning through Multi-Granularity Key Value Retrieval (2026)
ReViSQL: Achieving Human-Level Text-to-SQL (2026)
Accelerating Approximate Analytical Join Queries over Unstructured Data with Statistical Guarantees (2026)
Noisy Data is Destructive to Reinforcement Learning with Verifiable Rewards (2026)
ManCAR: Manifold-Constrained Latent Reasoning with Adaptive Test-Time Computation for Sequential Recommendation (2026)
Rethinking Generative Recommender Tokenizer: Recsys-Native Encoding and Semantic Quantization Beyond LLMs (2026)

Resume (English only)

Academic Achievements

Accelerate Aggregation Queries with JOINs over Unstructured Data (In submission)
Breaking Barriers: Do Reinforcement Post Training Gains Transfer To Unseen Domains? (Preprint)
Teams of LLM Agents can Exploit Zero-Day Vulnerabilities (Preprint)
Establishing Best Practices for Building Rigorous Agentic Benchmarks (NeurIPS 2025, first place at Berkeley AgentX Summit (Benchmark & Evaluation Track))
ELT-Bench: An End-to-End Benchmark for Evaluating AI Agents on ELT Pipelines (VLDB 2026)
UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench (ACL main 2025)
CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities (ICML 2025 Spotlight, SafeBench winner, adopted by US AISI, second place at Berkeley AgentX Summit (AI Safety & Alignment Track))
PilotDB: Database-Agnostic Online Approximate Query Processing with A Priori Error Guarantees (SIGMOD 2025)
Efficient Approximate Query Processing with Block Sampling (CIDR 2025)
FedTrans: Efficient Federated Learning via Multi-Model Transformation (MLSys 2024)
SlabCity: Whole-Query Optimization using Program Synthesis (VLDB 2023)
An Energy-efficient Computing Offloading Framework for Blockchain-enabled Video Streaming Systems (GLOBECOM 2022)
Sharding for Blockchain-based Mobile Edge Computing System: A Deep Reinforcement Learning Approach (GLOBECOM 2021)

Background

Research interests: data + AI/ML. Research focus: developing statistically grounded approaches that enable efficient data analytics, rigorous AI evaluation, and AI for safety.

Co-authors

7 total