Scholar

Taiyi Wang

Google Scholar ID: lkVmQH8AAAAJ

Computer Lab, University of Cambridge

Machine LearningReinforcement LearningDatabaseMachine Learning System

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

133

H-index

i10-index

Publications

Co-authors

list available

Contact

EmailTaiyi.Wang@cl.cam.ac.uk CVOpen ↗GitHubOpen ↗LinkedInOpen ↗

Publications

10 items

EDIT: Evidence-Diagnosed Intervention Training for Rule-Faithful LLM Grading

2026

Cited

Autopoiesis: A Self-Evolving System Paradigm for LLM Serving Under Runtime Dynamics

2026

Cited

A Subgoal-driven Framework for Improving Long-Horizon LLM Agents

2026

Cited

OServe: Accelerating LLM Serving via Spatial-Temporal Workload Orchestration

2026

Cited

AutoIndexer: A Reinforcement Learning-Enhanced Index Advisor Towards Scaling Workloads

2025

Cited

ThunderServe: High-performance and Cost-efficient LLM Serving in Cloud Environments

2025

Cited

A New Paradigm in Tuning Learned Indexes: A Reinforcement Learning Enhanced Approach

2025

Cited

OCMDP: Observation-Constrained Markov Decision Process

arXiv.org · 2024

Cited

Resume (English only)

Academic Achievements

Publications: 1. OCMDP: Observation-Constrained Markov Decision Process, IJCNN 2025; 2. ThunderServe: High-performance and Cost-efficient LLM Serving in Cloud Environments, MLSys 2025; 3. A New Paradigm in Tuning Learned Indexes: A Reinforcement Learning-Enhanced Approach, SIGMOD 2025, Awarded by SIGMOD 2025 Student Grant; 4. DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents, ICLR 2025.

Research Experience

Currently a Research Scientist Intern at the Autonomous Agents Team, Google DeepMind, London, UK.

Education

PhD: Computer Science and Technology, University of Cambridge, Advisor: Dr. Eiko Yoneki; Master's: Engineering, Johns Hopkins University, Graduated as the top 1 student in the class; Bachelor's: Physics, Peking University, Minor in Math, Honored with the Excellent Graduate Student Award and Excellent Graduation Thesis Award.

Background

Research Interests: Reinforcement Learning, Machine Learning, Machine Learning Systems, and the enhancement of real systems (e.g., Database, LLM fine-tuning, LLM serving) through machine learning techniques, especially ML4Sys (especially RL4Sys). Brief Introduction: PhD Student at the Department of Computer Science and Technology, University of Cambridge, Co-Founder of Powersense.Ltd.

Miscellany