Wenlong Mou

Google Scholar ID: j-2RtWUAAAAJ

University of Toronto

machine learning, statistics, optimization, applied probability

Citations & Impact

All-time citations: 1,074

Publications

8 items

What should post-training optimize? A test-time scaling law perspective

2026

Provable imitation learning for control of instability in partially-observed Vlasov--Poisson equations

2026

Continuous-time reinforcement learning: ellipticity enables model-free value function approximation

2026

Predicting and improving test-time scaling laws via reward tail-guided search

2026

Reinforcement Learning with Action-Triggered Observations

2025

Is RL fine-tuning harder than regression? A PDE learning approach for diffusion models

2025

Federated learning over physical channels: adaptive algorithms with near-optimal guarantees

2025

Statistical guarantees for continuous-time policy evaluation: blessing of ellipticity and new tradeoffs

2025

Resume (English only)

Academic Achievements

Paper on continuous-time policy evaluation accepted to SIAM Journal on Mathematics of Data Science. New papers on: RL with action-triggered observations; RL fine-tuning of diffusion models with function approximation; federated learning with physical communication channels; statistical guarantees for continuous-time reinforcement learning; optimal interpolation between bootstrap and rollout methods in reinforcement learning; and debiasing general Z-estimators.

Research Experience

Assistant Professor at the Department of Statistical Sciences, University of Toronto.

Education

Ph.D. from the Department of EECS, UC Berkeley, advised by Prof. Martin Wainwright and Prof. Peter Bartlett; B.S. in Computer Science from Peking University, advised by Prof. Liwei Wang.

Background

Research Interests: Mathematics of machine learning in the era of large AI models, including post-training optimization of generative models, reinforcement learning fine-tuning and test-time adaptation, practical structures that enable efficient reinforcement learning with function approximation, RL in continuous-time diffusion processes, stochastic approximation for large-scale machine learning, and the incorporation of machine learning into causal and semiparametric estimation problems. On the applied side, he is interested in applications of machine learning to engineering problems in the physical world.

Miscellany