Preprint: 'Diffusion Model for Manifold Data: Score Decomposition, Curvature and Statistical Complexity' (manuscript available upon request)
Preprint: 'A Minimalist Example of Edge-of-Stability and Progressive Sharpening'
Preprint: 'COSMOS: A Hybrid Adaptive Optimizer for Memory-Efficient Training of LLMs'
Preprint: 'LLMs Can Generate a Better Answer by Aggregating Their Own Responses'
Publication: 'Nonparametric Classification on Low Dimensional Manifolds using Overparameterized Convolutional Residual Networks', NeurIPS 2024
Publication: 'Robust Reinforcement Learning from Corrupted Human Feedback', NeurIPS 2024
Publication: 'Effective Minkowski Dimension of Deep Nonparametric Regression: Function Approximation and Statistical Theories', ICML 2023
Publication: 'Sequential Information Design: Markov Persuasion Process and Its Efficient Reinforcement Learning', ACM Conference on Economics and Computation 2022