Published at NeurIPS 2024: 'Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning'
Multiple papers published at ICML 2024, including 'Factored-Reward Bandits with Intermediate Observations', 'Best Arm Identification for Stochastic Rising Bandits', etc.
Published at AISTATS 2024: 'Autoregressive Bandits'
Published at ICML 2023: 'Dynamical Linear Bandits'
Multiple papers accepted at ICML 2025 and NeurIPS 2025 (to appear)