- Bingrui Li, Wei Huang, Andi Han, Zhanpeng Zhou, Taiji Suzuki, Jun Zhu, Jianfei Chen: On the Optimization and Generalization of Two-layer Transformers with Sign Gradient Descent. ICLR 2025.
- Juno Kim, Taiji Suzuki: Transformers Provably Solve Parity Efficiently with Chain of Thought. ICLR 2025. Oral presentation (1.82% of all submissions; 213/11672).
- Juno Kim, Dimitri Meunier, Arthur Gretton, Taiji Suzuki, Zhu Li: Optimality and Adaptivity of Deep Neural Features for Instrumental Variable Regression. ICLR 2025.
- 2009-2013: Assistant Professor (or Research Associate, with no students) at The University of Tokyo
- 2013-2017: Associate Professor at Tokyo Institute of Technology
- 2017-Present: Associate Professor at The University of Tokyo
- 2014-2018: Sakigake (PRESTO), JST
Education
- Bachelor of Engineering from Mathematical Engineering Course, Department of Mathematical Engineering and Information Physics, Faculty of Engineering, The University of Tokyo, 2004.
- Master of Information Science and Technology from Department of Mathematical Informatics, Graduate School of Information Science and Technology, The University of Tokyo, 2006. (supervisor: Professor Kazuyuki Aihara)
- Ph.D. of Information Science and Technology from Department of Mathematical Informatics, Graduate School of Information Science and Technology, The University of Tokyo, 2009. (thesis advisor: Associate Professor Fumiyasu Komaki)
Background
Research interests include machine learning and statistics, particularly statistical learning theory, deep learning, kernel methods, nonparametric convergence analysis, optimization, stochastic optimization, optimization for deep learning, information geometry, prior selection, and objective Bayes.