- ICLR 2025: Rethinking Evaluation of Sparse Autoencoders through the Representation of Polysemous Words
Education
First-year PhD student at The University of Tokyo, mentored by Professor Yutaka Matsuo.
Background
Research interest: mechanistic interpretability, with the aim of unraveling the internal mechanisms that drive today's AI systems and, ultimately, understanding what truly constitutes human intelligence.