Gouki Minegishi

Google Scholar ID: sxGpoYMAAAAJ

University of Tokyo

Deep Learning, Interpretability

Contact

Email: minegishi@weblab.t.u-tokyo.ac.jp

Publications

15 items

Visual Access Boundaries in Vision-Language Model Reasoning

2026

On Advantage Estimates for Max@K Policy Gradients

2026

Zipping the Thought: When and How Compressed Reasoning Data Works in LLM Post-Training

2026

Safe Transformer: An Explicit Safety Bit For Interpretable And Controllable Alignment

2026

LIT-RAGBench: Benchmarking Generator Capabilities of Large Language Models in Retrieval-Augmented Generation

2026

Steering at the Source: Style Modulation Heads for Robust Persona Control

2026

Emergent Analogical Reasoning in Transformers

2026

Interpreting Multi-Attribute Confounding through Numerical Attributes in Large Language Models

2025

Resume (English only)

Academic Achievements

- NeurIPS 2025: Topology of Reasoning: Understanding Large Reasoning Models through Reasoning Graph Properties
- ICML 2025: Beyond Induction Heads: In-Context Meta Learning Induces Multi-Phase Circuit Emergence
- ICLR 2025: Rethinking Evaluation of Sparse Autoencoders through the Representation of Polysemous Words

Education

First-year PhD student at The University of Tokyo, mentored by Professor Yutaka Matsuo.

Background

Research interest: mechanistic interpretability, aiming to unravel the internal mechanisms that drive today's AI systems, with the ultimate goal of understanding what truly constitutes human intelligence.
