Kohei Uehara
Scholar

Kohei Uehara

Google Scholar ID: yZFVY5cAAAAJ
The University of Tokyo
Computer Vision
Citations & Impact
All-time
Citations
101
 
H-index
6
 
i10-index
4
 
Publications
15
 
Co-authors
3
list available
Resume (English only)
Academic Achievements
  • Publications: 'WanderGuide: Indoor Map-less Robotic Guide for Exploration by Blind People' (CHI 2025), 'Content-Specific Humorous Image Captioning Using Incongruity Resolution Chain-of-Thought' (NAACL Findings 2024), 'Learning by Asking Questions for Knowledge-Based Novel Object Recognition' (IJCV 2024), 'K-VQG: Knowledge-aware Visual Question Generation for Common-sense Acquisition' (WACV 2023), 'Learning to Ask Informative Sub-Questions for Visual Question Answering' (CVPR 2022 Workshop), 'ViNTER: Image Narrative Generation with Emotion-Arc-Aware Transformer' (WWW 2022 Workshop), 'Unsupervised Keyword Extraction for Full-sentence VQA' (NLPBT2020), 'Interactive Video Retrieval with Dialog' (CVPR 2020 Workshop). Projects: Asagi - Japanese Vision&Language Model.
Research Experience
  • April 2023 - March 2025: Assistant Professor, Machine Intelligence Lab., Research Center for Advanced Science and Technology (RCAST), The University of Tokyo; June 2023 - March 2025: Part-Time Researcher, Accessibility Lab., Miraikan (The National Museum of Emerging Science and Innovation); July 2023 - March 2025: Visiting Researcher, Machine Intelligence for Medical Engineering Team, RIKEN; April 2021 - July 2021: NVIDIA, Research Internship; February 2019 - April 2019: LINE Corporation, Machine Learning Engineer, Part time job; August 2018: Mercari, Inc. Machine Learning Engineer Internship.
Education
  • April 2020 - March 2023: Ph.D. in Information Science and Technology, The University of Tokyo, Advisor: Prof. Tatsuya Harada; April 2018 - March 2020: Master’s student, Information Science and Technology, The University of Tokyo, Advisor: Prof. Tatsuya Harada; April 2014 - March 2018: Undergraduate student, Mechano-Informatics, The University of Tokyo, Advisor: Prof. Tatsuya Harada.
Background
  • Research interests: Machine learning across vision and language, Large Language Models (LLMs) and Vision-Language Models (VLMs), Accessibility, and Human-Computer Interaction (HCI).
Miscellany
  • Currently working as a research engineer at SB Intuitions Corp.