Research Interests: Reward Modeling, Reinforcement Learning & Imitation Learning, Learning with Weak Supervision. Biography: Currently a postdoctoral researcher in the Imperfect Information Learning Team at RIKEN Center for Advanced Intelligence Project (AIP), led by Professor Masashi Sugiyama.
Miscellany
Recent activities include attending workshops between the University of Melbourne and RIKEN-AIP in Melbourne (July 3–4, 2025) and the University of Sydney and RIKEN-AIP in Sydney (July 7–8, 2025).