Scholar

Yuzhi Zhao

Google Scholar ID: OtoqVTIAAAAJ

Ph.D., City University of Hong Kong; B.Eng., Huazhong University of Science and Technology

Low-level VisionComputational PhotographyLLMMLLM

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

1,463

H-index

i10-index

Publications

Co-authors

list available

Contact

Emailyzzhao2-c@my.cityu.edu.hk CVOpen ↗GitHubOpen ↗

Publications

14 items

SoftSkill: Behavioral Compression for Contextual Adaptation

2026

Cited

Unified Context Evolution for LLM Agents

2026

Cited

Skill-Conditioned Gated Self-Distillation for LLM Reasoning

2026

Cited

Self-Induced Outcome Potential: Turn-Level Credit Assignment for Agents without Verifiers

2026

Cited

Adaptive Rollout Allocation for Online Reinforcement Learning with Verifiable Rewards

2026

Cited

Optimizing Agentic Reasoning with Retrieval via Synthetic Semantic Information Gain Reward

2026

Cited

VP-Bench: A Comprehensive Benchmark for Visual Prompting in Multimodal Large Language Models

2025

Cited

From Exploration to Exploitation: A Two-Stage Entropy RLVR Approach for Noise-Tolerant MLLM Training

2025

Cited

Resume (English only)

Academic Achievements

Selected publications include:
- KG-RAG: Enhancing App Decision-Making via Knowledge Graph-Driven Retrieval-Augmented Generation (EMNLP, 2025)
- Revealing Biased Personality in MLLM: A Study on Personalized Image Aesthetic Assessment (EMNLP, 2025)
- SEFE: Superficial and Essential Forgetting Eliminator for Multimodal Continual Instruction Tuning (ICML, 2025)
- ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation (AAAI, 2025)
- LLaVA-SpaceSGG: Visual Instruct Tuning for Open-vocabulary Scene Graph Generation with Enhanced Spatial Relations (WACV, 2025)
- Modeling Dual-Exposure Quad-Bayer Patterns for Joint Denoising and Deblurring (IEEE Transactions on Image Processing, 2024)
- SVCNet: Real-time Scribble-based Video Colorization with Pyramid Networks (IEEE Transactions on Image Processing, 2023)
- HSGAN: Hyperspectral Reconstruction from RGB Images with Generative Adversarial Network (IEEE Transactions on Neural Networks and Learning Systems, 2023)
- D2HNet: Joint Denoising and Deblurring with Hierarchical Network for Robust Night Image Restoration (ECCV, 2022)

Research Experience

Currently a researcher at 2012 Labs, Huawei Hong Kong Research Center, working on MLLM and AI Agent projects. The team develops multimodal content moderation system and GUI Test Agent, with research results successfully applied in Huawei’s product lines. Former student researcher at AI Imaging Group, SenseTime, working on computational photography research and projects, developed two joint deblurring and denoising systems (for RGB images and RAW images). Former student researcher at Lightspeed and Quantum Studios, Tencent IEG, working on AIGC projects (e.g., stable diffusion).

Education

Received the B.Eng. degree from School of Electronic and Information Engineering (Qiming College), Huazhong University of Science and Technology in June 2018; Ph.D. degree from Department of Electronic Engineering, City University of Hong Kong in February 2023.

Background

Broad interests in AI applications, including low-level vision and computational photography, generative models (e.g., GAN and diffusion model). Recently focuses on applications of Multimodal Large Language Model (MLLM), e.g., AI Agent.

Miscellany