Yushen Zuo

Google Scholar ID: C2CDJOoAAAAJ
The Hong Kong Polytechnic University
Computer vision, Deep learning, Image Generation
Citations & Impact
All-time
Citations: 279
H-index: 8
i10-index: 8
Publications: 16
Co-authors: 12 (list available)
Resume (English only)
Academic Achievements
  • Paper '4KAgent: Agentic Any Image to 4K Super-Resolution' accepted by NeurIPS 2025
  • Paper 'Safeguarding Vision-Language Models: Mitigating Vulnerabilities to Gaussian Noise in Perturbation-based Attacks' accepted by ICCV 2025 (equal contribution)
  • Paper 'See In Detail: Enhancing Sparse-view 3D Gaussian Splatting with Local Depth and Semantic Regularization' accepted by ICASSP 2025
  • Paper 'Towards Multi-View Consistent Style Transfer with One-Step Diffusion via Vision Conditioning' accepted by AI4VA Workshop at ECCV 2024
  • 1st place in NTIRE 2025 Challenge on Short-form UGC Image Super-Resolution (4x) at CVPR 2025
  • 2nd place in AIM 2024 Challenge on Efficient Video Super-Resolution for AV1 Compressed Content at ECCV 2024; method FSMD featured in summary paper
  • Ranked 10/60 in NTIRE 2021 Challenge on Image Deblurring at CVPR 2021; method Visual Token Transformer included in summary paper
  • Paper 'Low-resolution palmprint image denoising by generative adversarial networks' published in Neurocomputing 2019
  • Received 'Stars-of-tomorrow' award from MSRA Internship Program in 2022
Research Experience
  • Jan 2025–present: Research Intern at Texas A&M University (TAMU), supervised by Prof. Zhengzhong Tu
  • Apr 2024–Jan 2025: Research Assistant at The Hong Kong Polytechnic University (PolyU), supervised by Prof. Kenneth K. M. Lam
  • Jul 2022 onward: Applied Scientist at Microsoft, working on recommendation systems and LLM applications in Bing
  • Jul 2021: Research Intern at Microsoft Research Asia (MSRA), Multi-Modal Interaction (MMI) Group (led by Dr. Qiang Huo), collaborating with Azure OCR team on multi-directional table detection in PDF images
  • Interned at Tencent Youtu Lab
Background
  • Research interests include image/video generation, vision-language models, low-level vision, agentic AI, object detection and segmentation, and 3D vision.
  • Currently a research intern in the TACO group at Texas A&M University (TAMU), advised by Prof. Zhengzhong Tu, collaborating closely with Renjie Li.
  • Previously a research assistant at The Hong Kong Polytechnic University (PolyU), advised by Prof. Kenneth K. M. Lam, collaborating with Jun Xiao.
  • Former Applied Scientist at Microsoft, focusing on recommendation systems and LLM applications in Bing.
  • Interned at Microsoft Research Asia (MSRA) and Tencent Youtu Lab.
  • Actively seeking PhD and research positions worldwide.