Zinuo Li
Scholar

Zinuo Li

Google Scholar ID: ibTrzq8AAAAJ
University of Western Australia
MLLMMultimodalVideo Understanding
Citations & Impact
All-time
Citations
207
 
H-index
6
 
i10-index
6
 
Publications
9
 
Co-authors
0
 
Resume (English only)
Academic Achievements
  • 1. NeurIPS 2025: Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM
  • 2. ICCV 2023: High-resolution Document Shadow Removal via A Large-scale Real-world Dataset and A Frequency-aware Shadow Erasing Net
  • 3. IJCAI 2023: A Large-scale Film Style Dataset for Learning Multi-frequency Driven Film Enhancement
  • 4. AAAI 2024: Devignet: High-Resolution Vignetting Removal via a Dual Aggregated Fusion Transformer With Adaptive Channel Expansion
Research Experience
  • PhD in Computer Science, University of Western Australia. 2024-Current. Research Assistant, University of Macau. 2022-2024. Research Assistant, Chinese Academy of Sciences, SIAT Shenzhen. 2022-2024.
Education
  • Currently a second-year Ph.D. student in Computer Science at the University of Western Australia (UWA), advised by Prof. Mohammed Bennamoun and Prof. Farid Boussaid, and jointly advised by Dr. Qiuhong Ke at Monash University.
Background
  • Research interests include Video Understanding, Multimodal Large Language Models (MLLMs), Agentic RL, Visual Reasoning, etc. Loves anime and is also looking for ACG-related topics.
Miscellany
  • Loves anime and is interested in ACG-related topics.
Co-authors
0 total
Co-authors: 0 (list not available)