1. NeurIPS 2025: Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM
2. ICCV 2023: High-resolution Document Shadow Removal via A Large-scale Real-world Dataset and A Frequency-aware Shadow Erasing Net
3. IJCAI 2023: A Large-scale Film Style Dataset for Learning Multi-frequency Driven Film Enhancement
4. AAAI 2024: Devignet: High-Resolution Vignetting Removal via a Dual Aggregated Fusion Transformer With Adaptive Channel Expansion
Research Experience
PhD in Computer Science, University of Western Australia. 2024-Current. Research Assistant, University of Macau. 2022-2024. Research Assistant, Chinese Academy of Sciences, SIAT Shenzhen. 2022-2024.
Education
Currently a second-year Ph.D. student in Computer Science at the University of Western Australia (UWA), advised by Prof. Mohammed Bennamoun and Prof. Farid Boussaid, and jointly advised by Dr. Qiuhong Ke at Monash University.
Background
Research interests include Video Understanding, Multimodal Large Language Models (MLLMs), Agentic RL, Visual Reasoning, etc. Loves anime and is also looking for ACG-related topics.
Miscellany
Loves anime and is interested in ACG-related topics.