NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement: Methods and Results

📅 2025-04-17
📈 Citations: 1
Influential: 0
📄 PDF
🤖 AI Summary
Practical deployment of image quality assessment (IQA) and enhancement for short-form user-generated content (S-UGC) on platforms like Kwai and TikTok faces bottlenecks due to computational overhead and lack of representative benchmarks. Method: (1) We introduce KwaiSR—the first large-scale, S-UGC-oriented super-resolution dataset, comprising both synthetic and real-world short-video images; (2) we propose a novel, ensemble-free, low-parameter visual quality assessment (VQA) paradigm that eliminates reliance on redundant backbone networks typical in conventional IQA/VQA; (3) we integrate a lightweight neural network with a diffusion model to achieve end-to-end, quality-aware image super-resolution. Contribution/Results: Our framework advances the practical deployment of S-UGC quality assessment and enhancement, significantly improving perceptual quality and user experience. The code and KwaiSR dataset are publicly released, catalyzing a challenge with 266 participating teams; 18 submitted solutions empirically validate the effectiveness and generalizability of our approach.

Technology Category

Application Category

📝 Abstract
This paper presents a review for the NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement. The challenge comprises two tracks: (i) Efficient Video Quality Assessment (KVQ), and (ii) Diffusion-based Image Super-Resolution (KwaiSR). Track 1 aims to advance the development of lightweight and efficient video quality assessment (VQA) models, with an emphasis on eliminating reliance on model ensembles, redundant weights, and other computationally expensive components in the previous IQA/VQA competitions. Track 2 introduces a new short-form UGC dataset tailored for single image super-resolution, i.e., the KwaiSR dataset. It consists of 1,800 synthetically generated S-UGC image pairs and 1,900 real-world S-UGC images, which are split into training, validation, and test sets using a ratio of 8:1:1. The primary objective of the challenge is to drive research that benefits the user experience of short-form UGC platforms such as Kwai and TikTok. This challenge attracted 266 participants and received 18 valid final submissions with corresponding fact sheets, significantly contributing to the progress of short-form UGC VQA and image superresolution. The project is publicly available at https://github.com/lixinustc/KVQE- ChallengeCVPR-NTIRE2025.
Problem

Research questions and friction points this paper is trying to address.

Develop lightweight video quality assessment models
Enhance short-form UGC videos via super-resolution
Improve user experience on platforms like TikTok
Innovation

Methods, ideas, or system contributions that make the work stand out.

Lightweight efficient video quality assessment models
Diffusion-based super-resolution for UGC images
New dataset for short-form UGC enhancement
X
Xin Li
K
Kun Yuan
Fengbin Guan
Fengbin Guan
University of Science and Technology of China
Image/Video Quality Assessment VLM
Zihao Yu
Zihao Yu
University of Science and Technology of China
Yiting Lu
Yiting Lu
University of Science and Technology of China
VLM,Self-evolving Agent,Reasoning Model
W
Wei Luo
M
Ming Sun
C
Chao Zhou
Z
Zhibo Chen
R
R. Timofte
Bingchen Li
Bingchen Li
USTC
Y
Yizhen Shao
X
Xijun Wang
S
Suhang Yao
Y
Yabin Zhang
A
Ao-Xiang Zhang
T
Tianwu Zhi
J
Jianzhao Liu
Y
Yang Li
J
Jingwen Xu
Yiting Liao
Yiting Liao
Staff Research Scientist at Wireless Communications Research, Intel Labs
Video ProcessingVideo CommunicationsVideo Understanding
Yushen Zuo
Yushen Zuo
The Hong Kong Polytechnic University
Computer visionDeep learningImage Generation
Mingyang Wu
Mingyang Wu
Texas A&M University
Generative AI
R
Renjie Li
S
Shengyun Zhong
Zhengzhong Tu
Zhengzhong Tu
Texas A&M University, Google Research, University of Texas at Austin
Agentic AITrustworthy AIEmbodied AI
Yufan Liu
Yufan Liu
Institute of Automation, Chinese Academy of Sciences
Image/video processingKnowledge DistillationSaliency detectionModel compressionVideo coding
X
Xiangguang Chen
Z
Zuowei Cao
Minhao Tang
Minhao Tang
S
Shan Liu
Kexin Zhang
Kexin Zhang
Tsinghua University
Data MiningMachine Learning
J
Jingfen Xie
Y
Yan Wang
K
Kai Chen
S
Shijie Zhao
Yunchen Zhang
Yunchen Zhang
X
Xiangkai Xu
Hong Gao
Hong Gao
Zhejiang Normal University
DatabaseInternet of Things
Ji Shi
Ji Shi
Peking University
Computer Vision3D VisionNeural Rendering
Y
Yiming Bao
X
Xiugang Dong
X
Xiangsheng Zhou
Y
Yaofeng Tu
Ying Liang
Ying Liang
Y
Yiwen Wang
Xinning Chai
Xinning Chai
Shanghai Jiao Tong University
low-level vision
Y
Yuxuan Zhang
Zhengxue Cheng
Zhengxue Cheng
Assistant Researcher, Shanghai Jiao Tong University
Video and Image CodingComputer VisionImage Quality Assessment
Y
Yingsheng Qin
Y
Yucai Yang
R
Rong Xie
Li Song
Li Song
Professor of Electronic Engineering, Shanghai Jiao Tong University
Video CodingImage ProcessingComputer Vision
W
Wei Sun
K
Kang Fu
Linhan Cao
Linhan Cao
Shanghai Jiao Tong University
Image Quality Assessment Video Quality Assessment
D
Dandan Zhu
K
Kaiwei Zhang
Yucheng Zhu
Yucheng Zhu
Shanghai Jiaotong University
Multimedia Signal Processing
Z
Zicheng Zhang
Menghan Hu
Menghan Hu
East China Normal University
Signal processingMedical imagingHyperspectral imagingImage processingAgricultural
X
Xiongkuo Min
Guangtao Zhai
Guangtao Zhai
Professor, IEEE Fellow, Shanghai Jiao Tong University
Multimedia Signal ProcessingVisual Quality AssessmentQoEAI EvaluationDisplays
Zhi Jin
Zhi Jin
Sun Yat-Sen University, Associate Professor
J
Jiawei Wu
W
Wei Wang
W
Wenjian Zhang
Y
Yuhai Lan
G
Gaoxiong Yi
H
Hengyuan Na
W
Wang Luo
D
Di Wu
M
MingYin Bai
J
Jiawang Du
Z
Zilong Lu
Zhenyu Jiang
Zhenyu Jiang
Research, Amazon
Computer visionrobotics
H
Hui Zeng
Z
Ziguan Cui
Zongliang Gan
Zongliang Gan
G
Guijin Tang
X
Xinglin Xie
K
Kehuan Song
X
Xiaoqiang Lu
Licheng Jiao
Licheng Jiao
Distinguished Professor of Xidian University, IEEE Fellow
Neural NetworksComputational IntelligenceEvolutionary ComputationRemote SensingPattern Recognition.
F
Fang Liu
X
Xu Liu
P
Puhua Chen
H
Ha Thu Nguyen
K
Katrien De Moor
Seyed Ali Amirshahi
Seyed Ali Amirshahi
Norwegian University of Science and Technology
Image ProcessingComputer VisionImage Quality Assessmentvideo Quality AssessmentComputational
M
Mohamed-Chaker Larabi
Qi Tang
Qi Tang
Computational Science and Engineering, Georgia Institute of Technology
High Performance ComputingApplied MathematicsPlasma PhysicsScientific Machine Learning
L
Linfeng He
Z
Zhiyong Gao
Z
Zixuan Gao
G
Guohua Zhang
Z
Zhiye Huang
Yi Deng
Yi Deng
Q
Qingmiao Jiang
L
Lu Chen
Y
Yi Yang
X
Xi Liao
N
Nourine Mohammed Nadir
Y
Yuxuan Jiang
Q
Qiang Zhu
S
Siyue Teng
F
Fan Zhang
Shuyuan Zhu
Shuyuan Zhu
Associate Professor of University of Electronic Science and Technology of China
Signa ProcessingImage/Video Compression
Bing Zeng
Bing Zeng
University of Electronic Science and Technology of China
Image and video processing
David Bull
David Bull
Professor of Signal Processing, Director Bristol Vision Insitute, University of Bristol
imagevideosignal processingcommunicationsoptimisation
M
Mei-Fang Liu
Chao Yao
Chao Yao
Northwestern polytechnical university
Y
Yao Zhao