NTIRE 2025 Challenge on Cross-Domain Few-Shot Object Detection: Methods and Results

📅 2025-04-14
📈 Citations: 2
Influential: 1
📄 PDF
🤖 AI Summary
This work addresses Cross-Domain Few-Shot Object Detection (CD-FSOD), a challenging task requiring robust generalization from minimal target-domain annotations. We propose a unified framework integrating meta-learning, feature alignment, domain adaptation, and prompt-based fine-tuning, augmented with multi-scale feature distillation and dynamic pseudo-label optimization to enhance cross-domain generalization under extreme label scarcity. To foster community advancement, we launch the first international CD-FSOD Challenge, establishing an open/closed-track evaluation paradigm to standardize benchmarking. The challenge attracted 152 registered teams, with 42 submitting valid entries; all 13 finalist methods surpassed the baseline, achieving substantial mAP gains—e.g., on PASCAL→COCO transfers—thereby yielding multiple new state-of-the-art approaches. Our work provides both conceptual advances in few-shot cross-domain transfer and a rigorous, publicly accessible benchmark platform for future research.

Technology Category

Application Category

📝 Abstract
Cross-Domain Few-Shot Object Detection (CD-FSOD) poses significant challenges to existing object detection and few-shot detection models when applied across domains. In conjunction with NTIRE 2025, we organized the 1st CD-FSOD Challenge, aiming to advance the performance of current object detectors on entirely novel target domains with only limited labeled data. The challenge attracted 152 registered participants, received submissions from 42 teams, and concluded with 13 teams making valid final submissions. Participants approached the task from diverse perspectives, proposing novel models that achieved new state-of-the-art (SOTA) results under both open-source and closed-source settings. In this report, we present an overview of the 1st NTIRE 2025 CD-FSOD Challenge, highlighting the proposed solutions and summarizing the results submitted by the participants.
Problem

Research questions and friction points this paper is trying to address.

Advancing object detection in novel domains with limited data
Addressing challenges in cross-domain few-shot object detection
Evaluating new models for state-of-the-art performance across domains
Innovation

Methods, ideas, or system contributions that make the work stand out.

Cross-Domain Few-Shot Object Detection challenge
Novel models for state-of-the-art results
Limited labeled data in new domains
🔎 Similar Papers
No similar papers found.
Y
Yuqian Fu
Xingyu Qiu
Xingyu Qiu
Harbin Institute of Technology
Medical Image AnalysisGenerative AI
B
Bin Ren
Yanwei Fu
Yanwei Fu
Fudan University
Computer visionmachine learningMultimedia
Radu Timofte
Radu Timofte
Humboldt Professor for AI and Computer Vision, University of Würzburg
Computer VisionMachine LearningAICompressionComputational Photography
Nicu Sebe
Nicu Sebe
University of Trento
computer visionmultimedia
Ming-Hsuan Yang
Ming-Hsuan Yang
University of California at Merced; Google DeepMind
Computer VisionMachine LearningArtificial Intelligence
Luc Van Gool
Luc Van Gool
professor computer vision INSAIT Sofia University, em. KU Leuven, em. ETHZ, Toyota Lab TRACE
computer visionmachine learningAIautonomous carscultural heritage
K
Kaijin Zhang
Q
Qingpeng Nong
X
Xiugang Dong
Hong Gao
Hong Gao
Zhejiang Normal University
DatabaseInternet of Things
X
Xiangsheng Zhou
Jiancheng Pan
Jiancheng Pan
INSAIT, Sofia University "St. Kliment Ohridski"; R.A., THU; M.S., ZJUT
Multimodal LearningFoundation ModelsMLLMsData-centric AIAI4Earth
Yanxing Liu
Yanxing Liu
University of Chinese Academy of Sciences
Multimodal perceptionRemote Sensing Object DetectionFew-shot learning
X
Xiao He
J
Jiahao Li
Yuze Sun
Yuze Sun
Tsinghua University, doc
Deep LearningAI for Earth
Xiaomeng Huang
Xiaomeng Huang
Tsinghua University
Earth System ModelHPCBig DataAI
Z
Zhenyu Zhang
R
Ran Ma
Y
Yuhan Liu
Z
Zijian Zhuang
Shuai Yi
Shuai Yi
Yixiong Zou
Yixiong Zou
Huazhong University of Science and Technology
Computer visionDomain generalizationFew-shot learningVision-language model
Lingyi Hong
Lingyi Hong
Fudan University
Computer Vision
M
Mingxi Chen
R
Runze Li
X
Xingdong Sheng
W
Wenqiang Zhang
W
Weisen Chen
Y
Yongxin Yan
X
Xinguo Chen
Yuanjie Shao
Yuanjie Shao
School of Electronic Information and Communications, Huazhong University of Science and Technology
few-shot learning、low-level image processing
Z
Zhengrong Zuo
Nong Sang
Nong Sang
Huazhong University of Science and Technology
Computer Vision and Pattern Recognition
H
Hao Wu
H
Haoran Sun
Shuming Hu
Shuming Hu
Research Engineer, Meta
machine learningphysics
Y
Yan Zhang
Z
Zhiguang Shi
Y
Yu Zhang
C
Chao Chen
T
Tao Wang
D
Da Feng
Linhai Zhuo
Linhai Zhuo
Fuzhou University
Computer VisionFew-shot Learning
Z
Ziming Lin
Yali Huang
Yali Huang
J
Jie Me
Y
Yiming Yang
M
Mi Guo
M
Mingyuan Jiu
M
Mingliang Xu
M
Maomao Xiong
Q
Qunshu Zhang
X
Xinyu Cao
Y
Yuqing Yang
D
Dianmo Sheng
X
Xuanpu Zhao
Zhiyu Li
Zhiyu Li
Tianjin University
Robust controlattitude control
X
Xuyang Ding
W
Wenqian Li