Spatial frequency information fusion network for few-shot learning

📅 2025-06-23
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address overfitting and poor generalization caused by data scarcity in few-shot image classification, this paper proposes SFIFNet—a novel network that explicitly fuses frequency-domain and spatial-domain information during data preprocessing for the first time. Methodologically, SFIFNet leverages frequency transforms (e.g., DCT or FFT) to extract global texture and structural priors, integrates them with multi-scale spatial features via a lightweight deep neural architecture, and further enhances robustness through conventional data augmentation. Its key contribution lies in breaking the prevailing reliance on spatial-domain representations alone, systematically exploiting discriminative and complementary features encoded in the frequency domain. Extensive experiments on standard few-shot benchmarks—including Mini-ImageNet and CUB—demonstrate that SFIFNet achieves significant improvements in classification accuracy (average gain of +2.3%) and superior cross-domain generalization capability.

Technology Category

Application Category

📝 Abstract
The objective of Few-shot learning is to fully leverage the limited data resources for exploring the latent correlations within the data by applying algorithms and training a model with outstanding performance that can adequately meet the demands of practical applications. In practical applications, the number of images in each category is usually less than that in traditional deep learning, which can lead to over-fitting and poor generalization performance. Currently, many Few-shot classification models pay more attention to spatial domain information while neglecting frequency domain information, which contains more feature information. Ignoring frequency domain information will prevent the model from fully exploiting feature information, which would effect the classification performance. Based on conventional data augmentation, this paper proposes an SFIFNet with innovative data preprocessing. The key of this method is enhancing the accuracy of image feature representation by integrating frequency domain information with spatial domain information. The experimental results demonstrate the effectiveness of this method in enhancing classification performance.
Problem

Research questions and friction points this paper is trying to address.

Improves few-shot learning by fusing spatial and frequency domain information
Addresses over-fitting and poor generalization with limited data
Enhances image feature representation accuracy for better classification
Innovation

Methods, ideas, or system contributions that make the work stand out.

Integrates frequency and spatial domain information
Proposes SFIFNet with innovative preprocessing
Enhances image feature representation accuracy
🔎 Similar Papers
No similar papers found.
W
Wenqing Zhao
School of Electronic Information and Artificial Intelligence, Shaanxi University of Science and Technology, Xi’an 710000, China
G
Guojia Xie
School of Electronic Information and Artificial Intelligence, Shaanxi University of Science and Technology, Xi’an 710000, China
H
Han Pan
Society of Entrepreneurs and Ecology (SEE) Foundation, Beijing 100020, China
Biao Yang
Biao Yang
Shanghai Jiao Tong University, Antai College of Economics and Management
Asset PricingClimate Finance
Weichuan Zhang
Weichuan Zhang
Full Professor, Shaanxi University of Science & Technology
Image ProcessingImage AnalysisPattern RecognitionComputer Vision