TwistNet-2D: Learning Second-Order Channel Interactions via Spiral Twisting for Texture Recognition

📅 2026-02-06
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge that existing texture recognition methods struggle to simultaneously preserve spatial structure and explicitly model second-order interactions among local channels. To this end, we propose TwistNet-2D, a lightweight module featuring a novel spiral-twist channel interaction mechanism. By computing pairwise channel products through four-directional spiral displacements, our method explicitly encodes cross-positional feature co-occurrence and interaction patterns. Integrated with learnable channel reweighting and a Sigmoid-gated residual connection, TwistNet-2D can be seamlessly embedded into backbone architectures such as ResNet. Despite introducing only a 3.5% increase in parameters and 2% additional FLOPs, our approach outperforms larger models—including ConvNeXt and Swin Transformer—on four benchmark datasets for texture and fine-grained visual recognition.

Technology Category

Application Category

📝 Abstract
Second-order feature statistics are central to texture recognition, yet current methods face a fundamental tension: bilinear pooling and Gram matrices capture global channel correlations but collapse spatial structure, while self-attention models spatial context through weighted aggregation rather than explicit pairwise feature interactions. We introduce TwistNet-2D, a lightweight module that computes \emph{local} pairwise channel products under directional spatial displacement, jointly encoding where features co-occur and how they interact. The core component, Spiral-Twisted Channel Interaction (STCI), shifts one feature map along a prescribed direction before element-wise channel multiplication, thereby capturing the cross-position co-occurrence patterns characteristic of structured and periodic textures. Aggregating four directional heads with learned channel reweighting and injecting the result through a sigmoid-gated residual path, \TwistNet incurs only 3.5% additional parameters and 2% additional FLOPs over ResNet-18, yet consistently surpasses both parameter-matched and substantially larger baselines -- including ConvNeXt, Swin Transformer, and hybrid CNN--Transformer architectures -- across four texture and fine-grained recognition benchmarks.
Problem

Research questions and friction points this paper is trying to address.

texture recognition
second-order statistics
channel interactions
spatial structure
feature co-occurrence
Innovation

Methods, ideas, or system contributions that make the work stand out.

second-order statistics
channel interaction
spatial displacement
texture recognition
lightweight module
🔎 Similar Papers
No similar papers found.
J
Junbo Jacob Lian
Northwestern University
F
Feng Xiong
Northwestern University
Y
Yujun Sun
Northwestern University
Kaichen Ouyang
Kaichen Ouyang
University of Science and Technology of China
Artificial IntelligenceCognitive ScienceComplex Systems
M
Mingyang Yu
Nankai University
Shengwei Fu
Shengwei Fu
Guizhou University
Rui Zhong
Rui Zhong
Hokkaido University, Information Initiative Center, Specially Appointed Assistant Professor
Computational IntelligenceEvolutionary ComputationLarge-scale OptimizationMeta/Hyper-heuristics
Y
Yujun Zhang
Jingchu University of Technology
H
Huiling Chen
Wenzhou University