Square Superpixel Generation and Representation Learning via Granular Ball Computing

πŸ“… 2026-03-31
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This work addresses the limitations of existing superpixel methods, which produce irregular regions that are misaligned with regular operators such as convolutions, thereby hindering parallel computation and end-to-end deep learning. To overcome this, the study introduces granular ball computing into superpixel generation for the first time, proposing a structured superpixel representation based on multi-scale square blocks. By evaluating pixel intensity similarity to compute purity scores, the method adaptively selects high-quality square blocks for image coverage. This formulation inherently supports efficient parallel processing and integrates seamlessly into graph neural networks (GNNs) or Vision Transformers (ViTs) for end-to-end training. Experiments across multiple downstream vision tasks demonstrate that the proposed square superpixels significantly enhance performance, validating their advantages in both structured representation and computational efficiency.
πŸ“ Abstract
Superpixels provide a compact region-based representation that preserves object boundaries and local structures, and have therefore been widely used in a variety of vision tasks to reduce computational cost. However, most existing superpixel algorithms produce irregularly shaped regions, which are not well aligned with regular operators such as convolutions. Consequently, superpixels are often treated as an offline preprocessing step, limiting parallel implementation and hindering end-to-end optimization within deep learning pipelines. Motivated by the adaptive representation and coverage property of granular-ball computing, we develop a square superpixel generation approach. Specifically, we approximate superpixels using multi-scale square blocks to avoid the computational and implementation difficulties induced by irregular shapes, enabling efficient parallel processing and learnable feature extraction. For each block, a purity score is computed based on pixel-intensity similarity, and high-quality blocks are selected accordingly. The resulting square superpixels can be readily integrated as graph nodes in graph neural networks (GNNs) or as tokens in Vision Transformers (ViTs), facilitating multi-scale information aggregation and structured visual representation. Experimental results on downstream tasks demonstrate consistent performance improvements, validating the effectiveness of the proposed method.
Problem

Research questions and friction points this paper is trying to address.

superpixel
irregular shape
end-to-end optimization
parallel implementation
deep learning
Innovation

Methods, ideas, or system contributions that make the work stand out.

square superpixel
granular ball computing
multi-scale representation
vision transformer
graph neural network
πŸ”Ž Similar Papers
No similar papers found.
Shuyin Xia
Shuyin Xia
Professor, School of Computer Science, Chongqing University of Posts and Telecommunications
Granular computingClusteringRough setsClassifiersGranular ball computing
M
Meng Yang
Chongqing University of Posts and Telecommunications, Chongqing, China
Dawei Dai
Dawei Dai
Chongqing University of Posts and Telecommunications
Deep Learning
F
Fan Chen
Chongqing University of Posts and Telecommunications, Chongqing, China
Shilin Zhao
Shilin Zhao
Vanderbilt University Medical Center
Bioinformatics
J
Junwei Han
Chongqing University of Posts and Telecommunications, Chongqing, China
X
Xinbo Gao
Xidian University, Xi’an, China
Guoyin Wang
Guoyin Wang
Chongqing University of Posts & Telecommunications
Artificial Intelligencerough setsdata miningknowledge technology
W
Wen Lu
Xidian University, Xi’an, China