Pamba: Enhancing Global Interaction in Point Clouds via State Space Model

📅 2024-06-25
📈 Citations: 1
✨ Influential: 0

🤖 AI Summary
To address the high computational complexity, limited long-range dependency modeling, and poor adaptability of point cloud serialization in Transformer-based 3D point cloud semantic segmentation, this paper proposes Pamba, the first state space model (SSM)-based architecture with linear complexity. The method introduces: (1) a multi-path point cloud serialization strategy that explicitly satisfies the causal constraint of SSMs; (2) the ConvMamba module, which integrates convolutional operations to enhance local geometric modeling and bidirectional contextual awareness; and (3) an end-to-end point cloud feature encoding and fusion framework. Experiments on ScanNet v2, ScanNet200, S3DIS, and nuScenes show state-of-the-art segmentation results, with gains in both accuracy and efficiency over existing methods. These results support the effectiveness and scalability of SSMs for large-scale point cloud understanding.
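
To make the "causal constraint" and "linear complexity" in the summary concrete, the following is a minimal sketch of a state space recurrence scanned over a sequence. It is not Mamba or ConvMamba: Mamba uses input-dependent (selective) parameters, whereas this sketch assumes fixed scalar parameters `a`, `b`, and `c`, and the function names `ssm_scan` and `bidirectional_scan` are hypothetical. The sketch only shows that a single pass costs one update per element, that each output depends solely on earlier inputs, and that adding a reversed pass is one simple way to give every position context from both directions.

```python
def ssm_scan(xs, a=0.5, b=1.0, c=1.0):
    """Scan a fixed scalar state space recurrence over xs.

    h_t = a * h_{t-1} + b * x_t,  y_t = c * h_t
    One update per element, so the cost is linear in len(xs).
    The parameters a, b, c are illustrative assumptions.
    """
    h = 0.0
    ys = []
    for x in xs:
        h = a * h + b * x      # state carries information from the past only
        ys.append(c * h)
    return ys


def bidirectional_scan(xs, **params):
    """Sum a forward scan and a backward scan (a hypothetical fusion rule)."""
    forward = ssm_scan(xs, **params)
    backward = ssm_scan(xs[::-1], **params)[::-1]
    return [f + g for f, g in zip(forward, backward)]


if __name__ == "__main__":
    print(ssm_scan([1.0, 0.0, 0.0]))
    print(bidirectional_scan([1.0, 0.0, 0.0]))
```

Because the forward scan is causal, the order in which points are fed to it determines what context each point can see, which is why the serialization of an unordered point cloud matters for this family of models.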

πŸ“ Abstract
Transformers have demonstrated impressive results for 3D point cloud semantic segmentation. However, the quadratic complexity of Transformers makes computation expensive, which limits the number of points that can be processed simultaneously and impedes the modeling of long-range dependencies between objects in a single scene. Drawing inspiration from the potential of recent state space models (SSMs) for long-sequence modeling, we introduce Mamba, an SSM-based architecture, to the point cloud domain and propose Pamba, a novel architecture with strong global modeling capability under linear complexity. Specifically, to reconcile the unordered nature of point clouds with the causal nature of Mamba, we propose a multi-path serialization strategy applicable to point clouds. In addition, we propose the ConvMamba block to compensate for Mamba's shortcomings in modeling local geometry and its unidirectional modeling. Pamba obtains state-of-the-art results on several 3D point cloud segmentation benchmarks, including ScanNet v2, ScanNet200, S3DIS, and nuScenes, and its effectiveness is validated by extensive experiments.
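
The abstract does not spell out how the multi-path serialization works, so the following is only a sketch of the general idea under stated assumptions: points are quantized to a voxel grid, each voxel gets a Z-order (Morton) key, and sorting by that key turns the unordered set into a sequence. Using a different axis order for the key yields a different traversal, and treating each axis permutation as one "path" is an assumption made here for illustration, not the paper's definition. The names `morton_code`, `serialize`, and `multi_path_serialize`, and the parameters `grid_size` and `bits`, are hypothetical.

```python
from itertools import permutations


def morton_code(cell, bits=10):
    """Interleave the bits of three non-negative integer grid coordinates."""
    code = 0
    for bit in range(bits):
        for axis, value in enumerate(cell):
            code |= ((value >> bit) & 1) << (3 * bit + axis)
    return code


def serialize(points, grid_size=0.05, axis_order=(0, 1, 2), bits=10):
    """Return point indices sorted along a Z-order curve for one axis order."""
    mins = [min(p[a] for p in points) for a in range(3)]
    limit = (1 << bits) - 1  # clamp so every coordinate fits in `bits` bits
    keys = []
    for idx, p in enumerate(points):
        cell = tuple(
            min(int((p[a] - mins[a]) / grid_size), limit) for a in axis_order
        )
        keys.append((morton_code(cell, bits), idx))
    return [idx for _, idx in sorted(keys)]


def multi_path_serialize(points, grid_size=0.05):
    """Build one ordering per axis permutation (six illustrative paths)."""
    return {
        order: serialize(points, grid_size, order)
        for order in permutations(range(3))
    }


if __name__ == "__main__":
    cloud = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)]
    for order, indices in multi_path_serialize(cloud, grid_size=1.0).items():
        print(order, indices)
```

Each ordering is a permutation of the same point indices, so a causal sequence model scanning different paths sees a given point with different predecessors.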
Problem

Research questions and friction points this paper is trying to address.

3D Point Clouds
Semantic Segmentation
Transformer Optimization
Innovation

Methods, ideas, or system contributions that make the work stand out.

State Space Model
3D Point Cloud Segmentation
ConvMamba Enhancement
Zhuoyuan Li
University of Science and Technology of China (USTC)
Video Coding, Inter/Intra Prediction, In-Loop Filtering, Learned Compression
Yubo Ai
Deep Space Exploration Laboratory/School of Information Science and Technology, University of Science and Technology of China
Jiahao Lu
Deep Space Exploration Laboratory/School of Information Science and Technology, University of Science and Technology of China
Chuxin Wang
University of Science and Technology of China
3D Computer Vision and 3D Object Detection
Jiacheng Deng
University of Science and Technology of China
Point cloud, 3D scene perception
Hanzhi Chang
Deep Space Exploration Laboratory/School of Information Science and Technology, University of Science and Technology of China
Yanzhe Liang
Deep Space Exploration Laboratory/School of Information Science and Technology, University of Science and Technology of China
Wenfei Yang
University of Science and Technology of China
Computer Vision
Shifeng Zhang
Institute of Automation, Chinese Academy of Sciences
Computer Vision, Object Detection, Face Detection, Pedestrian Detection
Tianzhu Zhang
Professor, University of Science and Technology of China; previously Institute of Automation, CAS
Computer Vision, Pattern Recognition, Multimedia Analysis, Machine Learning