Parameter Aware Mamba Model for Multi-task Dense Prediction

πŸ“… 2025-11-18
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Modeling task correlations in multi-task dense prediction remains challenging due to the difficulty of jointly capturing inter-task dependencies and spatial structure. Method: This paper proposes the Parameter-Aware Mamba Model (PAMM), the first to introduce state-space models (specifically S4) into multi-task dense prediction. PAMM features a dual state-space parameter expert mechanism that explicitly incorporates task-specific priors, and employs multi-directional Hilbert scanning to enhance the sequence model’s capacity to capture 2D dense spatial structures. Contribution/Results: PAMM unifies task interaction modeling and global contextual reasoning within an end-to-end trainable framework, enabling joint optimization across tasks. Extensive experiments on NYUD-v2 and PASCAL-Context demonstrate significant improvements over state-of-the-art methods, validating both its effectiveness and generalizability.

Technology Category

Application Category

πŸ“ Abstract
Understanding the inter-relations and interactions between tasks is crucial for multi-task dense prediction. Existing methods predominantly utilize convolutional layers and attention mechanisms to explore task-level interactions. In this work, we introduce a novel decoder-based framework, Parameter Aware Mamba Model (PAMM), specifically designed for dense prediction in multi-task learning setting. Distinct from approaches that employ Transformers to model holistic task relationships, PAMM leverages the rich, scalable parameters of state space models to enhance task interconnectivity. It features dual state space parameter experts that integrate and set task-specific parameter priors, capturing the intrinsic properties of each task. This approach not only facilitates precise multi-task interactions but also allows for the global integration of task priors through the structured state space sequence model (S4). Furthermore, we employ the Multi-Directional Hilbert Scanning method to construct multi-angle feature sequences, thereby enhancing the sequence model's perceptual capabilities for 2D data. Extensive experiments on the NYUD-v2 and PASCAL-Context benchmarks demonstrate the effectiveness of our proposed method. Our code is available at https://github.com/CQC-gogopro/PAMM.
Problem

Research questions and friction points this paper is trying to address.

Modeling task interactions for multi-task dense prediction
Enhancing task interconnectivity using state space models
Improving 2D feature perception with Hilbert scanning sequences
Innovation

Methods, ideas, or system contributions that make the work stand out.

Parameter Aware Mamba Model with dual experts
State space models enhance task interconnectivity
Multi-Directional Hilbert Scanning for 2D features
πŸ”Ž Similar Papers
No similar papers found.
X
Xinzhuo Yu
School of Computer Science, Dalian University of Technology, Dalian 116081, China
Yunzhi Zhuge
Yunzhi Zhuge
Dalian University of Technology
Computer Vision
S
Sitong Gong
School of Information and Communication Engineering, Dalian University of Technology, Dalian 116081, China
L
Lu Zhang
School of Information and Communication Engineering, Dalian University of Technology, Dalian 116081, China
P
Pingping Zhang
School of Future Technology and Artificial Intelligence, Dalian University of Technology, Dalian 116081, China
H
Huchuan Lu
School of Information and Communication Engineering, Dalian University of Technology, Dalian 116081, China, also with the School of Future Technology and Artificial Intelligence, Dalian University of Technology, Dalian 116081, China