π€ AI Summary
Modeling task correlations in multi-task dense prediction remains challenging due to the difficulty of jointly capturing inter-task dependencies and spatial structure. Method: This paper proposes the Parameter-Aware Mamba Model (PAMM), the first to introduce state-space models (specifically S4) into multi-task dense prediction. PAMM features a dual state-space parameter expert mechanism that explicitly incorporates task-specific priors, and employs multi-directional Hilbert scanning to enhance the sequence modelβs capacity to capture 2D dense spatial structures. Contribution/Results: PAMM unifies task interaction modeling and global contextual reasoning within an end-to-end trainable framework, enabling joint optimization across tasks. Extensive experiments on NYUD-v2 and PASCAL-Context demonstrate significant improvements over state-of-the-art methods, validating both its effectiveness and generalizability.
π Abstract
Understanding the inter-relations and interactions between tasks is crucial for multi-task dense prediction. Existing methods predominantly utilize convolutional layers and attention mechanisms to explore task-level interactions. In this work, we introduce a novel decoder-based framework, Parameter Aware Mamba Model (PAMM), specifically designed for dense prediction in multi-task learning setting. Distinct from approaches that employ Transformers to model holistic task relationships, PAMM leverages the rich, scalable parameters of state space models to enhance task interconnectivity. It features dual state space parameter experts that integrate and set task-specific parameter priors, capturing the intrinsic properties of each task. This approach not only facilitates precise multi-task interactions but also allows for the global integration of task priors through the structured state space sequence model (S4). Furthermore, we employ the Multi-Directional Hilbert Scanning method to construct multi-angle feature sequences, thereby enhancing the sequence model's perceptual capabilities for 2D data. Extensive experiments on the NYUD-v2 and PASCAL-Context benchmarks demonstrate the effectiveness of our proposed method. Our code is available at https://github.com/CQC-gogopro/PAMM.