PRISM: Progressive Rain removal with Integrated State-space Modeling

📅 2025-09-30
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Image deraining is critical for autonomous driving and related vision tasks, yet existing single-scale methods struggle to simultaneously recover fine-grained details and preserve global structural consistency. To address this, we propose PRISM, a progressive deraining framework comprising three stages: coarse extraction, hybrid-domain feature fusion, and fine-grained restoration. We introduce Hybrid Attention U-Net—integrating channel-wise attention and windowed Transformer modules—to enable robust multi-scale feature aggregation. Additionally, we design Hybrid Domain Mamba (HDMamba), jointly modeling spatial semantics and wavelet-domain characteristics for enhanced rain pattern discrimination. An original-resolution subnetwork is further embedded to retain high-frequency textures and sharp edges. Extensive experiments on multiple benchmarks demonstrate that PRISM achieves state-of-the-art performance, significantly improving removal of rain streaks and raindrops while maintaining global coherence and substantially enhancing texture fidelity and edge sharpness.

Technology Category

Application Category

📝 Abstract
Image deraining is an essential vision technique that removes rain streaks and water droplets, enhancing clarity for critical vision tasks like autonomous driving. However, current single-scale models struggle with fine-grained recovery and global consistency. To address this challenge, we propose Progressive Rain removal with Integrated State-space Modeling (PRISM), a progressive three-stage framework: Coarse Extraction Network (CENet), Frequency Fusion Network (SFNet), and Refine Network (RNet). Specifically, CENet and SFNet utilize a novel Hybrid Attention UNet (HA-UNet) for multi-scale feature aggregation by combining channel attention with windowed spatial transformers. Moreover, we propose Hybrid Domain Mamba (HDMamba) for SFNet to jointly model spatial semantics and wavelet domain characteristics. Finally, RNet recovers the fine-grained structures via an original-resolution subnetwork. Our model learns high-frequency rain characteristics while preserving structural details and maintaining global context, leading to improved image quality. Our method achieves competitive results on multiple datasets against recent deraining methods.
Problem

Research questions and friction points this paper is trying to address.

Removing rain streaks and water droplets from images
Addressing fine-grained recovery and global consistency issues
Modeling spatial semantics and wavelet domain characteristics
Innovation

Methods, ideas, or system contributions that make the work stand out.

Progressive three-stage framework for rain removal
Hybrid Attention UNet for multi-scale feature aggregation
Hybrid Domain Mamba modeling spatial and wavelet domains
🔎 Similar Papers
No similar papers found.
P
Pengze Xue
Faculty of Data Science, City University of Macau, SAR Macao, China
S
Shanwen Wang
Faculty of Data Science, City University of Macau, SAR Macao, China
Fei Zhou
Fei Zhou
HAUT
deep learningtarget detectionimage processing
Y
Yan Cui
Zhuhai 4Dage Network Technology, China
X
Xin Sun
Faculty of Data Science, City University of Macau, SAR Macao, China