BVI-Mamba: Video Enhancement Using a Visual State-Space Model for Low-Light and Underwater Environments

📅 2026-04-26
📈 Citations: 0
Influential: 0
📄 PDF

career value

218K/year
🤖 AI Summary
This work addresses the degradation of visibility and performance in downstream vision tasks caused by noise, low contrast, color imbalance, and blur commonly found in low-light and underwater videos. To tackle these challenges, the authors propose Visual Mamba, a novel framework that introduces visual state space models (VSS) into video enhancement for the first time. Built upon a lightweight UNet-style architecture, Visual Mamba replaces conventional convolutional layers with VSS blocks to enable precise spatiotemporal feature alignment and effective image quality enhancement. The method achieves superior performance over existing Transformer- and convolution-based approaches on both low-light and underwater video enhancement tasks, while significantly reducing computational and memory overhead.

Technology Category

Application Category

📝 Abstract
Videos captured in low-light and underwater conditions often suffer from distortions such as noise, low contrast, color imbalance, and blur. These issues not only limit visibility but also degrade automatic tasks like detection. Post-processing is typically required but can be time-consuming. AI-based tools for video enhancement also demand significantly more computational resources compared to image-based methods. This paper introduces a novel framework, Visual Mamba, designed to reduce memory usage and computational time by leveraging the Visual State Space (VSS) model. The framework consists of two modules: (i) a feature alignment module, where spatio-temporal displacement between input frames is registered in the feature space, and (ii) an enhancement module, where noise removal and brightness adjustment are performed using a UNet-like architecture, with all convolutional layers replaced by VSS blocks. Experimental results show that the Visual Mamba technique outperforms Transformer and convolution-based models in both low-light and underwater video enhancement tasks. Code is available on line at https://github.com/russellllaputa/BVI-Mamba.
Problem

Research questions and friction points this paper is trying to address.

low-light video
underwater video
video enhancement
image distortion
computational efficiency
Innovation

Methods, ideas, or system contributions that make the work stand out.

Visual State-Space Model
Video Enhancement
Low-Light Imaging
Underwater Vision
Efficient Neural Architecture
🔎 Similar Papers
No similar papers found.