MERIT: Multi-domain Efficient RAW Image Translation

📅 2026-03-21
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the significant domain shift in RAW images captured by different camera sensors, which arises from variations in spectral response, noise characteristics, and tonal rendering, thereby hindering downstream vision tasks. To overcome this challenge, we propose the first unified multi-domain RAW image translation framework capable of achieving high-quality conversion between arbitrary camera domains using a single model. The core innovations include a sensor-aware noise alignment mechanism and a conditional multi-scale large-kernel attention module. Additionally, we introduce MDRAW, the first benchmark dataset tailored for multi-domain RAW image translation. Experimental results demonstrate that our method improves translation quality by 5.56 dB while reducing training iterations by 80%, substantially outperforming existing approaches.

Technology Category

Application Category

📝 Abstract
RAW images captured by different camera sensors exhibit substantial domain shifts due to varying spectral responses, noise characteristics, and tone behaviors, complicating their direct use in downstream computer vision tasks. Prior methods address this problem by training domain-specific RAW-to-RAW translators for each source-target pair, but such approaches do not scale to real-world scenarios involving multiple types of commercial cameras. In this work, we introduce MERIT, the first unified framework for multi-domain RAW image translation, which leverages a single model to perform translations across arbitrary camera domains. To address domain-specific noise discrepancies, we propose a sensor-aware noise modeling loss that explicitly aligns the signal-dependent noise statistics of the generated images with those of the target domain. We further enhance the generator with a conditional multi-scale large kernel attention module for improved context and sensor-aware feature modeling. To facilitate standardized evaluation, we introduce MDRAW, the first dataset tailored for multi-domain RAW image translation, comprising both paired and unpaired RAW captures from five diverse camera sensors across a wide range of scenes. Extensive experiments demonstrate that MERIT outperforms prior models in both quality (5.56 dB improvement) and scalability (80% reduction in training iterations).
Problem

Research questions and friction points this paper is trying to address.

RAW image translation
domain shift
multi-domain
sensor-specific noise
computer vision
Innovation

Methods, ideas, or system contributions that make the work stand out.

multi-domain RAW translation
sensor-aware noise modeling
unified image translation framework
large kernel attention
MDRAW dataset
🔎 Similar Papers
No similar papers found.