Frequency-aware Neural Representation for Videos

📅 2026-01-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the spectral bias inherent in existing implicit neural representation (INR) methods for video compression, which favor low-frequency components and consequently produce overly smooth reconstructions with suboptimal rate-distortion performance. To mitigate this limitation, we propose FaNeRV, a novel framework that explicitly decouples high- and low-frequency components through a frequency-aware mechanism. FaNeRV incorporates a frequency-domain decomposition network, multi-resolution supervision guidance, and a dynamic high-frequency injection strategy to effectively alleviate spectral bias. Experimental results demonstrate that FaNeRV significantly outperforms current INR-based approaches on standard benchmarks, achieving rate-distortion performance comparable to conventional video codecs while enabling efficient and high-fidelity video reconstruction.

Technology Category

Application Category

📝 Abstract
Implicit Neural Representations (INRs) have emerged as a promising paradigm for video compression. However, existing INR-based frameworks typically suffer from inherent spectral bias, which favors low-frequency components and leads to over-smoothed reconstructions and suboptimal rate-distortion performance. In this paper, we propose FaNeRV, a Frequency-aware Neural Representation for videos, which explicitly decouples low- and high-frequency components to enable efficient and faithful video reconstruction. FaNeRV introduces a multi-resolution supervision strategy that guides the network to progressively capture global structures and fine-grained textures through staged supervision . To further enhance high-frequency reconstruction, we propose a dynamic high-frequency injection mechanism that adaptively emphasizes challenging regions. In addition, we design a frequency-decomposed network module to improve feature modeling across different spectral bands. Extensive experiments on standard benchmarks demonstrate that FaNeRV significantly outperforms state-of-the-art INR methods and achieves competitive rate-distortion performance against traditional codecs.
Problem

Research questions and friction points this paper is trying to address.

spectral bias
video compression
high-frequency reconstruction
rate-distortion performance
implicit neural representations
Innovation

Methods, ideas, or system contributions that make the work stand out.

Frequency-aware
Implicit Neural Representation
Multi-resolution Supervision
High-frequency Injection
Video Compression
🔎 Similar Papers
No similar papers found.
J
Jun Zhu
University of Chinese Academy of Sciences
Xinfeng Zhang
Xinfeng Zhang
Fuxi AI Lab, NetEase Inc.
Vision-Language ModelsMultimodal
Lv Tang
Lv Tang
University of Alberta. Former researcher @ UCAS/Nanjing University
Computer VisionMLLMVideo CompressionImage Segmentation
J
Junhao Jiang
University of Chinese Academy of Sciences
G
Gai Zhang
University of Chinese Academy of Sciences
J
Jia Wang
University of Chinese Academy of Sciences