DeformTrace: A Deformable State Space Model with Relay Tokens for Temporal Forgery Localization

📅 2026-03-05
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses key challenges in video and audio manipulation localization—namely, ambiguous boundaries, sparse tampering patterns, and insufficient long-range modeling—by proposing DeformTrace, a novel framework that synergistically combines the global modeling capacity of Transformers with the computational efficiency of State Space Models (SSMs). DeformTrace introduces deformable SSMs (DS-SSM/DC-SSM) to dynamically adapt receptive fields, incorporates relay tokens to mitigate long-range dependency decay, and designs a query-aware subspace mechanism to enhance sensitivity to sparse manipulations. Despite using fewer parameters and achieving faster inference, DeformTrace attains state-of-the-art accuracy and robustness on temporal forgery localization benchmarks.

Technology Category

Application Category

📝 Abstract
Temporal Forgery Localization (TFL) aims to precisely identify manipulated segments in video and audio, offering strong interpretability for security and forensics. While recent State Space Models (SSMs) show promise in precise temporal reasoning, their use in TFL is hindered by ambiguous boundaries, sparse forgeries, and limited long-range modeling. We propose DeformTrace, which enhances SSMs with deformable dynamics and relay mechanisms to address these challenges. Specifically, Deformable Self-SSM (DS-SSM) introduces dynamic receptive fields into SSMs for precise temporal localization. To further enhance its capacity for temporal reasoning and mitigate long-range decay, a Relay Token Mechanism is integrated into DS-SSM. Besides, Deformable Cross-SSM (DC-SSM) partitions the global state space into query-specific subspaces, reducing non-forgery information accumulation and boosting sensitivity to sparse forgeries. These components are integrated into a hybrid architecture that combines the global modeling of Transformers with the efficiency of SSMs. Extensive experiments show that DeformTrace achieves state-of-the-art performance with fewer parameters, faster inference, and stronger robustness.
Problem

Research questions and friction points this paper is trying to address.

Temporal Forgery Localization
State Space Models
Deformable Dynamics
Relay Tokens
Sparse Forgeries
Innovation

Methods, ideas, or system contributions that make the work stand out.

Deformable State Space Model
Temporal Forgery Localization
Relay Token Mechanism
Dynamic Receptive Field
Sparse Forgery Detection
🔎 Similar Papers
No similar papers found.
Xiaodong Zhu
Xiaodong Zhu
Professor of Economics, Faculty of Business and Economics, University of Hong Kong
MacroeconomicsGrowth and DevelopmentChinese Economy
S
Suting Wang
National Engineering Research Center for Multimedia Software, School of Computer Science, Wuhan University, China
Y
Yuanming Zheng
National Engineering Research Center for Multimedia Software, School of Computer Science, Wuhan University, China
Junqi Yang
Junqi Yang
Undergraduate Research Assistant, Huazhong University of Science and Technology
Artificial Intelligence
Y
Yangxu Liao
National Engineering Research Center for Multimedia Software, School of Computer Science, Wuhan University, China
Y
Yuhong Yang
National Engineering Research Center for Multimedia Software, School of Computer Science, Wuhan University, China
Weiping Tu
Weiping Tu
Wuhan University, Wuhan City, Hubei Prov., China
audio signal processingartificial intelligence
Zhongyuan Wang
Zhongyuan Wang
Wuhan University