LAKAN: Landmark-assisted Adaptive Kolmogorov-Arnold Network for Face Forgery Detection

📅 2025-10-01
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing CNN- and Transformer-based deepfake detection models exhibit limited capacity in modeling complex nonlinear artifacts inherent in forged facial images. To address this, we propose a novel detection framework leveraging the Kolmogorov–Arnold Network (KAN). Our method introduces two key innovations: (1) learnable spline-based activation functions to enhance local nonlinear approximation capability; and (2) geometric prior-guided parameter generation—where facial landmarks dynamically instantiate KAN module parameters, enabling structural awareness and adaptive feature modeling. Integrated into a generic image encoder architecture, the proposed approach achieves state-of-the-art performance on benchmark datasets including FaceForensics++ and Celeb-DF, improving average detection accuracy by 2.3–4.1 percentage points over prior methods. Moreover, it demonstrates superior cross-dataset generalization, confirming the effectiveness and robustness of KANs for deepfake detection.

Technology Category

Application Category

📝 Abstract
The rapid development of deepfake generation techniques necessitates robust face forgery detection algorithms. While methods based on Convolutional Neural Networks (CNNs) and Transformers are effective, there is still room for improvement in modeling the highly complex and non-linear nature of forgery artifacts. To address this issue, we propose a novel detection method based on the Kolmogorov-Arnold Network (KAN). By replacing fixed activation functions with learnable splines, our KAN-based approach is better suited to this challenge. Furthermore, to guide the network's focus towards critical facial areas, we introduce a Landmark-assisted Adaptive Kolmogorov-Arnold Network (LAKAN) module. This module uses facial landmarks as a structural prior to dynamically generate the internal parameters of the KAN, creating an instance-specific signal that steers a general-purpose image encoder towards the most informative facial regions with artifacts. This core innovation creates a powerful combination between geometric priors and the network's learning process. Extensive experiments on multiple public datasets show that our proposed method achieves superior performance.
Problem

Research questions and friction points this paper is trying to address.

Detecting face forgeries in deepfake-generated media
Modeling complex non-linear forgery artifacts effectively
Guiding detection focus to critical facial regions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses learnable splines for flexible activation functions
Integrates facial landmarks to guide network focus
Dynamically generates KAN parameters using landmark priors
🔎 Similar Papers
No similar papers found.
J
Jiayao Jiang
School of Cyber Science and Technology, University of Science and Technology of China
Siran Peng
Siran Peng
CASIA
Computer VisionImage FusionDeepfake Detection
B
Bin Liu
School of Cyber Science and Technology, University of Science and Technology of China
Qi Chu
Qi Chu
University of Science and Technology of China
Computer visionArtificial intelligence security
Nenghai Yu
Nenghai Yu
University of Science and Technology of China
Computer VisionArtificial IntelligenceInformation Hiding