π€ AI Summary
Video-based deception detection faces significant challenges due to the sparsity and transience of deceptive cues, as well as interference from identity-related noise, which hinders the learning of discriminative representations. To address these issues, this work proposes GenLie, a novel network featuring a localβglobal collaborative modeling architecture. The local component captures subtle deceptive signals through fine-grained feature extraction, while a global supervision mechanism suppresses identity-induced noise and enhances representation robustness. This dual strategy effectively mitigates semantic interference and data sparsity. Extensive experiments demonstrate that GenLie consistently outperforms state-of-the-art methods across three public datasets encompassing both high- and low-stakes deception scenarios, validating its effectiveness and strong generalization capability.
π Abstract
Video-based lie detection aims to identify deceptive behaviors from visual cues. Despite recent progress, its core challenge lies in learning sparse yet discriminative representations. Deceptive signals are typically subtle and short-lived, easily overwhelmed by redundant information, while individual and contextual variations introduce strong identity-related noise. To address this issue, we propose GenLie, a Global-Enhanced Lie Detection Network that performs local feature modeling under global supervision. Specifically, sparse and subtle deceptive cues are captured at the local level, while global supervision and optimization ensure robust and discriminative representations by suppressing identity-related noise. Experiments on three public datasets, covering both high- and low-stakes scenarios, show that GenLie consistently outperforms state-of-the-art methods. Source code is available at https://github.com/AliasDictusZ1/GenLie.