UniMark: Artificial Intelligence Generated Content Identification Toolkit

๐Ÿ“… 2025-12-13
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
To address the trust crisis and regulatory challenges surrounding AI-generated content (AIGC), existing detection tools remain fragmented and lack support for visible compliance labeling. This paper introduces the first open-source, unified toolkit for multimodal AIGC governance. It pioneers a dual-strategy framework integrating imperceptible watermarking (for copyright protection) and visible labeling (for regulatory compliance). We design a cross-modal unified abstraction engine and a standardized tri-modal evaluation benchmarkโ€”Image/Video/Audio-Bench. The toolkit incorporates multimodal feature alignment, lightweight neural watermark embedding, interpretable label rendering, and a modular inference architecture. It achieves state-of-the-art performance across all three benchmarks, delivering high accuracy, low latency, and verifiability. The toolkit is publicly released and has been deployed in real-world regulatory auditing scenarios.

Technology Category

Application Category

๐Ÿ“ Abstract
The rapid proliferation of Artificial Intelligence Generated Content has precipitated a crisis of trust and urgent regulatory demands. However, existing identification tools suffer from fragmentation and a lack of support for visible compliance marking. To address these gaps, we introduce the extbf{UniMark}, an open-source, unified framework for multimodal content governance. Our system features a modular unified engine that abstracts complexities across text, image, audio, and video modalities. Crucially, we propose a novel dual-operation strategy, natively supporting both emph{Hidden Watermarking} for copyright protection and emph{Visible Marking} for regulatory compliance. Furthermore, we establish a standardized evaluation framework with three specialized benchmarks (Image/Video/Audio-Bench) to ensure rigorous performance assessment. This toolkit bridges the gap between advanced algorithms and engineering implementation, fostering a more transparent and secure digital ecosystem.
Problem

Research questions and friction points this paper is trying to address.

Addresses fragmentation in AI content identification tools
Introduces a unified multimodal content governance framework
Supports both hidden watermarking and visible compliance marking
Innovation

Methods, ideas, or system contributions that make the work stand out.

Unified multimodal framework for content governance
Dual-operation strategy with hidden and visible marking
Standardized benchmarks for rigorous performance evaluation
๐Ÿ”Ž Similar Papers
2024-06-21Journal of Artificial Intelligence ResearchCitations: 6
2024-06-17North American Chapter of the Association for Computational LinguisticsCitations: 2
M
Meilin Li
Shanghai AI Laboratory, Shandong University
Ji He
Ji He
Guangzhou Medical University
CT Image ReconstructionDeep Learning
J
Jia Xu
Shanghai AI Laboratory
S
Shanzhe Lei
Shanghai AI Laboratory
Y
Yan Teng
Shanghai AI Laboratory
Y
Yingchun Wang
Shanghai AI Laboratory
Xuhong Wang
Xuhong Wang
Shanghai Artificial Intelligence Laboratory
LLMKnowledge SystemAI Simulation