GenCAMO: Scene-Graph Contextual Decoupling for Environment-aware and Mask-free Camouflage Image-Dense Annotation Generation

📅 2026-01-03
🏛️ arXiv.org
📈 Citations: 1
Influential: 0
🤖 AI Summary
This work addresses the scarcity of high-quality, large-scale annotated camouflage imagery that hinders dense prediction tasks such as camouflaged object detection and open-vocabulary segmentation. To overcome this limitation, we propose an environment-aware, mask-free generative framework that leverages scene graph context disentanglement to jointly synthesize realistic multimodal camouflaged images along with their dense annotations—including depth maps, attribute descriptions, and textual prompts—thereby constructing GenCAMO-DB, the first large-scale synthetic dataset for this domain. Experimental results demonstrate that models trained on our synthesized data achieve significantly improved performance in complex camouflaged scenarios, validating both the effectiveness and generalization capability of the generated data.

📝 Abstract
Conceal dense prediction (CDP), especially RGB-D camouflage object detection and open-vocabulary camouflage object segmentation, plays a crucial role in advancing the understanding and reasoning of complex camouflage scenes. However, high-quality and large-scale camouflage datasets with dense annotation remain scarce due to expensive data collection and labeling costs. To address this challenge, we explore leveraging generative models to synthesize realistic camouflage image-dense data for training CDP models with fine-grained representations, prior knowledge, and auxiliary reasoning. Concretely, our contributions are threefold: (i) we introduce GenCAMO-DB, a large-scale camouflage dataset with multi-modal annotations, including depth maps, scene graphs, attribute descriptions, and text prompts; (ii) we present GenCAMO, an environment-aware and mask-free generative framework that produces high-fidelity camouflage image-dense annotations; (iii) extensive experiments across multiple modalities demonstrate that GenCAMO significantly improves dense prediction performance on complex camouflage scenes by providing high-quality synthetic data. The code and datasets will be released after paper acceptance.
Problem

Research questions and friction points this paper is trying to address.

camouflage image
dense annotation
dataset scarcity
conceal dense prediction
RGB-D camouflage object detection
Innovation

Methods, ideas, or system contributions that make the work stand out.

GenCAMO
scene-graph contextual decoupling
mask-free generation
camouflage image synthesis
dense annotation

Authors

Chenglizhao Chen, China University of Petroleum (East China)
Shaojiang Yuan, China University of Petroleum (East China)
Xiaoxue Lu, China University of Petroleum (East China)
Mengke Song, China University of Petroleum (East China)
Jia Song, Assistant Professor, University of Idaho (Cybersecurity)
Zhenyu Wu, PhD student, Xi'an Jiaotong University (natural language processing)
Wenfeng Song, Beijing Information Science and Technology University
Shuai Li, Beihang University