StyleDecoupler: Generalizable Artistic Style Disentanglement

📅 2026-01-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of effectively decoupling artistic style from semantic content, which are typically highly entangled. To this end, we propose StyleDecoupler, a training-free, general-purpose framework that leverages information theory to disentangle pure style representations. Specifically, by minimizing mutual information, our method aligns features between a frozen multimodal vision-language model—encoding both style and content—and a unimodal model that emphasizes content-invariant characteristics. This plug-and-play approach requires no fine-tuning and introduces WeART, a large-scale benchmark for artistic style analysis. Evaluated on both WeART and WikiArt, StyleDecoupler achieves state-of-the-art performance in style retrieval, style relationship mapping, and evaluation of generative models.

Technology Category

Application Category

📝 Abstract
Representing artistic style is challenging due to its deep entanglement with semantic content. We propose StyleDecoupler, an information-theoretic framework that leverages a key insight: multi-modal vision models encode both style and content, while uni-modal models suppress style to focus on content-invariant features. By using uni-modal representations as content-only references, we isolate pure style features from multi-modal embeddings through mutual information minimization. StyleDecoupler operates as a plug-and-play module on frozen Vision-Language Models without fine-tuning. We also introduce WeART, a large-scale benchmark of 280K artworks across 152 styles and 1,556 artists. Experiments show state-of-the-art performance on style retrieval across WeART and WikiART, while enabling applications like style relationship mapping and generative model evaluation. We release our method and dataset at this url.
Problem

Research questions and friction points this paper is trying to address.

artistic style
style disentanglement
content-style entanglement
style representation
vision-language models
Innovation

Methods, ideas, or system contributions that make the work stand out.

style disentanglement
mutual information minimization
vision-language models
plug-and-play module
artistic style representation
🔎 Similar Papers
No similar papers found.