Clustering Internet Memes Through Template Matching and Multi-Dimensional Similarity

📅 2025-04-30
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Meme clustering on the Internet faces challenges in modeling semantic similarity due to multimodality, cultural dependency, and dynamic evolution; existing approaches largely rely on predefined databases, neglecting semantic diversity. This paper proposes the first database-free, template-matching–based adaptive clustering framework for memes. It jointly models four-dimensional semantic similarities—structural, visual, textual, and identity-related—and integrates local-global similarity metrics with unsupervised clustering. Crucially, the method eliminates dependence on annotated data or fixed template repositories, enabling semantically coherent, cross-cultural, and cross-platform meme clustering. Extensive experiments on multiple benchmark datasets demonstrate significant improvements over state-of-the-art methods, yielding clusters that are more cohesive, interpretable, and aligned with human intuition. The complete implementation is publicly available.

Technology Category

Application Category

📝 Abstract
Meme clustering is critical for toxicity detection, virality modeling, and typing, but it has received little attention in previous research. Clustering similar Internet memes is challenging due to their multimodality, cultural context, and adaptability. Existing approaches rely on databases, overlook semantics, and struggle to handle diverse dimensions of similarity. This paper introduces a novel method that uses template-based matching with multi-dimensional similarity features, thus eliminating the need for predefined databases and supporting adaptive matching. Memes are clustered using local and global features across similarity categories such as form, visual content, text, and identity. Our combined approach outperforms existing clustering methods, producing more consistent and coherent clusters, while similarity-based feature sets enable adaptability and align with human intuition. We make all supporting code publicly available to support subsequent research. Code: https://github.com/tygobl/meme-clustering
Problem

Research questions and friction points this paper is trying to address.

Clustering Internet memes for toxicity detection and virality modeling
Overcoming multimodality and cultural context challenges in meme clustering
Improving clustering accuracy with template-based multi-dimensional similarity features
Innovation

Methods, ideas, or system contributions that make the work stand out.

Template-based matching for meme clustering
Multi-dimensional similarity features integration
No need for predefined meme databases
🔎 Similar Papers
No similar papers found.