Human-Centric Topic Modeling with Goal-Prompted Contrastive Learning and Optimal Transport

📅 2026-04-14
📈 Citations: 0
Influential: 0

🤖 AI Summary
This work proposes a human-centric topic modeling approach that incorporates a user-provided goal directly into the modeling process. It targets a known weakness of traditional topic models: because they focus on statistical coherence without modeling the user's goal, they tend to generate redundant or off-target topics. The method uses large language model prompting to extract goal-relevant candidate words from documents, then incorporates these candidates into semantic-aware contrastive learning via optimal transport for topic discovery. On three public subreddit datasets, the approach outperforms state-of-the-art baselines in topic coherence and diversity and significantly improves alignment with human-provided goals.

📝 Abstract
Existing topic modeling methods, from LDA to recent neural and LLM-based approaches, focus mainly on statistical coherence and often produce redundant or off-target topics that miss the user's underlying intent. We introduce Human-centric Topic Modeling (Human-TM), a novel task formulation that integrates a human-provided goal directly into the topic modeling process to produce interpretable, diverse, and goal-oriented topics. To tackle this challenge, we propose the Goal-prompted Contrastive Topic Model with Optimal Transport (GCTM-OT), which first uses LLM-based prompting to extract goal candidates from documents, then incorporates these into semantic-aware contrastive learning via optimal transport for topic discovery. Experimental results on three public subreddit datasets show that GCTM-OT outperforms state-of-the-art baselines in topic coherence and diversity while significantly improving alignment with human-provided goals, paving the way for more human-centric topic discovery systems.
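
The abstract does not spell out how optimal transport is applied, so the following is only a generic sketch of entropic-regularized optimal transport (Sinkhorn iterations) between a set of topic embeddings and a set of goal-candidate embeddings. It is not the GCTM-OT implementation. The embedding sizes, the cosine cost, the uniform marginals, and the names `sinkhorn`, `cosine_cost`, `topic_emb`, and `cand_emb` are all assumptions made for illustration.

```python
import numpy as np

def cosine_cost(X, Y):
    """Pairwise cosine distance (1 - cosine similarity) between rows of X and Y."""
    Xn = X / np.linalg.norm(X, axis=1, keepdims=True)
    Yn = Y / np.linalg.norm(Y, axis=1, keepdims=True)
    return 1.0 - Xn @ Yn.T

def sinkhorn(cost, a, b, eps=0.5, n_iters=500):
    """Entropic-regularized transport plan between histograms a and b."""
    K = np.exp(-cost / eps)            # Gibbs kernel
    u = np.ones_like(a)
    v = np.ones_like(b)
    for _ in range(n_iters):
        u = a / (K @ v)                # rescale to match row marginals
        v = b / (K.T @ u)              # rescale to match column marginals
    return u[:, None] * K * v[None, :]

# Hypothetical data: random vectors stand in for learned embeddings.
rng = np.random.default_rng(0)
topic_emb = rng.normal(size=(4, 16))   # assumed: 4 topics, 16-dim embeddings
cand_emb = rng.normal(size=(10, 16))   # assumed: 10 goal-candidate words

a = np.full(4, 1 / 4)                  # assumed: uniform mass over topics
b = np.full(10, 1 / 10)                # assumed: uniform mass over candidates

C = cosine_cost(topic_emb, cand_emb)
plan = sinkhorn(C, a, b)               # plan[i, j]: mass moved from topic i to candidate j
ot_cost = float((plan * C).sum())      # total transport cost under the plan
```

In a sketch like this, each row of `plan` can be read as a soft assignment of one topic to the candidate words, and `ot_cost` could serve as a differentiable alignment term if the same computation were written in an autodiff framework. How the paper combines such a term with its contrastive objective is not stated in the abstract.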
Problem

Research questions and friction points this paper is trying to address.

topic modeling
human intent
goal-oriented topics
topic redundancy
topic relevance
Innovation

Methods, ideas, or system contributions that make the work stand out.

Human-centric Topic Modeling
Goal-prompted Contrastive Learning
Optimal Transport
LLM-based Prompting
Topic Coherence and Diversity