ThematicPlane: Bridging Tacit User Intent and Latent Spaces for Image Generation

📅 2025-08-08
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
While generative AI has lowered barriers to image creation, non-expert users struggle to precisely translate implicit creative intentions—such as emotion or aesthetic style—into actionable inputs for diffusion models. Method: We propose ThematicPlane, the first interactive framework that constructs manipulable latent-space planes aligned with high-level semantic themes (e.g., “warmth”, “drama”) rather than low-level visual features. It leverages semantic embeddings and interactive visualizations to map intuitive user operations—like sliding along thematic dimensions—onto diffusion model latent variables in real time, supporting both divergent exploration and convergent editing. Contribution/Results: A user study (N=6) demonstrates that ThematicPlane significantly improves creative iteration efficiency and expressive freedom. It further validates that interpretable, semantics-aware control is critical for enhancing usability and user trust in generative tools.

Technology Category

Application Category

📝 Abstract
Generative AI has made image creation more accessible, yet aligning outputs with nuanced creative intent remains challenging, particularly for non-experts. Existing tools often require users to externalize ideas through prompts or references, limiting fluid exploration. We introduce ThematicPlane, a system that enables users to navigate and manipulate high-level semantic concepts (e.g., mood, style, or narrative tone) within an interactive thematic design plane. This interface bridges the gap between tacit creative intent and system control. In our exploratory study (N=6), participants engaged in divergent and convergent creative modes, often embracing unexpected results as inspiration or iteration cues. While they grounded their exploration in familiar themes, differing expectations of how themes mapped to outputs revealed a need for more explainable controls. Overall, ThematicPlane fosters expressive, iterative workflows and highlights new directions for intuitive, semantics-driven interaction in generative design tools.
Problem

Research questions and friction points this paper is trying to address.

Aligning image generation with nuanced creative intent
Bridging tacit user intent and system control
Enabling fluid exploration of high-level semantic concepts
Innovation

Methods, ideas, or system contributions that make the work stand out.

Interactive thematic design plane for semantic control
Bridges tacit user intent with latent spaces
Enables fluid exploration without explicit prompts
🔎 Similar Papers
No similar papers found.
D
Daniel Lee
Adobe Inc.
N
Nikhil Sharma
Johns Hopkins University
D
Donghoon Shin
University of Washington
DaEun Choi
DaEun Choi
KAIST
Human-Computer Interaction
H
Harsh Sharma
University of Colorado
J
Jeonghwan Kim
University of Illinois at Urbana-Champaign
Heng Ji
Heng Ji
Professor of Computer Science, AICE Director, ASKS Director, UIUC, Amazon Scholar
Natural Language ProcessingLarge Language Models