Simple Lines, Big Ideas: Towards Interpretable Assessment of Human Creativity from Drawings

📅 2025-11-16
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Human creativity assessment relies heavily on expert subjective scoring, which is costly, time-consuming, and lacks transparency and interpretability. Method: This work proposes an automatic, interpretable assessment framework for creativity based on children’s drawings. We introduce a multimodal multitask learning architecture that— for the first time—decouples creativity into complementary content and style dimensions. A conditional learning mechanism enables dynamic feature extraction responsive to both semantic and stylistic cues. The model jointly trains a vision backbone with expert-annotated content category labels. Results: Our method significantly outperforms existing regression-based approaches on creativity scoring. Attention visualizations generated by the model exhibit strong alignment with human expert judgments, demonstrating both high predictive accuracy and intrinsic interpretability. The framework thus offers a scalable, objective, and transparent alternative to conventional expert-based evaluation.

Technology Category

Application Category

📝 Abstract
Assessing human creativity through visual outputs, such as drawings, plays a critical role in fields including psychology, education, and cognitive science. However, current assessment practices still rely heavily on expert-based subjective scoring, which is both labor-intensive and inherently subjective. In this paper, we propose a data-driven framework for automatic and interpretable creativity assessment from drawings. Motivated by the cognitive understanding that creativity can emerge from both what is drawn (content) and how it is drawn (style), we reinterpret the creativity score as a function of these two complementary dimensions.Specifically, we first augment an existing creativity labeled dataset with additional annotations targeting content categories. Based on the enriched dataset, we further propose a multi-modal, multi-task learning framework that simultaneously predicts creativity scores, categorizes content types, and extracts stylistic features. In particular, we introduce a conditional learning mechanism that enables the model to adapt its visual feature extraction by dynamically tuning it to creativity-relevant signals conditioned on the drawing's stylistic and semantic cues.Experimental results demonstrate that our model achieves state-of-the-art performance compared to existing regression-based approaches and offers interpretable visualizations that align well with human judgments. The code and annotations will be made publicly available at https://github.com/WonderOfU9/CSCA_PRCV_2025
Problem

Research questions and friction points this paper is trying to address.

Automating subjective creativity assessment in drawings
Interpreting creativity through content and style dimensions
Developing interpretable multi-modal learning for visual creativity
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-modal multi-task learning for creativity assessment
Conditional learning adapting to stylistic and semantic cues
Interpretable visualizations aligning with human judgment
🔎 Similar Papers
No similar papers found.
Zihao Lin
Zihao Lin
University of California, Davis, Department of Computer Science
Multi-modal Representation LearningNatural Language Processing
Z
Zhenshan Shi
Department of Computer Science, South China Normal University, Guang Zhou, China
S
Sasa Zhao
Department of Computer Science, South China Normal University, Guang Zhou, China
H
Hanwei Zhu
College of Computing and Data Science, Nanyang Technological University, Singapore
L
Lingyu Zhu
Department of Computer Science, City University of Hong Kong, Hong Kong SAR, China
B
Baoliang Chen
Department of Computer Science, South China Normal University, Guang Zhou, China
L
Lei Mo
Department of Computer Science, South China Normal University, Guang Zhou, China