Divergent Creativity in Humans and Large Language Models

📅 2024-05-13
🏛️ arXiv.org
📈 Citations: 10
Influential: 0
📄 PDF
🤖 AI Summary
This study systematically investigates differences between large language models (LLMs) and humans in divergent creativity, with a focus on semantic diversity as a core dimension. Method: We propose the first comparable, reproducible cross-subject quantitative evaluation framework for divergent creativity, integrating cognitive psychology scales, computational semantic similarity metrics (BERTScore and Word2Vec), and diversity measures (unigram entropy and uniqueness). The framework is benchmarked on 100,000 real human behavioral responses and state-of-the-art LLMs. Contribution/Results: Results show that certain LLMs significantly outperform the human population average on divergent association and creative writing tasks—and approach the performance of highly creative individuals. The framework is publicly released, establishing a new empirical paradigm for measurable advancement of creative AI and for foundational research into the nature of human originality.

Technology Category

Application Category

📝 Abstract
The recent surge in the capabilities of Large Language Models (LLMs) has led to claims that they are approaching a level of creativity akin to human capabilities. This idea has sparked a blend of excitement and apprehension. However, a critical piece that has been missing in this discourse is a systematic evaluation of LLM creativity, particularly in comparison to human divergent thinking. To bridge this gap, we leverage recent advances in creativity science to build a framework for in-depth analysis of divergent creativity in both state-of-the-art LLMs and a substantial dataset of 100,000 humans. We found evidence suggesting that LLMs can indeed surpass human capabilities in specific creative tasks such as divergent association and creative writing. Our quantitative benchmarking framework opens up new paths for the development of more creative LLMs, but it also encourages more granular inquiries into the distinctive elements that constitute human inventive thought processes, compared to those that can be artificially generated.
Problem

Research questions and friction points this paper is trying to address.

Evaluating semantic diversity in LLMs vs humans
Comparing LLMs and human divergent thinking
Assessing AI's potential to replace human creativity
Innovation

Methods, ideas, or system contributions that make the work stand out.

Leveraging computational creativity for semantic divergence analysis
Human-machine benchmarking framework for creative outputs
Techniques like prompt design to enhance semantic diversity
🔎 Similar Papers
No similar papers found.
A
Antoine Bellemare-Pepin
CoCo Lab, Psychology department, Université de Montréal, Montreal, QC, Canada; Music department, Concordia University, Montreal, QC, Canada
F
François Lespinasse
Sociology and Anthropology department, Concordia University, Montreal, QC, Canada
P
Philipp Thölke
CoCo Lab, Psychology department, Université de Montréal, Montreal, QC, Canada
Y
Yann Harel
CoCo Lab, Psychology department, Université de Montréal, Montreal, QC, Canada
K
Kory Mathewson
Mila (Quebec AI research Institute), Montreal, QC, Canada; Department of Computer Science and Operations Research, Université de Montréal, Montreal, QC, Canada
J
Jay A. Olson
Department of Psychology, University of Toronto Mississauga, Mississauga, ON, Canada
Yoshua Bengio
Yoshua Bengio
Professor of computer science, University of Montreal, Mila, IVADO, CIFAR
Machine learningdeep learningartificial intelligence
K
Karim Jerbi
CoCo Lab, Psychology department, Université de Montréal, Montreal, QC, Canada; Mila (Quebec AI research Institute), Montreal, QC, Canada; UNIQUE Center (Quebec Neuro -AI research Center), QC, Canada