🤖 AI Summary
This study addresses the problem of subjective, bias-prone, and error-sensitive expert judgments in post-generation creative quality assessment. Methodologically, it proposes a computationally tractable and reproducible automated evaluation framework that integrates large language model (LLM)-derived creative embeddings, UMAP-based high-dimensional manifold dimensionality reduction, DBSCAN density-based clustering, and vector similarity metrics to quantitatively assess creative diversity and quality. Its key contribution lies in pioneering the application of manifold learning and unsupervised clustering to creative evaluation—departing from conventional paradigms. Experimental results demonstrate that the framework achieves over 92% agreement with expert consensus across multiple benchmarks, significantly improving screening efficiency and cross-user consistency. Notably, it empowers novice designers to perform professional-grade creative filtering without domain expertise.
📝 Abstract
The demand for innovation in product design necessitates a prolific ideation phase. Conversational AI (CAI) systems that use Large Language Models (LLMs) such as GPT (Generative Pre-trained Transformer) have been shown to be fruitful in augmenting human creativity, providing numerous novel and diverse ideas. Despite the success in ideation quantity, the qualitative assessment of these ideas remains challenging and traditionally reliant on expert human evaluation. This method suffers from limitations such as human judgment errors, bias, and oversight. Addressing this gap, our study introduces a comprehensive mathematical framework for automated analysis to objectively evaluate the plethora of ideas generated by CAI systems and/or humans. This framework is particularly advantageous for novice designers who lack experience in selecting promising ideas. By converting the ideas into higher dimensional vectors and quantitatively measuring the diversity between them using tools such as UMAP, DBSCAN and PCA, the proposed method provides a reliable and objective way of selecting the most promising ideas, thereby enhancing the efficiency of the ideation phase.