🤖 AI Summary
This study addresses the coarse-grained and uninterpretable nature of laughter detection in conversational AI. We introduce the first annotated dataset for humor propensity in spontaneous Japanese textual dialogues and propose a fine-grained, interpretable taxonomy of ten laughter-inducing mechanisms (e.g., empathic affiliation, humorous surprise). Methodologically, we combine multi-annotator consensus labeling with GPT-4–assisted explanation generation, followed by human-led clustering to derive the taxonomy—thereby moving beyond conventional binary laughter detection. To our knowledge, this is the first attribution framework specifically designed for spontaneous dialogue. Experimental results show that GPT-4 achieves 43.14% F1 on humor identification, demonstrating both the practical utility of our taxonomy for enhancing model performance and its potential for cross-task transfer.
📝 Abstract
Laughter serves as a multifaceted communicative signal in human interaction, yet its identification within dialogue presents a significant challenge for conversational AI systems. This study addresses this challenge by annotating laughable contexts in Japanese spontaneous text conversation data and developing a taxonomy to classify the underlying reasons for such contexts. Initially, multiple annotators manually labeled laughable contexts using a binary decision (laughable or non-laughable). Subsequently, an LLM was used to generate explanations for the binary annotations of laughable contexts, which were then categorized into a taxonomy comprising ten categories, including"Empathy and Affinity"and"Humor and Surprise,"highlighting the diverse range of laughter-inducing scenarios. The study also evaluated GPT-4's performance in recognizing the majority labels of laughable contexts, achieving an F1 score of 43.14%. These findings contribute to the advancement of conversational AI by establishing a foundation for more nuanced recognition and generation of laughter, ultimately fostering more natural and engaging human-AI interactions.