🤖 AI Summary
Fine-grained modeling of sarcasm remains challenging because of semantic incongruity, affective intensity, and strong contextual dependency, particularly when recognizing and generating diverse sarcastic subtypes.
Method: We introduce Sarc7, the first benchmark dataset covering seven sarcasm categories (self-deprecating, brooding, deadpan, polite, obnoxious, raging, manic), built by annotating entries of the MUStARD dataset. We also propose an emotion-based prompting technique that combines this seven-category typology with explicit emotional cues, and we evaluate it for classification alongside zero-shot, few-shot, and chain-of-thought (CoT) prompting. The generation method builds on three key components of sarcasm: incongruity, shock value, and context dependency.
Results: On Sarc7, Gemini 2.5 with emotion-based prompting achieves an F1 score of 0.3664, the highest among all setups. In human evaluation, emotion-based prompting produced 38.46% more successful sarcastic generations than zero-shot prompting. This work establishes an interpretable, scalable framework for fine-grained sarcasm understanding and generation.
📝 Abstract
Sarcasm is a form of humor in which expressions convey meanings opposite to their literal interpretations. Classifying and generating sarcasm using large language models is vital for interpreting human communication. Sarcasm poses challenges for computational models because of its nuanced nature. We introduce Sarc7, a benchmark that distinguishes seven types of sarcasm (self-deprecating, brooding, deadpan, polite, obnoxious, raging, and manic), built by annotating entries of the MUStARD dataset. Classification was evaluated using zero-shot, few-shot, chain-of-thought (CoT), and a novel emotion-based prompting technique. We propose an emotion-based generation method developed by identifying key components of sarcasm: incongruity, shock value, and context dependency. Our classification experiments show that Gemini 2.5, using emotion-based prompting, outperforms other setups with an F1 score of 0.3664. Human evaluators preferred our emotion-based prompting, with 38.46% more successful generations than zero-shot prompting.
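As a rough illustration of what emotion-based classification prompting over the seven Sarc7 labels could look like, the Python sketch below assembles a prompt and parses a model reply. The abstract does not give the actual prompt, so the wording, the three-step structure, and the helper names `build_emotion_prompt` and `parse_label` are all assumptions made for illustration. The label list contains only the seven sarcasm types named above; whether the benchmark also includes a non-sarcastic class is not stated here.

```python
from typing import List, Optional

# The seven sarcasm types named in the abstract.
SARC7_LABELS = [
    "self-deprecating", "brooding", "deadpan", "polite",
    "obnoxious", "raging", "manic",
]


def build_emotion_prompt(utterance: str, context: List[str]) -> str:
    """Assemble a hypothetical emotion-based classification prompt.

    The step wording is an assumption, not the prompt used by the authors.
    """
    context_block = "\n".join(f"- {turn}" for turn in context) or "- (none)"
    return (
        "Preceding dialogue:\n"
        f"{context_block}\n\n"
        f"Utterance: {utterance}\n\n"
        "Step 1: Name the emotion the speaker appears to express.\n"
        "Step 2: Note any incongruity between the literal wording and the context.\n"
        f"Step 3: Answer with exactly one label from: {', '.join(SARC7_LABELS)}.\n"
        "Label:"
    )


def parse_label(response: str) -> Optional[str]:
    """Return the single Sarc7 label mentioned in a reply, or None if ambiguous."""
    text = response.lower()
    matches = [label for label in SARC7_LABELS if label in text]
    return matches[0] if len(matches) == 1 else None
```

The resulting string would be sent to whichever model is being evaluated; `parse_label` rejects replies that name zero or several labels, so ambiguous outputs are not silently counted as a prediction.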