A Survey of Pun Generation: Datasets, Evaluations and Methodologies

📅 2025-07-07
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Pun generation research lacks a systematic survey, hindering field advancement. This paper presents the first comprehensive review of computational pun generation: it establishes a unified technical taxonomy encompassing rule-based approaches, Seq2Seq models, and prompt-based pretrained language models; systematically catalogs major datasets (e.g., Pun of the Day, PunGenerator) and evaluation methodologies—distinguishing automated metrics (e.g., BLEU, Punscore) from human evaluation criteria; and critically analyzes core challenges in semantic coherence, controllability of pun mechanisms (e.g., homophonic/homographic), and lexical/structural diversity. By synthesizing over two decades of work, this survey fills a critical scholarly gap, introduces standardized evaluation benchmarks, and outlines a clear technology evolution roadmap. It thereby lays the foundation for future research on interpretable, controllable, and high-quality pun generation.

Technology Category

Application Category

📝 Abstract
Pun generation seeks to creatively modify linguistic elements in text to produce humour or evoke double meanings. It also aims to preserve coherence and contextual appropriateness, making it useful in creative writing and entertainment across various media and contexts. Although pun generation has received considerable attention in computational linguistics, there is currently no dedicated survey that systematically reviews this specific area. To bridge this gap, this paper provides a comprehensive review of pun generation datasets and methods across different stages, including conventional approaches, deep learning techniques, and pre-trained language models. Additionally, we summarise both automated and human evaluation metrics used to assess the quality of pun generation. Finally, we discuss the research challenges and propose promising directions for future work.
Problem

Research questions and friction points this paper is trying to address.

Lack of systematic survey on pun generation research
Need to review datasets and methods for pun generation
Challenges in evaluating pun quality and future directions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Comprehensive review of pun generation datasets
Analysis of deep learning and language models
Summary of automated and human evaluation metrics
🔎 Similar Papers
No similar papers found.
Y
Yuchen Su
School of Computer Science, University of Auckland, New Zealand
Y
Yonghua Zhu
Singapore University of Technology and Design, Singapore
R
Ruofan Wang
School of Computer Science, University of Auckland, New Zealand
Zijian Huang
Zijian Huang
ECE PhD Candidate, University of Michigan
LLM/VLMSecurityRLMR
D
Diana Benavides-Prado
School of Electronic Engineering and Computer Science, Queen Mary University of London
Michael Witbrock
Michael Witbrock
Professor of Computer Science, Waipapa Taumata Rau: The University of Auckland
Artificial IntelligenceReasoningDeep LearningRepresentation LearningNatural Language Understanding