🤖 AI Summary
This work addresses the critical limitations of large language models (LLMs) in counterargument generation—namely, low factual accuracy and uncontrolled verbosity. To tackle these issues, we propose a dynamic external knowledge integration framework that retrieves relevant evidence via real-time web search and incorporates an evidence-guided decoding control mechanism to enable empirically grounded, high-quality counterargument generation. Our contributions include: (1) constructing the first balanced, human-annotated counterargument dataset; (2) introducing an LLM-as-a-Judge automatic evaluation paradigm that achieves strong correlation with human judgments (Spearman’s ρ > 0.85); and (3) demonstrating consistent superiority over strong baselines across key metrics—including relevance, persuasiveness, and factual accuracy—on multiple benchmarks. Empirical results validate both the effectiveness and generalizability of real-time knowledge augmentation for counterargument generation.
📝 Abstract
This paper investigates the role of dynamic external knowledge integration in improving counter-argument generation using Large Language Models (LLMs). While LLMs have shown promise in argumentative tasks, their tendency to generate lengthy, potentially unfactual responses highlights the need for more controlled and evidence-based approaches. We introduce a new manually curated dataset of argument and counter-argument pairs specifically designed to balance argumentative complexity with evaluative feasibility. We also propose a new LLM-as-a-Judge evaluation methodology that shows a stronger correlation with human judgments compared to traditional reference-based metrics. Our experimental results demonstrate that integrating dynamic external knowledge from the web significantly improves the quality of generated counter-arguments, particularly in terms of relatedness, persuasiveness, and factuality. The findings suggest that combining LLMs with real-time external knowledge retrieval offers a promising direction for developing more effective and reliable counter-argumentation systems.