A Survey on Automatic Credibility Assessment of Textual Credibility Signals in the Era of Large Language Models

📅 2024-10-28
🏛️ arXiv.org
📈 Citations: 1
Influential: 0
📄 PDF
🤖 AI Summary
The proliferation of social media and generative AI has intensified the need for automated content credibility assessment; however, existing research remains fragmented, lacking systematic integration and unified modeling of multi-source credibility signals—including factual accuracy, bias, persuasive techniques, logical fallacies, and claim veracity. This paper systematically reviews 175 studies and introduces the first comprehensive taxonomy covering nine categories of textual credibility signals, organized along four core dimensions: factual accuracy, subjectivity/bias, persuasion/logical fallacies, and claim veracity. We propose a novel LLM-driven credibility assessment paradigm that synergistically integrates NLP, explainable AI, multidimensional signal modeling, and advanced prompting/fine-tuning techniques. Our work establishes the most comprehensive literature map of credibility signal assessment to date, identifies key technical bottlenecks, and reveals new opportunities enabled by generative AI—thereby providing both a theoretical framework and practical guidelines for trustworthy NLP.

Technology Category

Application Category

📝 Abstract
In the current era of social media and generative AI, an ability to automatically assess the credibility of online social media content is of tremendous importance. Credibility assessment is fundamentally based on aggregating credibility signals, which refer to small units of information, such as content factuality, bias, or a presence of persuasion techniques, into an overall credibility score. Credibility signals provide a more granular, more easily explainable and widely utilizable information in contrast to currently predominant fake news detection, which utilizes various (mostly latent) features. A growing body of research on automatic credibility assessment and detection of credibility signals can be characterized as highly fragmented and lacking mutual interconnections. This issue is even more prominent due to a lack of an up-to-date overview of research works on automatic credibility assessment. In this survey, we provide such systematic and comprehensive literature review of 175 research papers while focusing on textual credibility signals and Natural Language Processing (NLP), which undergoes a significant advancement due to Large Language Models (LLMs). While positioning the NLP research into the context of other multidisciplinary research works, we tackle with approaches for credibility assessment as well as with 9 categories of credibility signals (we provide a thorough analysis for 3 of them, namely: 1) factuality, subjectivity and bias, 2) persuasion techniques and logical fallacies, and 3) claims and veracity). Following the description of the existing methods, datasets and tools, we identify future challenges and opportunities, while paying a specific attention to recent rapid development of generative AI.
Problem

Research questions and friction points this paper is trying to address.

Automatically assessing online content credibility using textual signals
Integrating fragmented research on multiple credibility signal detection
Providing comprehensive overview of NLP credibility assessment methods
Innovation

Methods, ideas, or system contributions that make the work stand out.

Systematic literature review of 175 NLP papers
Analyzes nine categories of textual credibility signals
Focuses on factuality, persuasion techniques, and bias
🔎 Similar Papers
No similar papers found.
Ivan Srba
Ivan Srba
Kempelen Institute of Intelligent Technologies
AIMachine LearningNatural Language ProcessingSocial ComputingDisinformation
O
Olesya Razuvayevskaya
The University of Sheffield, UK
J
João A. Leite
The University of Sheffield, UK
R
Róbert Móro
Kempelen Institute of Intelligent Technologies, Slovakia
Ipek Baris Schlicht
Ipek Baris Schlicht
Universitat Politècnica de València, Deutsche Welle
Applied NLPMisinformation DetectionComputational Journalism
Sara Tonelli
Sara Tonelli
Head of Research Unit, Fondazione Bruno Kessler
Natural Language ProcessingDigital HumanitiesArtificial Intelligence
F
Francisco Moreno Garc'ia
Universidad Politécnica de Madrid, Spain
S
Santiago Barrio Lottmann
Universidad Politécnica de Madrid, Spain
D
Denis Teyssou
Agence France-Presse, France
V
Valentin Porcellini
Agence France-Presse, France
Carolina Scarton
Carolina Scarton
Senior Lecturer in Natural Language Processing, NLP group / GATE group, University of Sheffield
Social Media AnalysisText SimplificationMachine TranslationNatural Language ProcessingArtificial Intelligence
Kalina Bontcheva
Kalina Bontcheva
Professor of Text Analytics, University of Sheffield
Natural Language Processing
M
M. Bieliková
Kempelen Institute of Intelligent Technologies, Slovakia