🤖 AI Summary
The proliferation of misinformation on social media necessitates scalable fact-checking technologies, for which claim detection—identifying objectively verifiable statements—is a critical prerequisite. Existing approaches rely primarily on linguistic cues or claim-worthiness estimation, exhibiting limited robustness when handling ambiguous political statements and short, heterogeneous texts (e.g., tweets). To address semantic ambiguity and format heterogeneity, we propose a retrieval-augmented claim detection framework that jointly models evidence relevance structure signals and source credibility. Our method integrates external knowledge via retrieval while explicitly encoding structural relationships among candidate claims and supporting evidence, alongside calibrated source reliability scores. Evaluated on CT22-test and PoliClaim-test benchmarks, our approach significantly outperforms pure-text baselines and conventional retrieval-based methods, achieving state-of-the-art accuracy and F1-score. Results demonstrate that synergistic multi-signal modeling substantially enhances claim detection performance in complex, real-world contexts.
📝 Abstract
The rapid spread of misinformation on social media underscores the need for scalable fact-checking tools. A key step is claim detection, which identifies statements that can be objectively verified. Prior approaches often rely on linguistic cues or claim check-worthiness, but these struggle with vague political discourse and diverse formats such as tweets. We present RAVE (Retrieval and Scoring Aware Verifiable Claim Detection), a framework that combines evidence retrieval with structured signals of relevance and source credibility. Experiments on CT22-test and PoliClaim-test show that RAVE consistently outperforms text-only and retrieval-based baselines in both accuracy and F1.