🤖 AI Summary
Systematic literature reviews (SLRs) are time-consuming, error-prone, and poorly supported by existing tools, which only automate isolated steps rather than enabling end-to-end intelligent collaboration. This paper proposes a semi-automated SLR framework integrating iterative snowballing retrieval, human-in-the-loop screening, and large language model (LLM)-driven content understanding—including topic modeling, key information extraction, and intelligent question answering. Its core contribution lies in deeply embedding LLMs across the entire SLR workflow to enable semantic-aware literature analysis and dynamic, feedback-driven iteration; reliability is ensured via configurable human verification checkpoints. Empirical evaluation demonstrates that the approach reduces screening and analysis time by 2.3× on average, improves annotation consistency (Cohen’s Kappa increases by 0.32), and enhances procedural reproducibility. The framework has been validated across diverse domains—including software engineering, medicine, and education—confirming its cross-domain efficacy.
📝 Abstract
Systematic reviews and mapping studies are critical for synthesizing research, identifying gaps, and guiding future work, but they are often labor-intensive and time-consuming. Existing tools provide partial support for specific steps, leaving much of the process manual and error-prone. We present ProfOlaf, a semi-automated tool designed to streamline systematic reviews while maintaining methodological rigor. ProfOlaf supports iterative snowballing for article collection with human-in-the-loop filtering and uses large language models to assist in analyzing articles, extracting key topics, and answering queries about the content of papers. By combining automation with guided manual effort, ProfOlaf enhances the efficiency, quality, and reproducibility of systematic reviews across research fields. A video describing and demonstrating ProfOlaf is available at: https://youtu.be/4noUXfcmxsE