🤖 AI Summary
This paper addresses the low efficiency of repetitive data tasks—such as cross-site web scraping, content monitoring, summarization, and preliminary exploration—in investigative journalism. To tackle this, we propose a novel human-AI collaboration paradigm integrating large language models (LLMs) with programming-by-demonstration (PbD). Through an within-subject user study, in-depth interviews, and speculative design, we introduce PbD to investigative journalism workflows for the first time, establishing a journalist-centered task taxonomy (data acquisition and reporting) and identifying six high-value automation opportunities. Our prototype significantly lowers the barrier to cross-site data collection; we distill eight actionable guidelines for AI integration, validated and highly endorsed by frontline investigative journalists. Key contributions include: (1) pioneering an implementation pathway for LLM+PbD in journalistic practice; and (2) advancing a task-oriented, human-centered methodology for AI-augmented journalism.
📝 Abstract
As the capabilities of Large Language Models (LLMs) expand, more researchers are studying their adoption in newsrooms. However, much of the research focus remains broad and does not address the specific technical needs of investigative journalists. Therefore, this paper presents several applied use cases where automation and AI intersect with investigative journalism. We conducted a within-subjects user study with eight investigative journalists. In interviews, we elicited practical use cases using a speculative design approach by having journalists react to a prototype of a system that combines LLMs and Programming-by-Demonstration (PbD) to simplify data collection on numerous websites. Based on user reports, we classified the journalistic processes into data collecting and reporting. Participants indicated they utilize automation to handle repetitive tasks like content monitoring, web scraping, summarization, and preliminary data exploration. Following these insights, we provide guidelines on how investigative journalism can benefit from AI and automation.