🤖 AI Summary
Legal practitioners require efficient and precise key information extraction during the Early Case Assessment (ECA) phase of electronic discovery (eDiscovery), yet conventional keyword and semantic search approaches exhibit notable limitations. This study presents the first systematic evaluation of five retrieval strategies—keyword, semantic, vector, hybrid, and hybrid-semantic—within a legal Retrieval-Augmented Generation (RAG) framework powered by Azure AI Search and large language models (LLMs) for natural-language question answering. Performance is measured along three dimensions: accuracy, relevance, and response consistency. Results demonstrate that hybrid-semantic retrieval significantly outperforms all alternatives, achieving the highest accuracy and stability in legal document review tasks. This work provides empirically grounded guidance for configuring optimal RAG systems in ECA, offering both methodological rigor and practical implementation insights. It addresses a critical gap in the literature by delivering the first comparative analysis of retrieval strategies specifically tailored to the legal domain.
📝 Abstract
Increasingly, attorneys are interested in moving beyond keyword and semantic search to improve the efficiency of how they find key information during a document review task. Large language models (LLMs) are now seen as tools that attorneys can use to ask natural language questions of their data during document review to receive accurate and concise answers. This study evaluates retrieval strategies within Microsoft Azure's Retrieval-Augmented Generation (RAG) framework to identify effective approaches for Early Case Assessment (ECA) in eDiscovery. During ECA, legal teams analyze data at the outset of a matter to gain a general understanding of the data and attempt to determine key facts and risks before beginning full-scale review. In this paper, we compare the performance of Azure AI Search's keyword, semantic, vector, hybrid, and hybrid-semantic retrieval methods. We then present the accuracy, relevance, and consistency of each method's AI-generated responses. Legal practitioners can use the results of this study to enhance how they select RAG configurations in the future.