🤖 AI Summary
To address the low quality, lack of foresight, and poor feasibility of future-work suggestions in scientific writing, this paper proposes a novel framework integrating retrieval-augmented generation (RAG) with large language model (LLM)-driven self-feedback iteration. Methodologically, it introduces the “LLM-as-a-judge” paradigm for automated evaluation, coupled with multi-source literature retrieval (focusing on key sections), fine-tuned LLM-based generation, and a progressive optimization mechanism guided by LLM-generated feedback. Experimental results demonstrate significant improvements over baselines across quantitative metrics—including relevance, feasibility, and foresight—as well as in expert evaluations. Human assessment further confirms its high reliability as a tool for identifying and discriminating promising future research directions, effectively supporting both novice and experienced researchers in uncovering knowledge gaps and potential collaborative opportunities.
📝 Abstract
The future work section of a scientific article outlines potential research directions by identifying gaps and limitations of a current study. This section serves as a valuable resource for early-career researchers seeking unexplored areas and experienced researchers looking for new projects or collaborations. In this study, we generate future work suggestions from key sections of a scientific article alongside related papers and analyze how the trends have evolved. We experimented with various Large Language Models (LLMs) and integrated Retrieval-Augmented Generation (RAG) to enhance the generation process. We incorporate a LLM feedback mechanism to improve the quality of the generated content and propose an LLM-as-a-judge approach for evaluation. Our results demonstrated that the RAG-based approach with LLM feedback outperforms other methods evaluated through qualitative and quantitative metrics. Moreover, we conduct a human evaluation to assess the LLM as an extractor and judge. The code and dataset for this project are here, code: HuggingFace