🤖 AI Summary
To address the challenges of unstructured requirements documents, low efficiency, and error-proneness in manual processing, this paper proposes a dual-module framework for requirements engineering automation. The framework integrates heuristic text rules with an adaptively fine-tuned Transformer encoder to achieve end-to-end schematization of semi-structured requirements—jointly performing information extraction, standardized schema mapping, and automatic semantic labeling from raw textual input. It supports direct export to mainstream requirements management tools and seamless integration into existing RE toolchains. As the first systematic solution targeting structured transformation of requirements documentation, it achieves an average 18.7% improvement in F1-score across multiple industrial datasets, significantly enhancing classification accuracy and engineering reusability.
📝 Abstract
This paper presents the ReXCL tool, which automates the extraction and classification processes in requirement engineering, enhancing the software development lifecycle. The tool features two main modules: Extraction, which processes raw requirement documents into a predefined schema using heuristics and predictive modeling, and Classification, which assigns class labels to requirements using adaptive fine-tuning of encoder-based models. The final output can be exported to external requirement engineering tools. Performance evaluations indicate that ReXCL significantly improves efficiency and accuracy in managing requirements, marking a novel approach to automating the schematization of semi-structured requirement documents.