Prompt-with-Me: in-IDE Structured Prompt Management for LLM-Driven Software Engineering

📅 2025-09-21
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Large language models (LLMs) are increasingly deployed in software engineering, yet ad-hoc prompt management hinders reliability, reusability, and industrial adoption efficiency. Method: We propose the first software-engineering–specific four-dimensional prompt taxonomy—categorizing prompts by intent, role, software lifecycle phase, and prompt type—and design an integrated prompt management framework supporting sensitive information masking, automated template extraction, and language-optimized suggestions. Contribution/Results: Evaluated on 1,108 real-world prompts, our taxonomy achieves high classification accuracy. A user study with 11 practitioners demonstrates strong usability (System Usability Scale score = 73) and low cognitive load (NASA-TLX score = 21), significantly reducing redundant effort. This work establishes a scalable, integrable, and structured paradigm for LLM prompt engineering in software engineering contexts.

Technology Category

Application Category

📝 Abstract
Large Language Models are transforming software engineering, yet prompt management in practice remains ad hoc, hindering reliability, reuse, and integration into industrial workflows. We present Prompt-with-Me, a practical solution for structured prompt management embedded directly in the development environment. The system automatically classifies prompts using a four-dimensional taxonomy encompassing intent, author role, software development lifecycle stage, and prompt type. To enhance prompt reuse and quality, Prompt-with-Me suggests language refinements, masks sensitive information, and extracts reusable templates from a developer's prompt library. Our taxonomy study of 1108 real-world prompts demonstrates that modern LLMs can accurately classify software engineering prompts. Furthermore, our user study with 11 participants shows strong developer acceptance, with high usability (Mean SUS=73), low cognitive load (Mean NASA-TLX=21), and reported gains in prompt quality and efficiency through reduced repetitive effort. Lastly, we offer actionable insights for building the next generation of prompt management and maintenance tools for software engineering workflows.
Problem

Research questions and friction points this paper is trying to address.

Ad hoc prompt management hinders reliability and reuse in LLM-driven software engineering
Lack of structured prompt classification and integration into industrial development workflows
Need for automated prompt refinement, sensitive information masking, and template extraction
Innovation

Methods, ideas, or system contributions that make the work stand out.

Automatically classifies prompts using four-dimensional taxonomy
Suggests language refinements and masks sensitive information
Extracts reusable templates from developer prompt library
🔎 Similar Papers
No similar papers found.
Z
Ziyou Li
Delft University of Technology, JetBrains Research
A
Agnia Sergeyuk
JetBrains Research
Maliheh Izadi
Maliheh Izadi
Assistant Professor @ Delft University of Technology, The Netherlands
Software engineeringEvaluationAI4SELLM4CodeAgents