Prompt-with-Me: in-IDE Structured Prompt Management for LLM-Driven Software Engineering

📅 2025-09-21

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

Large language models (LLMs) are increasingly deployed in software engineering, yet ad-hoc prompt management hinders reliability, reusability, and industrial adoption efficiency. Method: We propose the first software-engineering–specific four-dimensional prompt taxonomy—categorizing prompts by intent, role, software lifecycle phase, and prompt type—and design an integrated prompt management framework supporting sensitive information masking, automated template extraction, and language-optimized suggestions. Contribution/Results: Evaluated on 1,108 real-world prompts, our taxonomy achieves high classification accuracy. A user study with 11 practitioners demonstrates strong usability (System Usability Scale score = 73) and low cognitive load (NASA-TLX score = 21), significantly reducing redundant effort. This work establishes a scalable, integrable, and structured paradigm for LLM prompt engineering in software engineering contexts.

Technology Category

Application Category

📝 Abstract

Large Language Models are transforming software engineering, yet prompt management in practice remains ad hoc, hindering reliability, reuse, and integration into industrial workflows. We present Prompt-with-Me, a practical solution for structured prompt management embedded directly in the development environment. The system automatically classifies prompts using a four-dimensional taxonomy encompassing intent, author role, software development lifecycle stage, and prompt type. To enhance prompt reuse and quality, Prompt-with-Me suggests language refinements, masks sensitive information, and extracts reusable templates from a developer's prompt library. Our taxonomy study of 1108 real-world prompts demonstrates that modern LLMs can accurately classify software engineering prompts. Furthermore, our user study with 11 participants shows strong developer acceptance, with high usability (Mean SUS=73), low cognitive load (Mean NASA-TLX=21), and reported gains in prompt quality and efficiency through reduced repetitive effort. Lastly, we offer actionable insights for building the next generation of prompt management and maintenance tools for software engineering workflows.

Problem

Research questions and friction points this paper is trying to address.

Ad hoc prompt management hinders reliability and reuse in LLM-driven software engineering

Lack of structured prompt classification and integration into industrial development workflows

Need for automated prompt refinement, sensitive information masking, and template extraction

Innovation

Methods, ideas, or system contributions that make the work stand out.

Automatically classifies prompts using four-dimensional taxonomy

Suggests language refinements and masks sensitive information

Extracts reusable templates from developer prompt library

🔎 Similar Papers

No similar papers found.

Authors to Follow