🤖 AI Summary
This work addresses the lack of personalization in existing code intelligence systems, which typically disregard individual developer differences. The authors propose VirtualME, a novel framework that leverages an IDE-embedded infrastructure to continuously capture developer behavior and construct a four-dimensional profile encompassing technical stack, capabilities, behavioral habits, and learning styles. For the first time, this profile is integrated into a code-based question-answering system to deliver personalized assistance. The approach combines log-level behavior extraction, multi-agent task recognition, and a rule-engine-driven profiling model, augmented with a large language model to generate tailored responses. Experimental evaluation on a real-world developer trajectory benchmark demonstrates that VirtualME achieves an average improvement of 33.80% across five evaluation dimensions, significantly outperforming general-purpose baselines.
📝 Abstract
With the advent of large language models, research in automated software engineering has increasingly focused on leveraging these models to achieve a deeper semantic understanding of code or to engineer sophisticated agent-based processes. However, this research trajectory has largely overlooked a critical factor: the developers themselves. Programming is a deeply individualized activity; developers exhibit significant variation in their tool-chain preferences, domain-specific expertise, and problem-solving strategies. Consequently, the current paradigm of one-size-fits-all code intelligence systems struggles to accommodate the needs of individual developers. To address this gap, we introduce VirtualME, a novel IDE-embedded data infrastructure designed to model the developer by continuously capturing and interpreting their dynamic programming behaviors and preferences. VirtualME contains three components. (1) Log-level Behavior Extraction: it captures and extracts developers' log-level behaviors from IDE. (2) Task-level Behavior Recognition: it aggregates log-level behaviors into task-level behaviors via a multi-agent pipeline. (3) Developer-personality Measurement: it builds a rule engine to distill a four-dimensional developer persona: technology stack, ability, behavioral habits, and learning style. On top of VirtualME, we propose a solution for personalized repository-level knowledge Q&A by integrating the developer persona into the Q&A agent. We evaluated VirtualME by building a multi-repository benchmark with real-world developer trajectories, balancing correctness and personalization. Experimental results show that VirtualME-enhanced answers outperform generic baselines on five dimensions, yielding an average 33.80% improvement. Our results demonstrate that abundant, continuous developer-behavior data can pave the new way for adaptive and personalized code intelligence.