On the Road to Personalized Code Intelligence: Portraiting and Assisting Developers Based on Their In-IDE Behaviors

📅 2026-05-28
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the lack of personalization in existing code intelligence systems, which typically disregard individual developer differences. The authors propose VirtualME, a novel framework that leverages an IDE-embedded infrastructure to continuously capture developer behavior and construct a four-dimensional profile encompassing technical stack, capabilities, behavioral habits, and learning styles. For the first time, this profile is integrated into a code-based question-answering system to deliver personalized assistance. The approach combines log-level behavior extraction, multi-agent task recognition, and a rule-engine-driven profiling model, augmented with a large language model to generate tailored responses. Experimental evaluation on a real-world developer trajectory benchmark demonstrates that VirtualME achieves an average improvement of 33.80% across five evaluation dimensions, significantly outperforming general-purpose baselines.
📝 Abstract
With the advent of large language models, research in automated software engineering has increasingly focused on leveraging these models to achieve a deeper semantic understanding of code or to engineer sophisticated agent-based processes. However, this research trajectory has largely overlooked a critical factor: the developers themselves. Programming is a deeply individualized activity; developers exhibit significant variation in their tool-chain preferences, domain-specific expertise, and problem-solving strategies. Consequently, the current paradigm of one-size-fits-all code intelligence systems struggles to accommodate the needs of individual developers. To address this gap, we introduce VirtualME, a novel IDE-embedded data infrastructure designed to model the developer by continuously capturing and interpreting their dynamic programming behaviors and preferences. VirtualME contains three components. (1) Log-level Behavior Extraction: it captures and extracts developers' log-level behaviors from IDE. (2) Task-level Behavior Recognition: it aggregates log-level behaviors into task-level behaviors via a multi-agent pipeline. (3) Developer-personality Measurement: it builds a rule engine to distill a four-dimensional developer persona: technology stack, ability, behavioral habits, and learning style. On top of VirtualME, we propose a solution for personalized repository-level knowledge Q&A by integrating the developer persona into the Q&A agent. We evaluated VirtualME by building a multi-repository benchmark with real-world developer trajectories, balancing correctness and personalization. Experimental results show that VirtualME-enhanced answers outperform generic baselines on five dimensions, yielding an average 33.80% improvement. Our results demonstrate that abundant, continuous developer-behavior data can pave the new way for adaptive and personalized code intelligence.
Problem

Research questions and friction points this paper is trying to address.

personalized code intelligence
developer behavior
IDE interactions
individual differences
code assistance
Innovation

Methods, ideas, or system contributions that make the work stand out.

personalized code intelligence
developer modeling
IDE behavior analysis
multi-agent pipeline
developer persona
🔎 Similar Papers
No similar papers found.