FaST: Feature-aware Sampling and Tuning for Personalized Preference Alignment with Limited Data

📅 2025-08-06
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper addresses the challenge of personalizing large language model (LLM)-based dialogue assistants under extreme scarcity of individual user preference data—often limited to only a few annotated examples—introducing the novel task of *Personalized Preference Alignment under Limited Data* (PPALLI). To tackle this, we propose the Feature-aware Sampling and Tuning framework (FaST), which first automatically discovers high-level semantic features from sparse user feedback, then performs parameter-efficient fine-tuning guided by these features. We evaluate FaST on two newly constructed datasets, DnD and ELIP, demonstrating consistent and significant improvements over state-of-the-art personalized alignment methods across multiple low-resource benchmarks. Results confirm its effectiveness, generalizability, and deployment feasibility. Our core contribution lies in tightly coupling interpretable feature discovery with lightweight adaptation—enabling scalable, transparent, and sample-efficient personalized alignment for the first time in ultra-low-data regimes.

Technology Category

Application Category

📝 Abstract
LLM-powered conversational assistants are often deployed in a one-size-fits-all manner, which fails to accommodate individual user preferences. Recently, LLM personalization -- tailoring models to align with specific user preferences -- has gained increasing attention as a way to bridge this gap. In this work, we specifically focus on a practical yet challenging setting where only a small set of preference annotations can be collected per user -- a problem we define as Personalized Preference Alignment with Limited Data (PPALLI). To support research in this area, we introduce two datasets -- DnD and ELIP -- and benchmark a variety of alignment techniques on them. We further propose FaST, a highly parameter-efficient approach that leverages high-level features automatically discovered from the data, achieving the best overall performance.
Problem

Research questions and friction points this paper is trying to address.

Aligning LLMs with individual user preferences using limited data
Addressing Personalized Preference Alignment with Limited Data (PPALLI)
Improving personalization efficiency via feature-aware sampling and tuning
Innovation

Methods, ideas, or system contributions that make the work stand out.

Feature-aware sampling for personalized alignment
Parameter-efficient tuning with limited data
Automatic high-level feature discovery
🔎 Similar Papers
No similar papers found.