STAMP: Selective Task-Aware Mechanism for Text Privacy

πŸ“… 2026-03-12
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This work addresses the challenge of balancing utility and privacy in text privacy preservation, where uniform noise addition often leads to suboptimal trade-offs. To this end, we propose STAMP, a novel framework that first jointly evaluates each token’s importance to downstream tasks and its privacy sensitivity, enabling task-aware, fine-grained privacy budget allocation. We further introduce a polar-coordinate perturbation mechanism that preserves the norm of embeddings while perturbing only their direction on the unit hypersphere, aligning privacy noise with the geometry of decoding. By integrating task-specific representations, sensitivity analysis, and cosine-based nearest-neighbor decoding, STAMP consistently outperforms existing methods across SQuAD, Yelp, and AG News benchmarks, achieving superior privacy-utility trade-offs under varying privacy budgets.

Technology Category

Application Category

πŸ“ Abstract
We present STAMP (Selective Task-Aware Mechanism for Text Privacy), a new framework for task-aware text privatization that achieves an improved privacy-utility trade-off. STAMP selectively allocates privacy budgets across tokens by jointly considering (i) each token's importance to the downstream task (as measured via a task- or query-specific representation), and (ii) its privacy sensitivity (e.g., names, dates, identifiers). This token-level partitioning enables fine-grained, group-wise control over the level of noise applied to different parts of the input, balancing privacy protection with task relevance. To privatize individual token embeddings, we introduce the polar mechanism, which perturbs only the direction of embeddings on the unit sphere while preserving their magnitude. Decoding is performed via cosine nearest-neighbor search, aligning the perturbation geometry with the decoding geometry. Unlike isotropic noise mechanisms, the polar mechanism maintains semantic neighborhoods in the embedding space and better preserves downstream utility. Experimental evaluations on SQuAD, Yelp, and AG News datasets demonstrate that STAMP, when combined with the normalized polar mechanism, consistently achieves superior privacy-utility trade-offs across varying per-token privacy budgets.
Problem

Research questions and friction points this paper is trying to address.

text privacy
privacy-utility trade-off
task-aware privatization
differential privacy
token-level privacy
Innovation

Methods, ideas, or system contributions that make the work stand out.

task-aware privacy
token-level privacy budgeting
polar mechanism
differential privacy
embedding perturbation
πŸ”Ž Similar Papers
No similar papers found.