Kernel $k$-Medoids as General Vector Quantization

📅 2025-06-05

📈 Citations: 0

✨ Influential: 0

career value

224K/year

🤖 AI Summary

This work uncovers an intrinsic unification between distance-based $k$-medoids clustering and probability-density-matching vector quantization (VQ) via kernel density estimation (KDE), both formulated within the quadratic unconstrained binary optimization (QUBO) framework. Methodologically, we cast both paradigms as QUBO problems and rigorously prove— for the first time—that the KDE-QUBO formulation is a special case of the $k$-medoids-QUBO problem under a kernel-induced feature mapping. This equivalence is characterized jointly by the maximum mean discrepancy (MMD) and kernel metrics, thereby establishing a structural bridge between distance-driven and distribution-matching VQ. Furthermore, we provide a geometric interpretation of the weighting parameters in VQ, substantially enhancing model interpretability. Collectively, these results yield a unified theoretical foundation for designing efficient, provably optimal hard quantization algorithms.

Technology Category

Application Category

📝 Abstract

Vector Quantization (VQ) is a widely used technique in machine learning and data compression, valued for its simplicity and interpretability. Among hard VQ methods, $k$-medoids clustering and Kernel Density Estimation (KDE) approaches represent two prominent yet seemingly unrelated paradigms -- one distance-based, the other rooted in probability density matching. In this paper, we investigate their connection through the lens of Quadratic Unconstrained Binary Optimization (QUBO). We compare a heuristic QUBO formulation for $k$-medoids, which balances centrality and diversity, with a principled QUBO derived from minimizing Maximum Mean Discrepancy in KDE-based VQ. Surprisingly, we show that the KDE-QUBO is a special case of the $k$-medoids-QUBO under mild assumptions on the kernel's feature map. This reveals a deeper structural relationship between these two approaches and provides new insight into the geometric interpretation of the weighting parameters used in QUBO formulations for VQ.

Problem

Research questions and friction points this paper is trying to address.

Connects k-medoids and KDE via QUBO optimization

Compares heuristic and principled QUBO for VQ

Reveals structural relationship between distance and density methods

Innovation

Methods, ideas, or system contributions that make the work stand out.

Connects k-medoids and KDE via QUBO

Balances centrality and diversity heuristically

Reveals KDE-QUBO as k-medoids special case

🔎 Similar Papers

Robust Clustering on High-Dimensional Data with Stochastic Quantization