Sharp Capacity Thresholds in Linear Associative Memory: From Winner-Take-All to Listwise Retrieval

πŸ“… 2026-05-06
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF

career value

228K/year
πŸ€– AI Summary
This study investigates the theoretical storage capacity limits of linear associative memory models under distinct retrieval criteria. Focusing on Gaussian-distributed linear memory matrices, it distinguishes between winner-take-all (top-1) and list-based retrieval mechanisms, leveraging tools from random matrix theory, extreme value statistics, and convex optimization to establish precise asymptotic capacity characterizations. The work demonstrates that top-1 retrieval incurs a logarithmic capacity penalty and proves that correlated matrix constructions achieve optimal logarithmic scaling in this regime. Furthermore, it introduces the tail-average margin (TAM) criterion, which enables quadratic capacity scaling under list-based retrieval, yielding closed-form expressions for the critical load and the distribution of retrieval scores. The analysis also leads to the conjecture that the top-1 threshold satisfies \(d^2 \sim 2n \log n\).
πŸ“ Abstract
How many key-value associations can a $d\times d$ linear memory store? We show that the answer depends not only on the $d^2$ degrees of freedom in the memory matrix, but also on the retrieval criterion. In an isotropic Gaussian model for the stored pairs, we show that top-1 retrieval, where every signal must beat its largest distractor, requires the logarithmic model-size scale $d^2\asymp n\log n$. We prove that the correlation matrix memory construction, which stores associations by superposing key-target outer products, achieves this scale through a sharp phase transition, and that the same scaling is necessary for any linear memory. Thus the logarithm is the intrinsic extreme-value price of winner-take-all decoding. We next consider listwise retrieval, where the correct target need not be the unique top-scoring item but should remain among the strongest candidates. To formalize this regime, we propose the Tail-Average Margin (TAM), a convex upper-tail criterion that certifies inclusion of the correct target in a controlled candidate list. Under this listwise retrieval criterion, the capacity follows the quadratic scale $d^2\asymp n$. At load $n/d^2\toΞ±$, we develop an exact asymptotic theory for the TAM empirical-risk minimizer through a two-parameter scalar variational principle. The theory has a rich phenomenology: in the ridgeless limit it yields a closed-form critical load separating satisfiable and unsatisfiable phases, and it predicts the limiting laws of true scores, competitor scores, margins, and percentile profiles. Finally, a small-tail extrapolation further leads to the conjectural sharp top-1 threshold $d^2\sim 2n\log n$.
Problem

Research questions and friction points this paper is trying to address.

associative memory
capacity threshold
linear memory
retrieval criterion
extreme-value statistics
Innovation

Methods, ideas, or system contributions that make the work stand out.

linear associative memory
capacity threshold
listwise retrieval
Tail-Average Margin
phase transition
πŸ”Ž Similar Papers