🤖 AI Summary
This work addresses the challenge of extracting shared latent structures from two-view multimodal data when entries are missing completely at random. Under a high-dimensional proportional spiked model, the masked cross-covariance matrix is normalized into a signal-attenuated spiked rectangular random matrix, revealing for the first time a missingness-induced Baik–Ben Arous–Péché (BBP)-type phase transition. The authors derive analytically the critical signal-to-noise ratio threshold and a closed-form expression for the asymptotic alignment between the leading singular vectors and the underlying shared directions. Leveraging tools from high-dimensional random matrix theory and spectral partial least squares (PLS) analysis, the theoretical predictions exhibit excellent agreement with simulations and semi-synthetic multimodal experiments across varying aspect ratios, signal strengths, and missing rates, thereby validating both the phase transition boundary and the recovery performance.
📝 Abstract
Partial Least Squares (PLS) learns shared structure from paired data via the top singular vectors of the empirical cross-covariance (PLS-SVD), but multimodal datasets often have missing entries in both views. We study PLS-SVD under independent entry-wise missing-completely-at-random masking in a proportional high-dimensional spiked model. After appropriate normalization, the masked cross-covariance behaves like a spiked rectangular random matrix whose effective signal strength is attenuated by $\sqrt{\rho}$, where $\rho$ is the joint entry retention probability. As a result, PLS-SVD exhibits a sharp BBP-type phase transition: below a critical signal-to-noise threshold the leading singular vectors are asymptotically uninformative, while above it they achieve nontrivial alignment with the latent shared directions, with closed-form asymptotic overlap formulas. Simulations and semi-synthetic multimodal experiments corroborate the predicted phase diagram and recovery curves across aspect ratios, signal strengths, and missingness levels.