When Embedding Models Meet: Procrustes Bounds and Applications

📅 2025-10-15
📈 Citations: 0
Influential: 0
🤖 AI Summary
Embedding models trained independently on similar data capture stable semantic information but produce inconsistent representation spaces, hindering interoperability across models. This work addresses compatibility challenges in multimodal search and model upgrades via orthogonal-transformation-based embedding alignment. Theoretically, we prove that if pairwise inner products are approximately preserved between two sets of embeddings, then there exists an orthogonal transformation that closely aligns them, and we provide a tight bound on the alignment error, establishing a rigorous theoretical foundation for alignment. Methodologically, we employ efficient Procrustes analysis as a post-hoc alignment procedure that preserves the intrinsic geometric structure of each embedding space while enabling cross-model alignment. Experiments demonstrate substantial improvements in compatibility across model retrainings, text retrieval fusion accuracy, and cross-modal search performance; the method achieves state-of-the-art results in hybrid multimodal search.

📝 Abstract
Embedding models trained separately on similar data often produce representations that encode stable information but are not directly interchangeable. This lack of interoperability raises challenges in several practical applications, such as model retraining, partial model upgrades, and multimodal search. Driven by these challenges, we study when two sets of embeddings can be aligned by an orthogonal transformation. We show that if pairwise dot products are approximately preserved, then there exists an isometry that closely aligns the two sets, and we provide a tight bound on the alignment error. This insight yields a simple alignment recipe, Procrustes post-processing, that makes two embedding models interoperable while preserving the geometry of each embedding space. Empirically, we demonstrate its effectiveness in three applications: maintaining compatibility across retrainings, combining different models for text retrieval, and improving mixed-modality search, where it achieves state-of-the-art performance.
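The "simple alignment recipe" described above is classical orthogonal Procrustes analysis: given paired embeddings from two models, find the orthogonal matrix minimizing the Frobenius distance between them via an SVD. A minimal sketch (synthetic data and function names are illustrative, not the paper's code):

```python
import numpy as np

def procrustes_align(X, Y):
    """Return the orthogonal R minimizing ||X @ R - Y||_F (classic SVD solution)."""
    U, _, Vt = np.linalg.svd(X.T @ Y)
    return U @ Vt

# Simulate two models whose embeddings differ by a rotation plus small noise.
rng = np.random.default_rng(0)
Y = rng.standard_normal((100, 32))                    # embeddings from model B
Q, _ = np.linalg.qr(rng.standard_normal((32, 32)))    # hidden orthogonal transform
X = Y @ Q.T + 0.01 * rng.standard_normal((100, 32))   # embeddings from model A

R = procrustes_align(X, Y)
aligned = X @ R
# R is orthogonal, so all pairwise inner products within X are preserved,
# while the residual alignment error to Y stays small.
print(np.allclose(R @ R.T, np.eye(32), atol=1e-8))
print(np.linalg.norm(aligned - Y) / np.linalg.norm(Y))
```

Because `R` is orthogonal, applying it leaves each model's internal geometry untouched, which is exactly the post-processing property the abstract emphasizes.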
Problem

Research questions and friction points this paper is trying to address.

Aligning embeddings from separately trained models
Enabling interoperability through orthogonal transformations
Improving multimodal search and model compatibility
Innovation

Methods, ideas, or system contributions that make the work stand out.

Aligns embeddings via orthogonal transformation
Uses Procrustes post-processing for interoperability
Preserves embedding geometry while enabling compatibility
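The last point above follows directly from orthogonality: an orthogonal map leaves every pairwise inner product (and hence all distances and cosine similarities) unchanged. A minimal check, with synthetic data for illustration:

```python
import numpy as np

rng = np.random.default_rng(1)
X = rng.standard_normal((50, 16))                     # embeddings from one model
Q, _ = np.linalg.qr(rng.standard_normal((16, 16)))    # any orthogonal transform

# Gram matrices (all pairwise inner products) before and after the transform:
# (X Q)(X Q)^T = X Q Q^T X^T = X X^T, so they are identical.
gram_before = X @ X.T
gram_after = (X @ Q) @ (X @ Q).T
print(np.allclose(gram_before, gram_after))
```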