MINT: Multi-Vector Search Index Tuning

๐Ÿ“… 2025-04-28
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
In multi-vector databases, systematic index tuning methodologies are lacking for multimodal or multi-feature scenarios, leading to high query latency and suboptimal trade-offs among storage cost, recall, and efficiency. Method: This paper formally defines the multi-vector search index tuning problem and proposes a holistic framework that jointly optimizes query latency, storage overhead, and recallโ€”departing from conventional single-vector or relational indexing paradigms. It introduces a workload-driven search space pruning algorithm, a multi-objective constrained modeling mechanism, and an efficient index evaluator. Contribution/Results: Evaluated on real-world multi-vector workloads, our approach reduces query latency by 2.1ร—โ€“8.3ร— over state-of-the-art baselines while satisfying user-specified storage and recall constraints. It identifies Pareto-optimal index configurations, enabling principled, workload-aware index selection in multi-vector settings.

Technology Category

Application Category

๐Ÿ“ Abstract
Vector search plays a crucial role in many real-world applications. In addition to single-vector search, multi-vector search becomes important for multi-modal and multi-feature scenarios today. In a multi-vector database, each row is an item, each column represents a feature of items, and each cell is a high-dimensional vector. In multi-vector databases, the choice of indexes can have a significant impact on performance. Although index tuning for relational databases has been extensively studied, index tuning for multi-vector search remains unclear and challenging. In this paper, we define multi-vector search index tuning and propose a framework to solve it. Specifically, given a multi-vector search workload, we develop algorithms to find indexes that minimize latency and meet storage and recall constraints. Compared to the baseline, our latency achieves 2.1X to 8.3X speedup.
Problem

Research questions and friction points this paper is trying to address.

Optimizing index selection for multi-vector search performance
Addressing latency, storage, and recall constraints in vector databases
Improving search efficiency in multi-modal and multi-feature scenarios
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-vector search index tuning framework
Algorithms optimize latency and constraints
Achieves 2.1X to 8.3X speedup
๐Ÿ”Ž Similar Papers
No similar papers found.