🤖 AI Summary
This paper addresses the critical question of why machine learning (ML) has failed to substantively advance high-dimensional vector search. Employing meta-research methodologies—including bibliometric analysis, paradigm comparison, and conceptual mapping—it systematically uncovers the root causes of the long-standing parallel evolution between ML and vector search: fundamental misalignments in algorithmic objectives (e.g., interpretability vs. accuracy), evaluation paradigms (e.g., theoretical bounds vs. engineering metrics), and industrial requirements (e.g., low latency, scalability). The study introduces, for the first time, the conceptual framework of “cross-domain co-evolution,” critically identifying three core barriers: weak theoretical connections, fragmented evaluation ecosystems, and mismatched technology transfer. These findings provide both a rigorous theoretical foundation and actionable design principles for developing next-generation vector retrieval systems natively grounded in ML principles.
📝 Abstract
Machine learning and vector search are two research topics that developed in parallel in nearby communities. However, unlike many other fields related to big data, machine learning has not significantly impacted vector search. In this opinion paper we attempt to explain this oddity. Along the way, we wander over the numerous bridges between the two fields.