🤖 AI Summary
This study investigates the geometric distinctions between antonym and synonym pairs in word embeddings, with a focus on whether their difference vectors encode a discernible structural signature of "antonymy." By computing difference vectors for such word pairs across multiple mainstream embedding models and visualizing them via UMAP dimensionality reduction, the research reveals that antonym pairs consistently exhibit a stable, vortex-like geometric pattern in low-dimensional space—a configuration markedly distinct from that of synonym pairs and robust across models. These findings suggest that semantic opposition may possess a universal geometric representation within word embedding spaces, offering a novel perspective on the internal semantic organization of these models.
📝 Abstract
Antonyms, or opposites, are sometimes defined as \emph{word pairs that have all of the same contextually relevant properties but one}. Seeing how transformer models seem to encode concepts as directions, this begs the question if one can detect ``antonymity'' in the geometry of the embedding vectors of word pairs, especially based on their difference vectors. Such geometrical studies are then naturally contrasted by comparing antonymic pairs to their opposites; synonyms.
This paper started as an exploratory project on the complexity of the systems needed to detect the geometry of the embedding vectors of antonymic word pairs. What we now report is a curious ``swirl'' that appears across embedding models in a somewhat specific projection configuration.