A Multimodal Assistive System for Product Localization and Retrieval for People who are Blind or have Low Vision

📅 2026-01-18
📈 Citations: 0
Influential: 0
📄 PDF

career value

223K/year
🤖 AI Summary
This work addresses the challenge faced by blind or low-vision users in independently locating and retrieving products in physical retail environments. To this end, the authors propose a multimodal wearable assistive system that integrates YOLO-World object detection, a vision-language model (VLM), embedding similarity computation, and color histogram matching within a closed-loop “search–navigate–verify” pipeline. Real-time guidance is delivered through spatialized audio cues. The system represents the first end-to-end application of this synergistic combination of techniques for product localization and verification. It achieves high shelf detection accuracy within 1.5 meters and near-perfect product identification at close range. Moreover, VLM-driven navigation and verification attain accuracies of 94.4% and over 86%, respectively, substantially enhancing users’ autonomy in shopping tasks.

Technology Category

Application Category

📝 Abstract
Shopping is a routine activity for sighted individuals, yet for people who are blind or have low vision (pBLV), locating and retrieving products in physical environments remains a challenge. This paper presents a multimodal wearable assistive system that integrates object detection with vision-language models to support independent product or item retrieval, with the goal of enhancing users'autonomy and sense of agency. The system operates through three phases: product search, which identifies target products using YOLO-World detection combined with embedding similarity and color histogram matching; product navigation, which provides spatialized sonification and VLM-generated verbal descriptions to guide users toward the target; and product correction, which verifies whether the user has reached the correct product and provides corrective feedback when necessary. Technical evaluation demonstrated promising performance across all modules, with product detection achieving near-perfect accuracy at close range and high accuracy when facing shelves within 1.5 m. VLM-based navigation achieved up to 94.4% accuracy, and correction accuracy exceeded 86% under optimal model configurations. These results demonstrate the system's potential to address the last-meter problem in assistive shopping. Future work will focus on user studies with pBLV participants and integration with multi-scale navigation ecosystems.
Problem

Research questions and friction points this paper is trying to address.

product localization
assistive technology
blindness
low vision
shopping assistance
Innovation

Methods, ideas, or system contributions that make the work stand out.

multimodal assistive system
vision-language model (VLM)
YOLO-World
spatialized sonification
last-meter problem
🔎 Similar Papers
No similar papers found.
L
Ligao Ruan
Department of Mechanical and Aerospace Engineering, NYU Tandon School of Engineering, Brooklyn, New York, USA; Department of Biomedical Engineering, NYU Tandon School of Engineering, Brooklyn, New York, USA
Giles Hamilton-Fletcher
Giles Hamilton-Fletcher
Research Scientist, NYU Langone Health
sensory substitutioncross-modal correspondencesqualiasynaesthesia
M
Mahya Beheshti
Department of Mechanical and Aerospace Engineering, NYU Tandon School of Engineering, Brooklyn, New York, USA; Department of Rehabilitation Medicine, NYU Grossman School of Medicine, New York, New York, USA
T
Todd E. Hudson
Department of Rehabilitation Medicine, NYU Grossman School of Medicine, New York, New York, USA
M
M. Porfiri
Department of Mechanical and Aerospace Engineering, NYU Tandon School of Engineering, Brooklyn, New York, USA; Department of Biomedical Engineering, NYU Tandon School of Engineering, Brooklyn, New York, USA; Center for Urban Science and Progress, NYU Tandon School of Engineering, Brooklyn, New York, USA; Department of Civil and Urban Engineering, NYU Tandon School of Engineering, Brooklyn, New York, USA
J
John-Ross Rizzo
Department of Rehabilitation Medicine, NYU Grossman School of Medicine, New York, New York, USA; Department of Radiology, NYU Grossman School of Medicine, New York, New York, USA; Center for Urban Science and Progress, NYU Tandon School of Engineering, Brooklyn, New York, USA; Department of Biomedical Engineering, NYU Tandon School of Engineering, Brooklyn, New York, USA