🤖 AI Summary
Accurate pedestrian localization in non-line-of-sight (NLoS) urban scenarios—such as T-junctions—is challenging: millimeter-wave (mmWave) radar suffers from multipath-induced point cloud spatial distortion, while cameras lack depth perception and cannot observe occluded regions. To address this, we propose the first vision-guided mmWave radar framework for NLoS pedestrian localization. Our method leverages camera-based semantic segmentation to extract road-structure priors, which guide spatiotemporal alignment and geometric reasoning over radar point clouds, effectively suppressing multipath-induced localization errors. By performing cross-modal image-radar joint reconstruction, our approach achieves stable, high-precision pedestrian localization within NLoS regions on a real-world vehicular platform. Experimental results demonstrate significant improvements in robustness and accuracy for complex urban perception, establishing a new paradigm for multimodal NLoS sensing.
📝 Abstract
Pedestrians Localization in Non-Line-of-Sight (NLoS) regions within urban environments poses a significant challenge for autonomous driving systems. While mmWave radar has demonstrated potential for detecting objects in such scenarios, the 2D radar point cloud (PCD) data is susceptible to distortions caused by multipath reflections, making accurate spatial inference difficult. Additionally, although camera images provide high-resolution visual information, they lack depth perception and cannot directly observe objects in NLoS regions. In this paper, we propose a novel framework that interprets radar PCD through road layout inferred from camera for localization of NLoS pedestrians. The proposed method leverages visual information from the camera to interpret 2D radar PCD, enabling spatial scene reconstruction. The effectiveness of the proposed approach is validated through experiments conducted using a radar-camera system mounted on a real vehicle. The localization performance is evaluated using a dataset collected in outdoor NLoS driving environments, demonstrating the practical applicability of the method.