Multiple Object Detection and Tracking in Panoramic Videos for Cycling Safety Analysis

📅 2024-07-21
🏛️ arXiv.org
📈 Citations: 1
Influential: 0
📄 PDF
🤖 AI Summary
To address degraded multi-object detection and tracking performance in 360° omnidirectional cycling videos—caused by severe distortion, densely packed small objects, and discontinuous object boundaries—this paper proposes an end-to-end solution. First, a quad-view spherical projection strategy is designed to enhance robustness in detecting small objects. Second, DeepSORT is improved with boundary continuity modeling and class-aware association to mitigate trajectory fragmentation and misassociation. Third, we introduce the first overtaking behavior recognition model built upon omnidirectional tracking trajectories. Evaluated on a newly constructed panoramic cycling dataset, our method achieves a significant improvement in detection AP, with MOTA and IDF1 increased by 7.6% and 9.7%, respectively; overtaking detection attains an F-score of 0.88. The complete source code is fully open-sourced to ensure reproducibility.

Technology Category

Application Category

📝 Abstract
Panoramic cycling videos can record 360{deg} views around the cyclists. Thus, it is essential to conduct automatic road user analysis on them using computer vision models to provide data for studies on cycling safety. However, the features of panoramic data such as severe distortions, large number of small objects and boundary continuity have brought great challenges to the existing CV models, including poor performance and evaluation methods that are no longer applicable. In addition, due to the lack of data with annotations, it is not easy to re-train the models. In response to these problems, the project proposed and implemented a three-step methodology: (1) improve the prediction performance of the pre-trained object detection models on panoramic data by projecting the original image into 4 perspective sub-images; (2) introduce supports for boundary continuity and category information into DeepSORT, a commonly used multiple object tracking model, and set an improved detection model as its detector; (3) using the tracking results, develop an application for detecting the overtaking behaviour of the surrounding vehicles. Evaluated on the panoramic cycling dataset built by the project, the proposed methodology improves the average precision of YOLO v5m6 and Faster RCNN-FPN under any input resolution setting. In addition, it raises MOTA and IDF1 of DeepSORT by 7.6% and 9.7% respectively. When detecting the overtakes in the test videos, it achieves the F-score of 0.88. The code is available on GitHub at github.com/cuppp1998/360_object_tracking to ensure the reproducibility and further improvements of results.
Problem

Research questions and friction points this paper is trying to address.

Detecting objects in distorted panoramic cycling videos
Tracking multiple small objects across video boundaries
Analyzing vehicle overtaking maneuvers for cyclist safety
Innovation

Methods, ideas, or system contributions that make the work stand out.

Segmenting panoramic images into perspective sub-images
Modifying tracking models with boundary continuity
Validating through real-world overtaking detection application
🔎 Similar Papers
No similar papers found.
Jingwei Guo
Jingwei Guo
Alibaba Group; University of Liverpool
graph machine learningtest-time domain adaptation
Meihui Wang
Meihui Wang
University College London
urban data scienceurban analyticsGeoAISpace-Time Analytics
I
Ilya Ilyankou
SpaceTimeLab, Department of Civil, Environmental, and Geomatic Engineering, University College London (UCL), UK
N
Natchapon Jongwiriyanurak
SpaceTimeLab, Department of Civil, Environmental, and Geomatic Engineering, University College London (UCL), UK
X
Xiaowei Gao
SpaceTimeLab, Department of Civil, Environmental, and Geomatic Engineering, University College London (UCL), UK
N
Nicola Christie
Centre for Transport Studies, Department of Civil, Environmental, and Geomatic Engineering, University College London (UCL), UK
James Haworth
James Haworth
Associate Professor in Spatio-temporal Analytics, University College London
GIScienceSpatio-temporalMachine LearningTransportIntelligent Transportation Systems