Abstract
This paper focuses on monocular-video-based stationary detection of the pedestrian's intention to enter the traffic lane. We propose a motion contour image based HOG-like descriptor, MCHOG, and a machine learning algorithm that reaches the decision at an accuracy of 99 % within the initial step at the curb of smart infrastructure. MCHOG implicitly comprises the body language of gait initiation, especially the body bending and the spread of legs. In a case study at laboratory conditions we present ROC performance data and an evaluation of the span of time necessary for recognition. While MCHOG in special cases indicates detection of the intention before the whole body moves, on average it allows for detection of the movement within 6 frames at a frame rate of 50 Hz and an accuracy of 80 %. Feasibility of the method in a real world intersection scenario is demonstrated.
Submitted Version (Free)
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have