Pedestrian intention prediction: A convolutional bottom-up multi-task approach

Razali, Haziq; Mordan, Taylor; Alahi, Alexandre

doi:10.1016/j.trc.2021.103259

research article

Pedestrian intention prediction: A convolutional bottom-up multi-task approach

Razali, Haziq

•

Mordan, Taylor

•

Alahi, Alexandre

July 15, 2021

Transportation Research Part C: Emerging Technologies

The ability to predict pedestrian behaviour is crucial for road safety, traffic management systems, Advanced Driver Assistance Systems (ADAS), and more broadly autonomous vehicles. We present a vision-based system that simultaneously locates where pedestrians are in the scene, estimates their body pose and predicts their intention to cross the road. Given a single image, our proposed neural network is designed using a bottom-up approach and thus runs at nearly constant time without relying on a pedestrian detector. Our method jointly detects human body poses and predicts their intention in a multitask framework. Experimental results show that the proposed model outperforms the precision scores of the state-of-the-art for the task of intention prediction by approximately 20% while running in real-time (5 fps). The source code is publicly available so that it can be easily integrated into an ADAS or into any traffic light management systems.

Name

Pedestrian_intention_prediction_A_convolutional_bottom-up_multi-task_approach.pdf

Type

Publisher's version

Access type

openaccess

License Condition

CC BY-NC-ND

Size

16.48 MB

Format

Adobe PDF

Checksum (MD5)

0242e9e13b9ec8f26daf1e4d18cb001e