VIENA(2): A Driving Anticipation Dataset

Aliakbarian, Mohammad Sadegh; Saleh, Fatemeh Sadat; Salzmann, Mathieu; Fernando, Basura; Petersson, Lars; Andersson, Lars

doi:10.1007/978-3-030-20887-5_28

conference paper

VIENA(2): A Driving Anticipation Dataset

Aliakbarian, Mohammad Sadegh

•

Saleh, Fatemeh Sadat

•

Salzmann, Mathieu

more

January 1, 2019

Computer Vision - Accv 2018, Pt I

14th Asian Conference on Computer Vision (ACCV)

Action anticipation is critical in scenarios where one needs to react before the action is finalized. This is, for instance, the case in automated driving, where a car needs to, e.g., avoid hitting pedestrians and respect traffic lights. While solutions have been proposed to tackle subsets of the driving anticipation tasks, by making use of diverse, task-specific sensors, there is no single dataset or framework that addresses them all in a consistent manner. In this paper, we therefore introduce a new, large-scale dataset, called VIENA2, covering 5 generic driving scenarios, with a total of 25 distinct action classes. It contains more than 15K full HD, 5 s long videos acquired in various driving conditions, weathers, daytimes and environments, complemented with a common and realistic set of sensor measurements. This amounts to more than 2.25M frames, each annotated with an action label, corresponding to 600 samples per action class. We discuss our data acquisition strategy and the statistics of our dataset, and benchmark state-of-the-art action anticipation techniques, including a new multi-modal LSTM architecture with an effective loss function for action anticipation in driving scenarios.

Type

conference paper

DOI

10.1007/978-3-030-20887-5_28

Web of Science ID

WOS:000492901400028

Authors

Aliakbarian, Mohammad Sadegh

•

Saleh, Fatemeh Sadat

•

Salzmann, Mathieu

•

Fernando, Basura

•

Petersson, Lars

•

Andersson, Lars

Publication date

2019-01-01

Publisher

SPRINGER INTERNATIONAL PUBLISHING AG

Published in

Computer Vision - Accv 2018, Pt I

ISBN of the book

978-3-030-20886-8

978-3-030-20887-5

Publisher place

Cham

Series title/Series vol.

Lecture Notes in Computer Science

Volume

11361

Start page

449

End page

466

Subjects

Computer Science, Art...

Engineering, Electric...

Computer Science

Engineering

Peer reviewed

REVIEWED

EPFL units

CVLAB

Event name	Event place	Event date
14th Asian Conference on Computer Vision (ACCV)	Perth, AUSTRALIA	Dec 02-06, 2018

Available on Infoscience

November 8, 2019

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/162794