A new spatial count data model with Bayesian additive regression trees for accident hot spot identification

Krueger, Rico; Bansal, Prateek; Buddhavarapu, Prasad

doi:10.1016/j.aap.2020.105623

Krueger, Rico; Bansal, Prateek; Buddhavarapu, Prasad

2020

Download

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

The identification of accident hot spots is a central task of road safety management. Bayesian count data models have emerged as the workhorse method for producing probabilistic rankings of hazardous sites in road networks. Typically, these methods assume simple linear link function specifications, which, however, limit the predictive power of a model. Furthermore, extensive specification searches are precluded by complex model structures arising from the need to account for unobserved heterogeneity and spatial correlations. Modern machine learning (ML) methods offer ways to automate the specification of the link function. However, these methods do not capture estimation uncertainty, and it is also difficult to incorporate spatial correlations. In light of these gaps in the literature, this paper proposes a new spatial negative binomial model which uses Bayesian additive regression trees to endogenously select the specification of the link function. Posterior inference in the proposed model is made feasible with the help of the Polya-Gamma data augmentation technique. We test the performance of this new model on a crash count data set from a metropolitan highway network. The empirical results show that the proposed model performs at least as well as a baseline spatial count data model with random parameters in terms of goodness of fit and site ranking ability.

Details

Title A new spatial count data model with Bayesian additive regression trees for accident hot spot identification

Author(s) Krueger, Rico ; Bansal, Prateek ; Buddhavarapu, Prasad

Published in Accident Analysis And Prevention

Volume 144

Pages 105623

Date 2020-09-01

ISSN 0001-4575
1879-2057

Keywords

accident analysis; site ranking; spatial count data modelling; negative binomial model; bayesian additive regression trees; polya-gamma data augmentation; negative binomial regression; support vector machine; crash-frequency; neural-network; unobserved heterogeneity; transportation safety; statistical-analysis; random-parameters; empirical bayes; prediction

Note This is an open access article under the CC BY license.

DOI https://doi.org/10.1016/j.aap.2020.105623

Other identifier(s) View record in Web of Science

Laboratories TRANSP-OR

Record Appears in Scientific production and competences > ENAC - School of Architecture, Civil and Environmental Engineering > IIC - Civil Engineering Institute > TRANSP-OR - Transportation and Mobility Laboratory
Peer-reviewed publications
Work produced at EPFL
Journal Articles
Published

Record creation date 2020-09-16

Actions

Preview

Select file: