Understanding the underlying relationship between pedestrian injury severity outcomes and factors leading to more severe injuries is very important in dealing with the problem of pedestrian safety. To investigate injury severity outcomes, many previous works relied on statistical regression models. There has also been some interest for data mining techniques, in particular for clustering techniques which segment the data into more homogeneous subsets. This research combines these two approaches (data mining and statistical regression methods) to identify the main contributing factors associated with the levels of pedestrian injury severity outcomes. This work relies on the analysis of two unique pedestrian injury severity datasets from the City of New York, US (2002US ( -2006 and the City of Montreal, Canada (2003-2006. General injury severity models were estimated for the whole datasets and for subpopulations obtained through clustering analysis. This paper shows how the segmentation of the accident datasets help to better understand the complex relationship between the injury severity outcomes and the contributing geometric, built environment and socio-demographic factors. While using the same methodology for the two datasets, different techniques were tested. For instance, for New York, latent class with ordered probit method provides the best results. However, for Montreal, the K-means with multinomial logit model is identified as the most appropriate technique. The results show the power of using clustering with regression to provide a complementary and more detailed analysis. Among other results, it was found that pedestrian age, location at intersection, actions prior to accident, driver age, vehicle type, vehicle movement, driver alcohol involvement and lighting conditions have an influence on the likelihood of a fatal crash. Moreover, several features within the built environment are shown to have an effect. Finally, the research provides recommendations for policy makers, traffic engineers, and law enforcement to reduce the severity of pedestrian-vehicle collisions.
KEYWORDS:Pedestrian safety, regression, latent class, clustering, severity, built environmental, land use variables Mohamed, Saunier, Miranda-Moreno, Ukkusuri 3
INTRODUCTIONRoad user safety is a primary concern, not only for traffic safety specialists and traffic engineers, but for educators and law enforcement as well. Most importantly, pedestrian safety is a vital traffic issue as all road users are pedestrians at one point or another. Since pedestrians are vulnerable road users and suffer more in road crashes, it is important to understand the factors affecting pedestrian injury severity levels. In this way, traffic engineers, planners, decision makers and law enforcement will be able to precisely target these factors through various counter-measures, such as improvements to motorized vehicles, roadway and pedestrian facility design, control strategies at conflict locations, and driver and pedestrian education programs.This pape...