Wildlife‒vehicle collision (WVC) data usually contain two types: the reported WVC data and carcass removal data. Previous studies often found a discrepancy between the number of reported WVC and carcass removal data, and the quality of both datasets is affected by underreporting. Underreporting means the number of WVCs is not fully recorded in the database; neglecting the underreporting in WVC data may result in biased parameter estimation results. In this study, a copula regression model linking wildlife‒vehicle collisions and the underreporting outcome was proposed to consider the underreporting in WVC data. The WVC data collected from 10 highways in Washington State were analyzed using the copula regression model and the Negative Binomial (NB) model. The main findings from this study are as follows: (1) the Gaussian copula model can provide different modeling results when compared with the conventional modeling approach; (2) the hotspot identification results indicate that the Gaussian copula-based Empirical Bayes (EB) method can more accurately identify hotspots than the NB-based EB method. Thus, the proposed copula model may be a better alternative to the conventional NB model for modeling underreported WVC data.
Crash modification factors (CMFs) can be used to capture the safety effects of countermeasures and play a significant role in traffic safety management. As an alternative to the before-and-after study, the regression model method has been widely used for estimating CMFs. Although before-and-after studies are considered to be superior, the use of regression models for estimating CMFs has never been fully investigated. Consequently, the conditions in which regression models could be used for such a purpose were examined. CMFs for three variables—lane width, curve density, and pavement friction—were assumed and used for generating random crash counts. Then CMFs were derived from regression models by using the simulated crash data for three different scenarios. The results were then compared with the assumed true values. The study results showed that (a) when all factors affecting traffic safety are identical in all segments except those of interest, CMFs derived from regression models should be unbiased; (b) if some factors having minor safety effects are omitted from the models, the accuracy of estimated CMFs can still be acceptable; and (c) if some factors already known to have significant effects on crash risk are omitted, CMFs derived from the regression models are generally unreliable. Thus, depending on missing variables not included in the model, the transportation safety analyst can decide whether CMFs developed from regression models should be used for highway safety applications.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.