2021
DOI: 10.3390/ijerph18147534
|View full text |Cite
|
Sign up to set email alerts
|

Risky Driver Recognition with Class Imbalance Data and Automated Machine Learning Framework

Abstract: Identifying high-risk drivers before an accident happens is necessary for traffic accident control and prevention. Due to the class-imbalance nature of driving data, high-risk samples as the minority class are usually ill-treated by standard classification algorithms. Instead of applying preset sampling or cost-sensitive learning, this paper proposes a novel automated machine learning framework that simultaneously and automatically searches for the optimal sampling, cost-sensitive loss function, and probabilit… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 11 publications
(4 citation statements)
references
References 43 publications
0
4
0
Order By: Relevance
“…The model verification was performed through HIL simulation, achieving 83.75% prediction accuracy. Wang et al [220] developed a risky driver prediction model using video data from an overhead traffic drone. The drivers from the video are labeled according to their average collision risk, which is calculated based on the spacing and stopping distance (DSS).…”
Section: A Human Driver Behavior Recognitionmentioning
confidence: 99%
“…The model verification was performed through HIL simulation, achieving 83.75% prediction accuracy. Wang et al [220] developed a risky driver prediction model using video data from an overhead traffic drone. The drivers from the video are labeled according to their average collision risk, which is calculated based on the spacing and stopping distance (DSS).…”
Section: A Human Driver Behavior Recognitionmentioning
confidence: 99%
“…The ratio of the absolute value of DSS to the speed of the rear vehicle can reflect the missing reaction time required by the driver [26]. The formula is shown in Equation (6).…”
Section: Lane-changing Risk Indicatormentioning
confidence: 99%
“…The most common sampling techniques in the literature are the Synthetic Minority Oversampling Technique (SMOTE) [ 16 , 22 , 23 , 24 ] and Adaptive Synthetic (ADASYN) [ 24 ]. In addition, based on the literature review in the field of road safety as well as different scientific areas, additional sampling techniques tend to be efficient methods such as the combination of SMOTE and Edited Nearest Neighbors (SMOTE-ENN) [ 10 ], Random Oversampling, SVM-SMOTE and SMOTE-Tomek [ 25 ].…”
Section: Literature Reviewmentioning
confidence: 99%
“…In summary, this research proposes a framework for (a) defining the STZ levels, and (b) developing and evaluating machine learning algorithms to classify driving behavior and predict the duration that each driver spends at each risk level. This framework also exploits the most important features to identify driving behavior and takes care of dataset imbalance, which is a common problem in road safety analyses [ 10 ]. The paper contributes to the current knowledge in a two-fold matter; initially by identifying the level of safety of drivers in real-time, which is a real-time classification problem, and consequently by predicting the duration of each safety level in real-time.…”
Section: Introductionmentioning
confidence: 99%