2019
DOI: 10.1016/j.ins.2018.10.002
|View full text |Cite
|
Sign up to set email alerts
|

Noisy values detection and correction of traffic accident data

Abstract: Death, injury, and disability from road traffic crashes continue to be a major global public health problem. Therefore, methods to reduce accident severity are of significant interest to traffic agencies and the public at large. Noisy data in the traffic accident dataset obscure the discovery of important factors and mislead conclusions. Identifying and correcting noisy values is an important goal of data cleansing and preprocessing. This paper proposes a new algorithm called NoiseCleaner to identify and corre… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 22 publications
(4 citation statements)
references
References 14 publications
0
4
0
Order By: Relevance
“…De Oña et al [74] also recommend using two or more data analysis techniques. The data heterogeneity can also be addressed by improving data quality through noise reduction, such as by using an algorithm called NoiseCleaner, suggested by Deb and Lew [75]. The review conducted by Gutierrez-Osorio and Pedraza [7] on using analytic algorithms and machine learning methods in traffic safety studies can be further referenced.…”
Section: Discussionmentioning
confidence: 99%
“…De Oña et al [74] also recommend using two or more data analysis techniques. The data heterogeneity can also be addressed by improving data quality through noise reduction, such as by using an algorithm called NoiseCleaner, suggested by Deb and Lew [75]. The review conducted by Gutierrez-Osorio and Pedraza [7] on using analytic algorithms and machine learning methods in traffic safety studies can be further referenced.…”
Section: Discussionmentioning
confidence: 99%
“…The anomalous data may provide false information for the results analysis of a fatal urban traffic accident; thus, detecting abnormal data and recovering the missing data are both essential in data pre-processing. Many techniques have been proposed for outlier detection and correction: linear regression, neural network, decision trees, and the maximum likelihood estimation-based Bayesian dynamic linear model [21][22][23]. For missing value, mainly strategies are deletion, mean imputation [24], and data-driven imputation.…”
Section: Study On the Accident Data Processing Methodsmentioning
confidence: 99%
“…The time sequence of traffic flow parameters not only contains regular fluctuations, but also has its own random fluctuations ( 21 ). The existence of randomness in a single sample will greatly interfere with traffic pattern extraction and law learning, reduce prediction performance, and even cause subsequent misjudgment of traffic state.…”
Section: Methodsmentioning
confidence: 99%