2021
DOI: 10.34172/ehem.2021.25
|View full text |Cite
|
Sign up to set email alerts
|

Application of imputation methods for missing values of PM10 and O3 data: Interpolation, moving average and K-nearest neighbor methods

Abstract: Background: PIn air quality studies, it is very often to have missing data due to reasons such as machine failure or human error. The approach used in dealing with such missing data can affect the results of the analysis. The main aim of this study was to review the types of missing mechanism, imputation methods, application of some of them in imputation of missing of PM10 and O3 in Tabriz, and compare their efficiency. Methods: Methods of mean, EM algorithm, regression, classification and regression tree, pre… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 27 publications
(8 citation statements)
references
References 50 publications
0
8
0
Order By: Relevance
“…The bias of multivariate estimations like correlation or regression coefficients is one of the drawbacks of the linear moving average imputation method. In general, there is no link between the values imputed by the mean of the variables and the other variables 42 …”
Section: Methodsmentioning
confidence: 99%
“…The bias of multivariate estimations like correlation or regression coefficients is one of the drawbacks of the linear moving average imputation method. In general, there is no link between the values imputed by the mean of the variables and the other variables 42 …”
Section: Methodsmentioning
confidence: 99%
“…In contrast, machine learning and deep learning approaches often yield superior imputation results but typically necessitate extended imputation durations compared to statistical methods. Concurrently, traditional machine learning approaches, encompassing K-Nearest Neighbor, fuzzy methods, decision trees, support vectors, and other models, have been integrated into the repertoire of techniques for addressing missing values [29][30][31]. A case in point is the work of Honghai et al, where Support Vector Machine (SVM) regression was employed to estimate missing conditional attribute values, illustrating the efficacy of machine learning in enhancing data completeness, but not with large datasets [32].…”
Section: Introductionmentioning
confidence: 99%
“…Urban development and expansion have caused changes in both climatic and atmospheric conditions in recent years (5). These changes have affected the stability of the natural environment and the health of people especially those who live in urban areas (6).…”
Section: Introductionmentioning
confidence: 99%