2023
DOI: 10.1155/2023/4916267
|View full text |Cite
|
Sign up to set email alerts
|

Prediction of Air Quality Index Using Machine Learning Techniques: A Comparative Analysis

Abstract: An index for reporting air quality is called the air quality index (AQI). It measures the impact of air pollution on a person’s health over a short period of time. The purpose of the AQI is to educate the public on the negative health effects of local air pollution. The amount of air pollution in Indian cities has significantly increased. There are several ways to create a mathematical formula to determine the air quality index. Numerous studies have found a link between air pollution exposure and adverse heal… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 46 publications
(5 citation statements)
references
References 38 publications
0
3
0
Order By: Relevance
“…Te AQI [20] of New Delhi, Bangalore, Kolkata, and Hyderabad has been calculated using three diferent techniques: support vector regression (SVR), random forest regression (RFR), and CatBoost regression (CR). Random forest regression yields lower root mean square error (RMSE) values in Bangalore (0.5674), Kolkata (0.1403), and Hyderabad (0.3826) and higher accuracy in comparison to SVR and CatBoost regression for Kolkata (90.9700%) and Hyderabad (78.3672%), while CatBoost regression yields lower RMSE values in New Delhi (0.2792) and the highest accuracy in New Delhi (79.8622%) and Bangalore (68.6860%).…”
Section: Literature Surveymentioning
confidence: 99%
“…Te AQI [20] of New Delhi, Bangalore, Kolkata, and Hyderabad has been calculated using three diferent techniques: support vector regression (SVR), random forest regression (RFR), and CatBoost regression (CR). Random forest regression yields lower root mean square error (RMSE) values in Bangalore (0.5674), Kolkata (0.1403), and Hyderabad (0.3826) and higher accuracy in comparison to SVR and CatBoost regression for Kolkata (90.9700%) and Hyderabad (78.3672%), while CatBoost regression yields lower RMSE values in New Delhi (0.2792) and the highest accuracy in New Delhi (79.8622%) and Bangalore (68.6860%).…”
Section: Literature Surveymentioning
confidence: 99%
“…In the RF, each node is split using the optimal splitter chosen from a subset of predictors. At every node, random predictors are utilized, and this element of randomness offers overfit protection (Alamsyah & Salma, 2018;Schonlau & Zou, 2020;Yarragunta et al, 2021;Hai et al, 2022;Benifa et al, 2022;Ravindiran et al, 2023;Baladjay et al, 2023;Gupta et al, 2023;Elvin, 2024;Aram et al, 2024). When presented with new data, each DT makes its own prediction.…”
Section: Random Forest (Rf)mentioning
confidence: 99%
“…Following this, the testing dataset is employed to introduce novel inputs to the system, thereby evaluating its precision and efficacy. This testing phase holds significant importance as it verifies the model's capability to apply learned knowledge to novel or previously unseen data (Ameer et al, 2019;Simu et al, 2020;Yarragunta et al, 2021;Gupta et al, 2023). Through this study endeavour, it is expected to identify the most accurate predictive model for forecasting employee turnover using air pollution data, while also contributing valuable insights for future research endeavours in this field.…”
Section: Introductionmentioning
confidence: 96%
“…Air quality prediction accuracy could be enhanced by using advanced machine learning algorithms like CatBoost [21]. CatBoost is a type of gradient-boosting algorithm adept at working with complex, highdimensional data sets like those used for modeling urban air quality [22].…”
Section: Introductionmentioning
confidence: 99%