2021
DOI: 10.1016/j.envres.2021.111720
|View full text |Cite
|
Sign up to set email alerts
|

Automating water quality analysis using ML and auto ML techniques

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 18 publications
(5 citation statements)
references
References 6 publications
0
5
0
Order By: Relevance
“…Once the pre-processing stage was concluded, it was followed by the application of the models and the analysis of their metrics. After pre-processing the dataset, we used the PyCaret library [12,13] (an open source Auto-ML in Python), as it is extremely simple and allows it to be replicated in future approaches.…”
Section: Classification Modelsmentioning
confidence: 99%
“…Once the pre-processing stage was concluded, it was followed by the application of the models and the analysis of their metrics. After pre-processing the dataset, we used the PyCaret library [12,13] (an open source Auto-ML in Python), as it is extremely simple and allows it to be replicated in future approaches.…”
Section: Classification Modelsmentioning
confidence: 99%
“…In the case of Naive Bayes (NB), it is a widely used probabilistic classifier that is driven by Bayesian statistics (Banchhor & Srinivasu, 2020). Recently, this classifier has been widely used for classifying the water quality and predicting the states in water resources management (Ali Haghpanah jahromi and Mohammad Taheri, 2017; Neha Radhakrishnan and Anju S Pillai, 2020; Suwadi et al, 2022;Venkata Vara Prasad et al, 2021). Mainly, the NB classifier's performance depends on two identical parameters, including data distribution and kernel function.…”
Section: Models Hyper-parameterizationmentioning
confidence: 99%
“…Mainly, the NB classifier's performance depends on two identical parameters, including data distribution and kernel function. Commonly, the Gaussian data distribution function is widely used to obtain the highest performance of the classifier (Suwadi et al, 2022;Venkata Vara Prasad et al, 2021). (In contrast to other classifiers, the NB classifier does not require parameter optimization or the setting of any tuning parameters (s).…”
Section: Models Hyper-parameterizationmentioning
confidence: 99%
“…The SMOTE is adopted for a data preprocessing technique and supports the enhancement of the performance of machine learning models by mitigating overfitting problems [35]. For imbalanced water quality and quantity data, the SMOTE has been used to improve data balance for the enhancement of prediction performance using machine learning techniques [36][37][38][39][40]. Furthermore, to improve SMOTE, an adaptive synthetic sampling (ADASYN) was proposed, introducing a density distribution to determine the number of synthetic samples [41].…”
Section: Introductionmentioning
confidence: 99%