2022
DOI: 10.1016/j.procs.2022.08.021
|View full text |Cite
|
Sign up to set email alerts
|

A comparison of unsupervised and supervised machine learning algorithms to predict water pollutions

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 16 publications
(6 citation statements)
references
References 9 publications
0
6
0
Order By: Relevance
“…The research used the k-means clustering algorithm, a well-established method in data analysis for identifying patterns in datasets, notable for its guaranteed convergence (Piegari et al, 2023). According to Zamri et al, (2022), the elbow and silhouette approaches were used to uncover clusters in the dataset that might reveal concealed subsurface units. The datasets underwent principal component analysis (PCA) to reduce the dimensions while retaining key traits and reducing complexity.…”
Section: K-mean Clusteringmentioning
confidence: 99%
See 1 more Smart Citation
“…The research used the k-means clustering algorithm, a well-established method in data analysis for identifying patterns in datasets, notable for its guaranteed convergence (Piegari et al, 2023). According to Zamri et al, (2022), the elbow and silhouette approaches were used to uncover clusters in the dataset that might reveal concealed subsurface units. The datasets underwent principal component analysis (PCA) to reduce the dimensions while retaining key traits and reducing complexity.…”
Section: K-mean Clusteringmentioning
confidence: 99%
“…Using models of P-wave velocity (Vp) and resistivity with ML in the same place for subsurface exploration makes it possible to nd a clear link between physical properties at certain sites (Piegari et al, 2023;Zamri et al, 2022). This approach facilitates the classi cation of the surveyed region based on its distinctive lithology.…”
Section: Introductionmentioning
confidence: 99%
“…Its purpose is to ensure uniformity in scale and value distribution across all variables, thereby promoting equitable contributions to model development and diminishing potential bias stemming from variable dominance. In this context, the data underwent a transformation to attain a mean of zero and a stan-dard deviation of 1, employing the StandardScaler method available in the Scikit-learn library [25,49,50].…”
Section: Pre-processing Of Data and Statistical Analysismentioning
confidence: 99%
“…Nonetheless, machine learning algorithms rely on a single estimation model and may tend to overfit when confronted with limited training data, as observed with KNN, SVM, and RF [8]. To alleviate such concerns, pre-processing input data through normalization, utilizing extensive databases [25], and applying data partitioning methods like K-fold [26] can effectively mitigate the risks of overfitting and underfitting during the training process, thereby bolstering the models' capacity for generalization. Integrating machine learning with data from UAVs and satellites stands as a promising strategy for monitoring plant height dynamics at a large experimental scale.…”
Section: Introductionmentioning
confidence: 99%
“…The main reason was that the water quality in the flood season was uncertain due to the influence of Water pollution prediction has always been a hot research topic for scholars. Zamri et al compared the performance of 10 machine learning algorithms in pollution source classification, and found that RF algorithm has high performance in pollution source classification prediction (with 98.78% accuracy) [51]. The main reason was that RF can integrate multiple decision tree models to provide more stable prediction.…”
Section: Plos Onementioning
confidence: 99%