2020
DOI: 10.3390/w12041191
|View full text |Cite
|
Sign up to set email alerts
|

Machine Learning Approaches for Predicting Health Risk of Cyanobacterial Blooms in Northern European Lakes

Abstract: Cyanobacterial blooms are considered a major threat to global water security with documented impacts on lake ecosystems and public health. Given that cyanobacteria possess highly adaptive traits that favor them to prevail under different and often complicated stressor regimes, predicting their abundance is challenging. A dataset from 822 Northern European lakes is used to determine which variables better explain the variation of cyanobacteria biomass (CBB) by means of stepwise multiple linear regression. Chlor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
14
0

Year Published

2020
2020
2025
2025

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 22 publications
(14 citation statements)
references
References 49 publications
0
14
0
Order By: Relevance
“…Path analysis is an extension of the multiple linear regression analysis, which can identify and quantify the direct and indirect effects of independent variables on dependent variables [75]. It is a powerful tool used to analyze the relationship between multiple variables and is widely used in many fields.…”
Section: Path Analysismentioning
confidence: 99%
“…Path analysis is an extension of the multiple linear regression analysis, which can identify and quantify the direct and indirect effects of independent variables on dependent variables [75]. It is a powerful tool used to analyze the relationship between multiple variables and is widely used in many fields.…”
Section: Path Analysismentioning
confidence: 99%
“…These surfaces were designed using the posterior distribution parameters β ij ( β 0,ij , β 1,ij , β 2,ij ) that are specific for each lake. In Figs 7 (b), (e), (h) and (k), we show the same plots as before, but without the observed values scatter plot; this time we visualize two horizontal planes that correspond to the three distinct health risk levels (low-medium-high, with thresholds at 2 and 10 mg/L), as defined by WHO, after converting cell counts into concentrations ( Mellios et al, 2020 ). From these posterior probability surfaces, we can identify the combination of TN and TP concentrations that result in the 50 th percentile of CBB distribution being lower than 2 mg/L (below the bottom horizontal plane-green color), being between 2 and 10 mg/L (between the two planes-yellow color) and being above 10 mg/L (above the top horizontal plane-red color).…”
Section: Posterior Probabilities and Exceedance Probability Surfacesmentioning
confidence: 91%
“…1 the spatial distribution of all lakes in the dataset across the European map is shown. More details on the data set are included in Mellios et al (2020) .…”
Section: Datasetmentioning
confidence: 99%
See 1 more Smart Citation
“…In binary logistic regression, since the actual value of the dependent variable is present and the predicted value can be calculated, the predicted value can be applied to a confusion matrix that can be compared to the target value [9,10]. It can be obtained sensitivity and precision from the confusion matrix using the actual and predicted values of the logistic regression, and apply it to algal blooms to create a summary of indicators such as sensitivity and precision including accuracy [11][12][13].…”
Section: Introductionmentioning
confidence: 99%