2022
DOI: 10.1016/j.ecss.2022.108053
|View full text |Cite
|
Sign up to set email alerts
|

A multi-scale feature selection approach for predicting benthic assemblages

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2022
2022
2025
2025

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 17 publications
(7 citation statements)
references
References 79 publications
0
7
0
Order By: Relevance
“…Despite being an initial study, the results here presented can play a key role in supporting the implementation of conservation/management plans for bait harvesting activities in this coastal lagoon and elsewhere, namely estuarine systems where bait harvesting is more intense and relevant from an economic, social, and ecological perspective. The use of statistical methods, such as the Boruta selection function used in this study, can also be applied in other areas of marine science, such as benthic habitat mapping [78]. The selection performed by Boruta helps to achieve more robust and accurate models, as well as to understand which explanatory variables (e.g., elements and FA) characterize each categorical response variable (e.g., location)."…”
Section: Discussionmentioning
confidence: 99%
“…Despite being an initial study, the results here presented can play a key role in supporting the implementation of conservation/management plans for bait harvesting activities in this coastal lagoon and elsewhere, namely estuarine systems where bait harvesting is more intense and relevant from an economic, social, and ecological perspective. The use of statistical methods, such as the Boruta selection function used in this study, can also be applied in other areas of marine science, such as benthic habitat mapping [78]. The selection performed by Boruta helps to achieve more robust and accurate models, as well as to understand which explanatory variables (e.g., elements and FA) characterize each categorical response variable (e.g., location)."…”
Section: Discussionmentioning
confidence: 99%
“…A variable is deemed important if it consistently contributes more to the model than its shadow variable. Degenhardt et al, 2019 found that the Boruta algorithm generally outperformed other selection methods, and previous successful applications can be found in Li et al, 2016, Diesing andThorsnes, (2017) and Nemani et al, 2021. Variables identified as "important" or "tentative" were selected for model training here.…”
Section: Model Developmentmentioning
confidence: 97%
“…For both the assemblages and for each marine algae taxa model, a Boruta Feature Selection (Kurse and Rudnicki, 2010) algorithm was run separately to include terrain features grouped by scale (window of analysis: 3 × 3, 13 × 13, 35 × 35 cells) in order to reduce the number of candidate terrain features and promote model parsimony and support performance (Nemani et al, 2021). Important variables are identified by the Boruta wrapper as it compares the importance of a variable with a randomly shuffled version containing the same distribution of values (i.e., "shadow features").…”
Section: Model Developmentmentioning
confidence: 99%
“…Compared to other machine learning models, the Extreme Gradient Boosting (XGBoost) model based on Decision Trees (DT) is a flexible and versatile tool capable of solving a wide range of regression and classification problems. The non-linear properties of the estimated variables are accurately captured at a low computational cost and without over-fitting, as shown by Nemani et al (2022). The XGBoost model differs from Random Forest (RF) in that it combines both DT and Gradient Boosting (GB) to aggregate the weak learners' predictions and strengthen the strong learners through effective training strategies (Fan et al, 2019).…”
Section: Introductionmentioning
confidence: 99%