2021
DOI: 10.1111/jvs.13042
|View full text |Cite
|
Sign up to set email alerts
|

A comparative study of hard clustering algorithms for vegetation data

Abstract: Questions Which clustering algorithms are most effective according to different cluster validity evaluators? Which distance or dissimilarity measure is most suitable for clustering algorithms? Location Hyrcanian forest, Iran (Asia), Virginia region forest, United States (North America), beech forests, Ukraine (Europe). Methods We tested 25 clustering algorithms with nine vegetation data sets comprised of three real data sets and six simulated data sets exhibiting different cluster separation values. The cluste… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 8 publications
(6 citation statements)
references
References 70 publications
(118 reference statements)
0
6
0
Order By: Relevance
“…The classifications were undertaken in three steps: (1) pre‐processing involved the selection of a distance measure and normalization of the data; (2) cluster analysis involved the selection and application of the clustering algorithm and its various parameters; (3) cluster validation involved the selection and application of appropriate internal validation techniques to evaluate the quality of the classification. Four clustering algorithms and four validation measures were explored based on demonstrated performance in recent literature (Aho et al, 2008 ; Handl et al, 2005 ; Lengyel et al, 2021 ; Pakgohar et al, 2021 ). We defined a vegetation classification as being comprised of a cluster of plots organized into units with discrete boundaries between them.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…The classifications were undertaken in three steps: (1) pre‐processing involved the selection of a distance measure and normalization of the data; (2) cluster analysis involved the selection and application of the clustering algorithm and its various parameters; (3) cluster validation involved the selection and application of appropriate internal validation techniques to evaluate the quality of the classification. Four clustering algorithms and four validation measures were explored based on demonstrated performance in recent literature (Aho et al, 2008 ; Handl et al, 2005 ; Lengyel et al, 2021 ; Pakgohar et al, 2021 ). We defined a vegetation classification as being comprised of a cluster of plots organized into units with discrete boundaries between them.…”
Section: Methodsmentioning
confidence: 99%
“…Lötter et al ( 2013 ) referred to this as “the classification conundrum”. The amount of research available which advocates particular methods, ideologies and approaches to classify vegetation (Feilhauer et al, 2020 ; Lengyel et al, 2021 ; Lortie et al, 2004 ; Lötter et al, 2013 ; Pakgohar et al, 2021 ), reflects the impracticality of the use of one universal approach in all environments. Nevertheless, there is general agreement that expert opinion is needed to select vegetation units at some stage in the classification process (Brown et al, 2013 ; Lötter et al, 2013 ; Mucina, 1997 ) even if this adds subjectivity to the classification, possibly resulting in bias (Lötter et al, 2013 ; Wolda, 1981 ), with little objective validation of clustering results.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…To obtain community assembly matrices, for VST transformed data, we used Jaccard dissimilarity index calculated from presence-absence data (Tian et al, 2019); while for Hellinger transformed data, we used the Euclidean dissimilarity index (Pakgohar et al, 2021).…”
Section: Statistical Analysesmentioning
confidence: 99%
“…Regression analysis is to model the spectral and structural information directly with the measured species diversity indices, which is a mature and straightforward algorithm, but the applicability in different regions is poor (Ceballos et al, 2015). The clustering algorithm can evaluate species diversity by grouping trees with similar characteristics based on the biochemical and structural variation of different tree species (Asner et al, 2015;Padilla-Martinez et al, 2020;Pakgohar et al, 2021). Clustering can be used to identify patterns or trends in the distribution and abundance of different species within a forest ecosystem.…”
Section: Introductionmentioning
confidence: 99%