2012
DOI: 10.1186/1756-0500-5-2101791285670500
|View full text |Cite
|
Sign up to set email alerts
|

An efficient clustering algorithm for partitioning Y-short tandem repeats data

Abstract: BackgroundY-Short Tandem Repeats (Y-STR) data consist of many similar and almost similar objects. This characteristic of Y-STR data causes two problems with partitioning: non-unique centroids and local minima problems. As a result, the existing partitioning algorithms produce poor clustering results.ResultsOur new algorithm, called k-Approximate Modal Haplotypes (k-AMH), obtains the highest clustering accuracy scores for five out of six datasets, and produces an equal performance for the remaining dataset. Fur… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
9
0

Year Published

2013
2013
2018
2018

Publication Types

Select...
3

Relationship

2
1

Authors

Journals

citations
Cited by 3 publications
(9 citation statements)
references
References 22 publications
0
9
0
Order By: Relevance
“…However, where the sample size is large and/or there are multiple samples, multi-criteria analyses such as supervised and unsupervised learning methods produce results that are more informative. Indeed, several methods for grouping multiple samples of Y-STR data automatically have been reported (Schlecht et al, 2008 ; Seman et al, 2010a ; 2012 ; 2013a ). In the supervised learning method, Y-STR data can be classified by haplogroup via the decision tree method (Schlecht et al, 2008 ; Seman et al, 2013a ), Bayesian modeling, and support vector machines (Schlecht et al, 2008 ).…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…However, where the sample size is large and/or there are multiple samples, multi-criteria analyses such as supervised and unsupervised learning methods produce results that are more informative. Indeed, several methods for grouping multiple samples of Y-STR data automatically have been reported (Schlecht et al, 2008 ; Seman et al, 2010a ; 2012 ; 2013a ). In the supervised learning method, Y-STR data can be classified by haplogroup via the decision tree method (Schlecht et al, 2008 ; Seman et al, 2013a ), Bayesian modeling, and support vector machines (Schlecht et al, 2008 ).…”
Section: Introductionmentioning
confidence: 99%
“…In the supervised learning method, Y-STR data can be classified by haplogroup via the decision tree method (Schlecht et al, 2008 ; Seman et al, 2013a ), Bayesian modeling, and support vector machines (Schlecht et al, 2008 ). Similarly, unsupervised learning methods can be used to cluster Y-STR data by similar genetic distances (Seman et al, 2010a ; 2010b ; 2010c ; 2010d ; 2012 ).…”
Section: Introductionmentioning
confidence: 99%
“…The outcomes of this result indicate that the Y-STR data are quite unique compared to other categorical data, characterizing many similar and almost similar objects. This uniqueness of the Y-STR data has caused the existing clustering algorithms to produce poor clustering results (see the detailed problems of clustering Y-STR data in [15]). As a result, we have recently proposed a new algorithm called -Approximate Modal Haplotype ( -AMH) for clustering six Y-STR data [15].…”
Section: Introductionmentioning
confidence: 99%
“…This uniqueness of the Y-STR data has caused the existing clustering algorithms to produce poor clustering results (see the detailed problems of clustering Y-STR data in [15]). As a result, we have recently proposed a new algorithm called -Approximate Modal Haplotype ( -AMH) for clustering six Y-STR data [15]. Letting these Y-STR dataset items be a benchmark, the -AMH algorithm has been proven as an efficient clustering algorithm for partitioning Y-STR data.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation