K-Means Clustering Versus Validation Measures: A Data-Distribution Perspective

Xiong, Hui; Wu, Junjie; Chen, Jian

doi:10.1109/tsmcb.2008.2004559

Cited by 230 publications

(94 citation statements)

References 25 publications

“…The results proved that the algorithm is reliable. Xiong et al [7] studied K-Means in data-distribution perspective. They described many validation measures such as CV, purity, entropy and F-measure.…”

Section: Related Work (mentioning)

confidence: 99%

“…We made an empirical study besides review of literature to prove that the Fuzzy K-Means exhibits better clustering performance than K-Means. The literature on these two and their comparison besides other derivatives of them [1], [2], [3], [4], [5], [6], [7], [8], [9], [10], [11], [12], [13], [14], [15], [16], [17], [18], [19], [20], [21], [22], [23] and [24] can be found in section IV.…”

Section: Introduction (mentioning)

confidence: 99%

Experiments on Hypothesis "Fuzzy K-Means is Better than K-Means for Clustering"

Sivarathri

Govardhan

2014

IJDKP

“…In addition, 100 elements were generated for each combination of learning styles (EAs). After the clusters were generated, the quality of each was evaluated using validation metrics such as Purity (P), which checks whether each cluster contains elements of different classes, and F-Measure (F), which assesses the quality of each cluster based on precision and recall [Xiong et al 2009]. Table 2 presents the results obtained on each metric for each of the algorithms tested.…”

Section: Proposed Approach (unclassified)
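
The purity and F-measure metrics named in the statement above can be illustrated with a minimal sketch. It assumes one common formulation (purity as the fraction of points belonging to the majority class of their cluster; F-measure as the class-size-weighted best per-class F score over all clusters), which may differ in detail from the definitions used in the citing paper. The function names `purity` and `f_measure` are hypothetical.

```python
from collections import Counter


def purity(cluster_ids, class_labels):
    """Fraction of points that carry the majority class label of their cluster."""
    n = len(cluster_ids)
    pair_counts = Counter(zip(cluster_ids, class_labels))
    majority = {}
    for (cluster, _label), count in pair_counts.items():
        majority[cluster] = max(majority.get(cluster, 0), count)
    return sum(majority.values()) / n


def f_measure(cluster_ids, class_labels):
    """Class-size-weighted best F score of each class over all clusters.

    Assumption: this is one widely used clustering F-measure, not
    necessarily the exact variant in the cited work.
    """
    n = len(cluster_ids)
    pair_counts = Counter(zip(cluster_ids, class_labels))
    cluster_sizes = Counter(cluster_ids)
    class_sizes = Counter(class_labels)
    total = 0.0
    for label, class_size in class_sizes.items():
        best = 0.0
        for cluster, cluster_size in cluster_sizes.items():
            overlap = pair_counts.get((cluster, label), 0)
            if overlap == 0:
                continue  # F score is 0 when the cluster holds no such points
            precision = overlap / cluster_size
            recall = overlap / class_size
            best = max(best, 2 * precision * recall / (precision + recall))
        total += (class_size / n) * best
    return total
```

Both functions return values in [0, 1], with 1 indicating that every cluster matches exactly one class under these definitions.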

Agrupamento e Recomendação de Objetos de Aprendizagem no Padrão IEEE-LOM Considerando Estilos de Aprendizagem

Mendes

Carvalho

Dorça

et al. 2017

Anais Do XXVIII Simpósio Brasileiro De Informática Na Educação (SBIE 2017)

Abstract. Repositories of educational materials grow larger as learning objects are created continuously, which makes the search for specific items a challenging task. Such objects should therefore be organized to better support a recommendation process. This paper presents an approach for clustering educational content from those repositories, based on learning styles. A comparative analysis of three different clustering algorithms was performed and promising results were obtained. Based on the results, a learning objects recommendation process is discussed.

Introduction. Advances in computing on several fronts have produced increasingly dynamic systems that adapt to users' needs. In the educational context in particular, approaches have emerged that aim to improve the learning experience using content retrieval and personalization resources.

“…Entropy is a commonly used information theoretic external validation measure that measures the purity of the clusters with respect to given external class labels (Xiong et al, 2006). A perfect clustering has an entropy close to 0 which means that every cluster consists of points with only one class label.…”

Section: Clustering Evaluation (mentioning)

confidence: 99%
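
As a rough illustration of the entropy measure described in the statement above, the sketch below computes the size-weighted average of per-cluster class entropy. It assumes a base-2 logarithm and no normalization by the number of classes; other formulations exist, and the function name `clustering_entropy` is hypothetical.

```python
import math
from collections import Counter


def clustering_entropy(cluster_ids, class_labels):
    """Size-weighted average entropy of class labels within each cluster.

    Assumptions: log base 2, no normalization by the number of classes.
    """
    n = len(cluster_ids)
    members = {}
    for cluster, label in zip(cluster_ids, class_labels):
        members.setdefault(cluster, []).append(label)
    total = 0.0
    for labels in members.values():
        size = len(labels)
        counts = Counter(labels)
        # Entropy of the class distribution inside this cluster.
        h = -sum((k / size) * math.log2(k / size) for k in counts.values())
        total += (size / n) * h
    return total
```

Under these assumptions the result is 0 exactly when every cluster contains points of a single class, matching the "perfect clustering" case in the quoted statement.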

A Semi-supervised Learning Framework to Cluster Mixed Data Types

Abdullin

Nasraoui

2012

Proceedings of the International Conference on Knowledge Discovery and Information Retrieval

Abstract: We propose a semi-supervised framework to handle diverse data formats or data with mixed-type attributes. Our preliminary results in clustering data with mixed numerical and categorical attributes show that the proposed semi-supervised framework gives better clustering results in the categorical domain. Thus the seeds obtained from clustering the numerical domain give an additional knowledge to the categorical clustering algorithm. Additional results show that our approach has the potential to outperform clustering either domain on its own or clustering both domains after converting them to the same target domain.
