2022
DOI: 10.47065/bits.v4i2.2022
|View full text |Cite
|
Sign up to set email alerts
|

Seleksi Fitur Menggunakan Eigen Vector Untuk Peningkatan Kinerja K-Means Clustering Dalam Pengelompokan Data

Abstract: The large number of data set attributes from the data grouping process with K-Means Clustering can affect the number of iterations produced. In this research, Eigen Vector is used to perform feature selection on the data set. The selected data set is then clustered using K-Means Clustering. The data set used in this research is the Wine Quality Dataset obtained from the UCI Machine Learning Repository, with 11 attributes, 4898 data records and 7 attribute classes. Then the South German Credit Dataset was obtai… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
0
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 7 publications
0
0
0
Order By: Relevance
“…Generally, one should select the most presentative ones from all operation parameters. There are many feature selection algorithms in the open literature; for example, K-means [7], principal component analysis [8], singular value decomposition [9], Pearson correlation [10], mutual information [11], trees [12], and many others. Among these existing popular feature selection methods, random forest (RF) [13] is very suitable for real-time implementation because of the fast computation time and simple parameter setting.…”
Section: Introductionmentioning
confidence: 99%
“…Generally, one should select the most presentative ones from all operation parameters. There are many feature selection algorithms in the open literature; for example, K-means [7], principal component analysis [8], singular value decomposition [9], Pearson correlation [10], mutual information [11], trees [12], and many others. Among these existing popular feature selection methods, random forest (RF) [13] is very suitable for real-time implementation because of the fast computation time and simple parameter setting.…”
Section: Introductionmentioning
confidence: 99%