2018
DOI: 10.1016/j.eswa.2018.07.029
|View full text |Cite
|
Sign up to set email alerts
|

Unsupervised graph-based feature selection via subspace and pagerank centrality

Abstract: Feature selection has become an indispensable part of intelligent systems, especially with the proliferation of high dimensional data. It identifies the subset of discriminative features leading to better learning performances, i.e., higher learning accuracy, lower computational cost and significant model interpretability. This paper proposes a new efficient unsupervised feature selection method based on graph centrality and subspace learning called UGFS for 'Unsupervised Graph-based Feature Selection'. The me… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
22
0
1

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 38 publications
(23 citation statements)
references
References 33 publications
0
22
0
1
Order By: Relevance
“…The drawbacks of the ECFS algorithm have motivated the development of the UGFS method [15], where features are graph nodes linked according to subspace preference clusters [6], i.e., features that correspond to significant cluster discrimination are linked. According to subspace learning [28], [41], the variance of the k-nearest neighbors of each data point indicate the most relevant features.…”
Section: A Feature Selectionmentioning
confidence: 99%
See 3 more Smart Citations
“…The drawbacks of the ECFS algorithm have motivated the development of the UGFS method [15], where features are graph nodes linked according to subspace preference clusters [6], i.e., features that correspond to significant cluster discrimination are linked. According to subspace learning [28], [41], the variance of the k-nearest neighbors of each data point indicate the most relevant features.…”
Section: A Feature Selectionmentioning
confidence: 99%
“…These latter are extracted and located on a graph, then the pageRank centrality measure is used to rank features mapped onto this graph. The UGFS performed significantly better than other methods, although three drawbacks have been noted: (i) the elements in a k-nearest neighbors set may belong to different clusters, and therefore, the search for cluster discriminating features may be unduly affected; (ii)considering all data points for feature combination may unduly change the results, especially if the data set contains outliers; and (iii) the correlation between features is not exploited as a means to disfavor the redundant features [15].…”
Section: A Feature Selectionmentioning
confidence: 99%
See 2 more Smart Citations
“…Graph data structures have been used widely to represent a set of entities with their connections, including, but not limited to, social network analysis (Scott, 1988;Backstrom and Kleinberg, 2014;Ji et al, 2016), and data mining (Al-Nabki et al, 2017b). Henni et al (2018) used a graph-based approach to build an unsupervised feature selection method, whereas nodes correspond to features, and the graph edges captured the relationship between those features. Next, to assign an importance score for each feature, they addressed several graph centrality measures, and, in particular, PageRank algorithm.…”
Section: Link-based Rankingmentioning
confidence: 99%