2020
DOI: 10.20944/preprints202008.0254.v1
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Unsupervised Feature Selection Using Recursive k-Means Silhouette Elimination (RkSE): A Two-Scenario Case Study for Fault Classification of High-Dimensional Sensor Data

Abstract: Feature selection is a crucial step to overcome the curse of dimensionality problem in data mining. This work proposes Recursive k-means Silhouette Elimination (RkSE) as a new unsupervised feature selection algorithm to reduce dimensionality in univariate and multivariate time-series datasets. Where k-means clustering is applied recursively to select the cluster representative features, following a unique application of silhouette measure for each cluster and a user-defined threshold as the feature selection o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
1
1

Relationship

2
0

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 34 publications
0
2
0
Order By: Relevance
“…The comparison included the application of the chosen ML and DL approaches using different features extracted or selected via numerous feature extraction and selection methods, such as manually extracting time domain features for each sliding window, such as the mean, variance, standard deviation, and signal to noise ratio. PCA was also applied directly using the raw multivariate time domain sensor data without dimensionality reduction, and by using the Recursive k -Means Silhouette Elimination (R k SE) feature selection and dimensionality reduction algorithm [ 36 ]. Finally, the trained models and saved thresholds from each experiment could be easily used to achieve run-time predictions of new samples at real-time.…”
Section: Hydraulic System Fdd Overviewmentioning
confidence: 99%
“…The comparison included the application of the chosen ML and DL approaches using different features extracted or selected via numerous feature extraction and selection methods, such as manually extracting time domain features for each sliding window, such as the mean, variance, standard deviation, and signal to noise ratio. PCA was also applied directly using the raw multivariate time domain sensor data without dimensionality reduction, and by using the Recursive k -Means Silhouette Elimination (R k SE) feature selection and dimensionality reduction algorithm [ 36 ]. Finally, the trained models and saved thresholds from each experiment could be easily used to achieve run-time predictions of new samples at real-time.…”
Section: Hydraulic System Fdd Overviewmentioning
confidence: 99%
“…The comparison includes the application of the chosen ML and DL approaches using different features extracted or selected via numerous feature extraction and selection methods such as, manually extracting time domain features for each sliding window such as the mean, variance, standard deviation and signal to noise ratio. And applying PCA directly using the raw multivariate time domain sensor data without dimensionality reduction, as well as using Recursive k-Means Silhouette Elimination (RkSE) feature selection and dimensionality reduction algorithm [20] . Finally, the trained models and saved thresholds from each experiment can be easily used to achieve run-time predictions of new samples at real-time.…”
Section: Hydraulic System Fdd Overviewmentioning
confidence: 99%