2021
DOI: 10.1109/tnnls.2020.3002576

A Cheap Feature Selection Approach for the K-Means Algorithm

Abstract: The increase in the number of features that need to be analyzed in a wide variety of areas, such as genome sequencing, computer vision, or sensor networks, represents a challenge for the K-means algorithm. In this regard, different dimensionality reduction approaches for the K-means algorithm have been designed recently, leading to algorithms that have been shown to generate competitive clusterings. Unfortunately, most of these techniques tend to have fairly high computational costs and/or might not be easy to para…
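The setting the abstract describes, feature selection as a cheap preprocessing step before K-means, can be illustrated with a generic pipeline. The sketch below uses scikit-learn's VarianceThreshold as a stand-in selector on synthetic data; the paper's actual selection criterion is not described in this excerpt, so every parameter here is illustrative:

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.feature_selection import VarianceThreshold

# Synthetic high-dimensional data: 10 informative dimensions plus 90 noisy ones.
X_info, _ = make_blobs(n_samples=300, centers=4, n_features=10, random_state=0)
noise = 0.01 * np.random.default_rng(0).normal(size=(300, 90))
X = np.hstack([X_info, noise])

# Cheap, generic selector: drop near-constant features before clustering.
# (Stand-in only -- not the selection criterion proposed in the paper.)
selector = VarianceThreshold(threshold=0.1)
X_reduced = selector.fit_transform(X)

labels = KMeans(n_clusters=4, n_init=10, random_state=0).fit_predict(X_reduced)
print(X.shape, "->", X_reduced.shape)  # e.g. (300, 100) -> (300, 10)
```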

Cited by 23 publications (9 citation statements)
References 37 publications
“…After calculation, the critical value F(4, 12) = 2.4801 at α = 0.1; we then compute F = 8.5221 for the KNN classifier and F = 10.0225 for the C4.5 classifier. Both values are greater than the critical value F(4, 12), which shows that the five methods are significantly different.…”
Section: Discussion
confidence: 86%
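For reference, the critical value and the comparison above can be reproduced with a standard F-distribution lookup. A minimal sketch using SciPy follows; the degrees of freedom (4, 12) are consistent with k = 5 methods compared on, presumably, N = 4 data sets, but that setup is an assumption about the citing paper:

```python
from scipy.stats import f

alpha = 0.10
df1, df2 = 4, 12  # assumed (k-1, (k-1)(N-1)) for k = 5 methods, N = 4 data sets

# Critical value of the F distribution at the 1 - alpha quantile.
f_crit = f.ppf(1 - alpha, df1, df2)
print(f"F critical ({df1}, {df2}) at alpha = {alpha}: {f_crit:.4f}")  # ~2.4801

# Observed test statistics reported for the two classifiers.
for name, stat in [("KNN", 8.5221), ("C4.5", 10.0225)]:
    verdict = "significant" if stat > f_crit else "not significant"
    print(f"{name}: F = {stat} -> {verdict} at alpha = {alpha}")
```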
“…Feature selection is an important data preprocessing step in the fields of granular computing and artificial intelligence [1][2][3][4][5][6]. Its main goal is to remove redundant features and reduce the complexity of the classification model, thereby improving the model's generalization ability [7][8][9][10][11][12].…”
Section: Introduction
confidence: 99%
“…The Laplacian score ranks features based on their intrinsic characteristics and their distinction from the other features [40]. The nearest-neighbor graph G can be constructed using the nearest-neighbor method, as expressed in equation (3), where G_ij is the value of the node pair (i, j), x_i and x_j are samples i and j, and m is the number of samples.…”
Section: Methods
confidence: 99%
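Equation (3) itself is not reproduced in the excerpt. A common construction behind the Laplacian score is a k-nearest-neighbor graph with heat-kernel weights, as in He et al. (2005); the sketch below assumes that standard formulation, and the function name, neighbor count k, and kernel width t are illustrative rather than taken from the citing paper:

```python
import numpy as np
from sklearn.neighbors import kneighbors_graph

def laplacian_scores(X, k=5, t=1.0):
    """Standard Laplacian score per feature (He et al., 2005).

    Lower score = the feature better preserves the local graph structure.
    """
    m, d = X.shape
    # Symmetric k-NN adjacency: G_ij = 1 when x_i and x_j are neighbors.
    A = kneighbors_graph(X, n_neighbors=k, mode="connectivity").toarray()
    A = np.maximum(A, A.T)
    # Heat-kernel weights on connected pairs: S_ij = exp(-||x_i - x_j||^2 / t).
    sq = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
    S = A * np.exp(-sq / t)
    D = np.diag(S.sum(axis=1))  # degree matrix
    L = D - S                   # graph Laplacian
    ones = np.ones(m)
    scores = np.empty(d)
    for r in range(d):
        f = X[:, r]
        # Center f against the degree-weighted mean before scoring.
        f_tilde = f - (f @ D @ ones) / (ones @ D @ ones) * ones
        scores[r] = (f_tilde @ L @ f_tilde) / (f_tilde @ D @ f_tilde + 1e-12)
    return scores
```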
“…(3) Clustering algorithms based on cluster-center selection, represented by K-means, are easily affected by the choice of initial cluster centers. This results in unstable clusterings and makes the algorithm prone to falling into a local optimum [22].…”
Section: Credit Rating Classification
confidence: 99%
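That initialization sensitivity is easy to demonstrate. A small sketch on synthetic data (scikit-learn names, illustrative parameters, not from the citing paper) comparing a single random initialization against k-means++ seeding with restarts:

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

X, _ = make_blobs(n_samples=500, centers=5, cluster_std=1.5, random_state=0)

# One run from a random initialization can land in a poor local optimum.
unstable = KMeans(n_clusters=5, init="random", n_init=1, random_state=1).fit(X)

# k-means++ seeding plus multiple restarts keeps the best of n_init runs.
stable = KMeans(n_clusters=5, init="k-means++", n_init=10, random_state=1).fit(X)

print("single random init inertia:", round(unstable.inertia_, 1))
print("k-means++ x10 inertia:     ", round(stable.inertia_, 1))  # typically lower
```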