2020
DOI: 10.1109/access.2020.3008951
|View full text |Cite
|
Sign up to set email alerts
|

Semi-Supervised Self-Training Feature Weighted Clustering Decision Tree and Random Forest

Abstract: A self-training algorithm is an iterative method for semi-supervised learning, which wraps around a base learner. It uses its own predictions to assign labels to unlabeled data. For a self-training algorithm, the classification ability of the base learner and the estimation of prediction confidence are very important. The classical decision tree as the base learner cannot be effective in a self-training algorithm, because it cannot correctly estimate its own predictions. In this paper, we propose a novel metho… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 13 publications
(9 citation statements)
references
References 34 publications
0
9
0
Order By: Relevance
“…The outstanding feature of CBI outline lies in its detail and operability, which has strong guiding and reference significance. Learning state evaluation algorithm mainly calculates learners' mastery of knowledge point type and learners' attention coefficient, determines learners' learning type, and provides targeted guidance for learners [5]. The element X in the matrix represents the scoring rate J of the ith learner.…”
Section: Realization Of Business English Professional Ability Trainingmentioning
confidence: 99%
“…The outstanding feature of CBI outline lies in its detail and operability, which has strong guiding and reference significance. Learning state evaluation algorithm mainly calculates learners' mastery of knowledge point type and learners' attention coefficient, determines learners' learning type, and provides targeted guidance for learners [5]. The element X in the matrix represents the scoring rate J of the ith learner.…”
Section: Realization Of Business English Professional Ability Trainingmentioning
confidence: 99%
“…For the experiment, logistic regression, support vector machine (SVM), decision tree, and random forest algorithms were chosen. e literature revealed that decision tree, logistic regression, random forest, and SVM algorithms are the leading classical state-ofthe-art detection algorithms [42][43][44]. e algorithms were used to train and validate the fraud detection model, following the train, test, and predict technique [45].…”
Section: Classification Algorithmsmentioning
confidence: 99%
“…The perpendicular line between the two class centroids The perpendicular line between the two cluster centers en, we use the RELIEF-F algorithm to calculate the weights of four features, which are 0.09, 0.14, 0.34, and 0.39, respectively. In the process of k-means clustering, the distances between instances and cluster centers are calculated by (10), where p indicates the number of features and w l indicates the weight of the lth feature. We obtain 6 misclassified samples, and the specific results are shown in Table 2.…”
Section: Class Centroids Cluster Centersmentioning
confidence: 99%
“…en remove features whose weights are less than 1/5 of the maximum. Finally, the distance is calculated according to formula (10). Fun2 uses the center of each class as the initial cluster center of k-means and the outputs of k-means as the partition results.…”
Section: Numerical Datamentioning
confidence: 99%
See 1 more Smart Citation