2021
DOI: 10.1016/j.neucom.2020.02.079
|View full text |Cite
|
Sign up to set email alerts
|

Using a low correlation high orthogonality feature set and machine learning methods to identify plant pentatricopeptide repeat coding gene/protein

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 37 publications
0
3
0
Order By: Relevance
“…Hence, these features had to be removed before analyzing the importance of the others. Furthermore, another test also had to be performed first by comparing the precisions under different thresholds, in order to determine the optimum threshold for removing the redundant features [87]. The results are shown in Figure 4D, from which it can be observed that the amplitudes of the four matrices begin to flatten when the threshold is larger than 0.7.…”
Section: Resultsmentioning
confidence: 99%
“…Hence, these features had to be removed before analyzing the importance of the others. Furthermore, another test also had to be performed first by comparing the precisions under different thresholds, in order to determine the optimum threshold for removing the redundant features [87]. The results are shown in Figure 4D, from which it can be observed that the amplitudes of the four matrices begin to flatten when the threshold is larger than 0.7.…”
Section: Resultsmentioning
confidence: 99%
“…Maximum-Relevance-Maximum-Distance in [32,33] and Analysis of Variance (ANOVA) in [34] are typical feature selection approaches. For optimum feature representation, [35][36][37] used the principal component analysis (PCA) and misclassifcation error (MCE) to extract optimal feature representation for pentatricopeptide-repeat proteins prediction and got 97.9% accuracy. Li et al in [33] used the above method to design a model for the prediction of anticancer peptide sequences with 19-dimensional attributes.…”
Section: Introductionmentioning
confidence: 99%
“…MRMD ( Zou et al, 2016 ; Ao et al, 2020 ; Li et al, 2020a ; Li et al, 2020b ; Meng et al, 2020 ) and ANOVA ( Anderson, 2001 ; Lv et al, 2019 ) are standard feature selection methods. For optimal feature identification, ( Feng et al, 2021 ) uses the PCA and MCE methods to make the features orthogonal and obtain the core feature set with the minimum 10-dimensional attributes for PPR gene identification and realized 97.9% accuracy. ( Li et al, 2020b ) used a 19-dimensional feature model to classify anticancer peptide sequences.…”
Section: Introductionmentioning
confidence: 99%