2020
DOI: 10.1186/s12859-020-03826-6
|View full text |Cite
|
Sign up to set email alerts
|

Variable selection from a feature representing protein sequences: a case of classification on bacterial type IV secreted effectors

Abstract: Background Classification of certain proteins with specific functions is momentous for biological research. Encoding approaches of protein sequences for feature extraction play an important role in protein classification. Many computational methods (namely classifiers) are used for classification on protein sequences according to various encoding approaches. Commonly, protein sequences keep certain labels corresponding to different categories of biological functions (e.g., bacterial type IV sec… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 54 publications
0
2
0
Order By: Relevance
“…Here, we introduce a cumulative voting strategy for selection of variables [ 24 ] corresponding to the 108 components of the extracted feature. The proposed variable selection strategy is divided into seven steps each of which is framed and labeled in a dashed box, as is shown in Figure 1 .…”
Section: Methodsmentioning
confidence: 99%
“…Here, we introduce a cumulative voting strategy for selection of variables [ 24 ] corresponding to the 108 components of the extracted feature. The proposed variable selection strategy is divided into seven steps each of which is framed and labeled in a dashed box, as is shown in Figure 1 .…”
Section: Methodsmentioning
confidence: 99%
“…Such approaches have been shown to significantly improve classification accuracy. The most frequently used methods for the ensemble are to improve and cover classification, are used for classification and regression [71].…”
Section: Decision Tree Classifier (Dtc)mentioning
confidence: 99%