2014
DOI: 10.1039/c4mb00316k
|View full text |Cite
|
Sign up to set email alerts
|

Identification of bacteriophage virion proteins by the ANOVA feature selection and analysis

Abstract: The bacteriophage virion proteins play extremely important roles in the fate of host bacterial cells. Accurate identification of bacteriophage virion proteins is very important for understanding their functions and clarifying the lysis mechanism of bacterial cells. In this study, a new sequence-based method was developed to identify phage virion proteins. In the new method, the protein sequences were initially formulated by the g-gap dipeptide compositions. Subsequently, the analysis of variance (ANOVA) with i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

4
130
1
1

Year Published

2015
2015
2023
2023

Publication Types

Select...
8
1
1

Relationship

1
9

Authors

Journals

citations
Cited by 162 publications
(136 citation statements)
references
References 50 publications
4
130
1
1
Order By: Relevance
“…The feature selection method improves the performance by removing some redundant features in high-dimensional data [6265]. In this study, we propose a two-step feature selection approach to select the most important features for predicting the phenotypic effects of SAVs.…”
Section: Methodsmentioning
confidence: 99%
“…The feature selection method improves the performance by removing some redundant features in high-dimensional data [6265]. In this study, we propose a two-step feature selection approach to select the most important features for predicting the phenotypic effects of SAVs.…”
Section: Methodsmentioning
confidence: 99%
“…The optimal feature subset will shorten the training and utilization times, reduce the measurement and storage requirements, avert overfitting and improve prediction performance [30]. Up to date, many effective feature selection techniques such as the analysis of variance [31], max-relevance-max-distance [32], minimum redundancy maximum relevance [33], principal component analysis [34] and recursive feature elimination algorithm [35, 36] have been proposed to reduce effects from noise or irrelevant features and provided good prediction results.…”
Section: Methodsmentioning
confidence: 99%
“…Jackknife test is widely used to evaluate the performance of predictors in protein subcellular localization and other field [65][66][67][68][69][70][71][72][73][74][75][76][77][78][79][80][81][82]. To make fair comparisons with existing methods, the jackknife test is also employed here.…”
Section: Measurementsmentioning
confidence: 99%