Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics 2017
DOI: 10.1145/3107411.3107416
|View full text |Cite
|
Sign up to set email alerts
|

Determining Optimal Features for Predicting Type IV Secretion System Effector Proteins for Coxiella burnetii

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
5
0

Year Published

2018
2018
2020
2020

Publication Types

Select...
4
1
1

Relationship

4
2

Authors

Journals

citations
Cited by 6 publications
(5 citation statements)
references
References 20 publications
0
5
0
Order By: Relevance
“…The total number of features, including elements of vector features, was 1,027. The complete list of these features and the tools and software needed for their computation are presented in Esna Ashari et al (2017, 2018).…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The total number of features, including elements of vector features, was 1,027. The complete list of these features and the tools and software needed for their computation are presented in Esna Ashari et al (2017, 2018).…”
Section: Methodsmentioning
confidence: 99%
“…We suspect that the use of these differing feature sets explains the differences in effector predictions by the different algorithms. As a result of the disparities between the results of earlier methods, we assembled all the features used in prior studies and used a multi-level, statistical approach to determine which were the most effective in predicting effector proteins (Esna Ashari et al, 2017, 2018). Because of the number of validated effectors available for L. pneumophila , we then ran a number of experiments on the whole genome of L. pneumophila using our optimal set of features (Esna Ashari et al, 2019).…”
Section: Introductionmentioning
confidence: 99%
“…This proteome contains 2,942 protein sequences and was used as our 134 test set [S2 File]. We calculated the feature values for all the protein sequences in L. pneumophila using different tools 135 and programming languages as described in [11]. We then used our three models for de novo prediction of effector 136 proteins in the L. pneumophila proteome.…”
Section: (C) Machine Learning Models and Validationmentioning
confidence: 99%
“…As a result, recent studies have 25 focused on using prediction approaches such as scoring effector proteins based on their characteristics or using machine 26 learning algorithms [5][6][7][8][9][10]. Because these methods considered different sets of features, we examined their effectiveness 27 in an earlier study and determined a set of optimal features for prediction of T4SS effector proteins [11][12]. By features, 28 we refer here to the characteristics and properties of protein sequences that can be measured and thus assigned binary or 29 continuous numerical values.…”
mentioning
confidence: 99%
“…Instead, a large number of computational methods have been developed for prediction of T4SEs in the last decade, which successfully speed up the process in terms of time and efficiency. These computational approaches can be categorized into two main groups: the first group of approaches infer new effectors based on sequence similarity with currently known effectors (Chen et al, 2010 ; Lockwood et al, 2011 ; Marchesini et al, 2011 ; Meyer et al, 2013 ; Sankarasubramanian et al, 2016 ; Noroy et al, 2019 ) or phylogenetic profiling analysis (Zalguizuri et al, 2019 ), and the second group of approaches involve learning the patterns of known secreted effectors that distinguish them from non-secreted proteins based on machine learning and deep learning techniques (Burstein et al, 2009 ; Lifshitz et al, 2013 ; Zou et al, 2013 ; Wang et al, 2014 ; Ashari et al, 2017 ; Wang Y. et al, 2017 ; Esna Ashari et al, 2018 , 2019a , b ; Guo et al, 2018 ; Xiong et al, 2018 ; Xue et al, 2018 ; Acici et al, 2019 ; Chao et al, 2019 ; Hong et al, 2019 ; Wang J. et al, 2019 ; Li J. et al, 2020 ; Yan et al, 2020 ). In the latter group of methods, Burstein et al ( 2009 ) worked on Legionella pneumophila to identify T4SEs and validated 40 novel effectors which were predicted by machine learning algorithms.…”
Section: Introductionmentioning
confidence: 99%