2009 3rd International Conference on Bioinformatics and Biomedical Engineering 2009
DOI: 10.1109/icbbe.2009.5162487
|View full text |Cite
|
Sign up to set email alerts
|

Chaos Game Representation for Discriminating Thermophilic from Mesophilic Protein Sequences

Abstract: Can sequence analysis tell us about the function of protein? A basic question in protein science is which kind of proteins extent thermostability. Chaos game representation (CGR) can investigate the patterns hiding in protein sequence, visually revealing previously unknown structure. In this paper, we convert every protein sequence into a 20-dimensional vector by CGR algorithm, and based on these vectors we discriminate thermophiles from mesophiles using support vector machine (SVM). The overall accuracy achie… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

1
3
0

Year Published

2010
2010
2019
2019

Publication Types

Select...
3

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 20 publications
1
3
0
Order By: Relevance
“…For our updated new data here, for the 24-dimensional vector, by using genetic algorithm, we find C = 578.524063; = 7.822955 be the best one and we get ACC value is 0.8792 and MCC is 0.7587. These results are very similar with [66].…”
Section: B Results With Svmsupporting
confidence: 94%
See 1 more Smart Citation
“…For our updated new data here, for the 24-dimensional vector, by using genetic algorithm, we find C = 578.524063; = 7.822955 be the best one and we get ACC value is 0.8792 and MCC is 0.7587. These results are very similar with [66].…”
Section: B Results With Svmsupporting
confidence: 94%
“…The third author has already tested the first group experiment in [66], in which the results are ACC=0.8712 and MCC=0.745. For our updated new data here, for the 24-dimensional vector, by using genetic algorithm, we find C = 578.524063; = 7.822955 be the best one and we get ACC value is 0.8792 and MCC is 0.7587.…”
Section: B Results With Svmmentioning
confidence: 98%
“…A support vector machine (SVM) is an effective tool for classification and prediction, which has been used in various fields related to protein function prediction, such as prediction of thermal protein (Hu et al, 2009) and soluble protein (Susan, Abhijit, Bhaskar, Valadi, & Petety, 2006). However, it is still a fresh idea for Xoo prediction.…”
Section: Introductionmentioning
confidence: 99%
“…One of the most frequently used feature extraction methods is the so-called FCGR, in which the CGR image is split into small grids and the frequencies of points falling into each grid are taken as the feature of the corresponding protein sequence. For example, in [34][35][36][37][38]41], the CGR image of a protein sequence was split into 24 grids, and the frequencies of points falling into 24 grids are counted and taken as the numerical characteristics of the protein sequence. Under this procedure, a protein sequence can be converted into a 24-dimensional vector.…”
Section: Introductionmentioning
confidence: 99%