2015
DOI: 10.1007/s10844-014-0353-0
|View full text |Cite
|
Sign up to set email alerts
|

An efficient and flexible scanning of databases of protein secondary structures

Abstract: Protein secondary structure describe protein construction in terms of regular spatial shapes, including alpha-helices, beta-strands, and loops, which protein amino acid chain can adopt in some of its regions. This information is supportive for protein classification, functional annotation, and 3D structure prediction. The relevance of this information and the scope of its practical applications cause the requirement for its effective storage and processing. Relational databases, widely-used in commercial syste… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
5
5

Relationship

2
8

Authors

Journals

citations
Cited by 27 publications
(8 citation statements)
references
References 32 publications
0
8
0
Order By: Relevance
“…Feature representation, fusion, 22 , 23 , 24 , 25 , 26 , 27 , 28 , 29 , 30 , 31 , 32 , 33 , 34 , 35 , 36 , 37 , 38 , 39 , 40 , 41 and selection 42 , 43 , 44 , 45 , 46 , 47 , 48 , 49 are the key steps in the machine learning process. In this paper, we propose and employ three feature representation algorithms, including PS(k-mer)NP, PCPs, 10 and RFHC-GACs.…”
Section: Methodsmentioning
confidence: 99%
“…Feature representation, fusion, 22 , 23 , 24 , 25 , 26 , 27 , 28 , 29 , 30 , 31 , 32 , 33 , 34 , 35 , 36 , 37 , 38 , 39 , 40 , 41 and selection 42 , 43 , 44 , 45 , 46 , 47 , 48 , 49 are the key steps in the machine learning process. In this paper, we propose and employ three feature representation algorithms, including PS(k-mer)NP, PCPs, 10 and RFHC-GACs.…”
Section: Methodsmentioning
confidence: 99%
“…By combining multiple weak classifiers, the final results can be voted or averaged to obtain an overall model with higher accuracy, better general performance, and resistance to overfitting. This algorithm has been extensively used in bioinformatics and other areas, and has been confirmed to be an effective modeling technique in various domains (Ding et al, 2016a,b;Mrozek et al, 2016;Qiu et al, 2016;Wang et al, 2017;Wei et al, 2017a,b,c;Yu et al, 2017a;Zheng et al, 2017;Tang et al, 2018Tang et al, , 2019aXue et al, 2018;Degenhardt et al, 2019;Xu et al, 2019). In this study, the scikit-learn toolkit, available at https://scikit-learn.org, was used to establish the models.…”
Section: Algorithmmentioning
confidence: 99%
“…In [33] and [34], we reported the PSS-SQL (Protein Secondary Structure -Structured Query Language) for searching protein similarities on the basis of their secondary structures in the Microsoft SQL Server relational database. The newest version of the PSS-SQL search engine [35] utilizes a multi-threaded alignment procedure, which allows effective querying on DBMSs hosted on computers with multi-core CPUs. However, capabilities of the PSS-SQL are limited to processing only the secondary structures of proteins.…”
Section: B Related Workmentioning
confidence: 99%