iDNA-Prot|dis: Identifying DNA-Binding Proteins by Incorporating Amino Acid Distance-Pairs and Reduced Alphabet Profile into the General Pseudo Amino Acid Composition

Liu, Bin; Xu, Jinghao; Lan, Xun; Xu, Ruifeng; Zhou, Jiyun; Wang, Xiaolong; Chou, Kuo‐Chen

doi:10.1371/journal.pone.0106691

Cited by 254 publications

(219 citation statements)

References 96 publications

Supporting

Mentioning

219

Contrasting

Order By: Relevance

“…These profile-based methods can significantly improve the protein remote homology detection [7,8], protein fold recognition and so forth. Moreover, added into the amino acid composition category are 3 new modes: they are "DR" [274], "Distance Pair" [271], and "PDT" [270]. DR is the abbreviation for "Distance-based Residue".…”

Section: Category Modementioning

confidence: 99%

“…It is sequence-based method, in which the generated feature vector for protein sequence is based on the distance between residue pairs and has shown better performance for protein remote homology detection. "Distance Pair" method incorporates the amino acid distance pair coupling information and the amino acid reduced alphabet profile into the general pseudo amino acid composition (PseAAC) [108] vector, which is very useful for analysing DNA-binding proteins [15,170,189,275]. PDT is the abbreviation for "physicochemical distance transformation", which can incorporate considerable sequence-order information or important patterns of protein/peptide sequences into Pseudo components [28], which is very useful for conducting various proteome analyses [17, 23, 215-217, 224, 225, 231, 235, 276-289] and genome analysis as well [216,218,220,223,229,255,277,290].…”

Section: Category Modementioning

confidence: 99%

See 1 more Smart Citation

Pse-in-One 2.0: An Improved Package of Web Servers for Generating Various Modes of Pseudo Components of DNA, RNA, and Protein Sequences

Liu¹,

Wu²,

Chou³

2017

Self Cite

138

View full text Add to dashboard Cite

Pse-in-One 2.0 is a package of web-servers evolved from Pse-in-One (Liu, B., Liu, F., Wang, X., Chen, J. Fang, L. & Chou, K.C. Nucleic Acids Research, 2015, 43:W65-W71). In order to make it more flexible and comprehensive as suggested by many users, the updated package has incorporated 23 new pseudo component modes as well as a series of new feature analysis approaches. It is available at http://bioinformatics.hitsz.edu.cn/Pse-in-One2.0/. Moreover, to maximize the convenience of users, provided is also the stand-alone version called "Pse-inOne-Analysis", by which users can significantly speed up the analysis of massive sequences.

show abstract

Section: Category Modementioning

confidence: 99%

Section: Category Modementioning

confidence: 99%

Pse-in-One 2.0: An Improved Package of Web Servers for Generating Various Modes of Pseudo Components of DNA, RNA, and Protein Sequences

Liu¹,

Wu²,

Chou³

2017

Self Cite

138

View full text Add to dashboard Cite

show abstract

“…7-10) above were often used in the literature to measure the prediction quality of a prediction method, they are no longer the best ones because they lack intuitiveness and are not easy to understand for most biologists, particularly the MCC (the Matthews correlation coefficient). To make it easy to read, we adopt an additional four metrics proposed by Chou (Chou 2001a, b;Chen et al 2013;Lin et al 2014;Liu et al 2014;Guo et al 2014):…”

Section: Evaluation Indicesmentioning

confidence: 99%

TargetFreeze: Identifying Antifreeze Proteins via a Combination of Weights using Sequence Evolutionary Information and Pseudo Amino Acid Composition

Han

et al. 2015

J Membrane Biol

View full text Add to dashboard Cite

Antifreeze proteins (AFPs) are indispensable for living organisms to survive in an extremely cold environment and have a variety of potential biotechnological applications. The accurate prediction of antifreeze proteins has become an important issue and is urgently needed. Although considerable progress has been made, AFP prediction is still a challenging problem due to the diversity of species. In this study, we proposed a new sequence-based AFP predictor, called TargetFreeze. TargetFreeze utilizes an enhanced feature representation method that weightedly combines multiple protein features and takes the powerful support vector machine as the prediction engine. Computer experiments on benchmark datasets demonstrate the superiority of the proposed TargetFreeze over most recently released AFP predictors. We also implemented a user-friendly web server, which is openly accessible for academic use and is available at http://csbio.njust.edu.cn/bioinf/TargetFreeze. TargetFreeze supplements existing AFP predictors and will have potential applications in AFP-related biotechnology fields.

show abstract

“…More recently the notion of reduce alphabet amino acid composition method (RAAAC) was applied by different researchers and achieved remarkable results. Further it has been used in various area of computational biology, such as prediction of DNA-Binding proteins [21], prediction of defensin family and subfamilies [22] and prediction of bioluminescent proteins [23]. Similarly, Feng et al have used RAAAC and Support Vector Machine (SVM) for prediction of HSPs families and obtained maximum overall accuracy 87.82% [2].…”

Section: Introductionmentioning

confidence: 99%

Identification of Heat Shock Protein families and J-protein types by incorporating Dipeptide Composition into Chou's general PseAAC

Ahmad

Kabir

Hayat

2015

Computer Methods and Programs in Biomedicine

View full text Add to dashboard Cite

iDNA-Prot|dis: Identifying DNA-Binding Proteins by Incorporating Amino Acid Distance-Pairs and Reduced Alphabet Profile into the General Pseudo Amino Acid Composition

Cited by 254 publications

References 96 publications

Pse-in-One 2.0: An Improved Package of Web Servers for Generating Various Modes of Pseudo Components of DNA, RNA, and Protein Sequences

Pse-in-One 2.0: An Improved Package of Web Servers for Generating Various Modes of Pseudo Components of DNA, RNA, and Protein Sequences

TargetFreeze: Identifying Antifreeze Proteins via a Combination of Weights using Sequence Evolutionary Information and Pseudo Amino Acid Composition

Identification of Heat Shock Protein families and J-protein types by incorporating Dipeptide Composition into Chou's general PseAAC

Contact Info

Product

Resources

About