2019
DOI: 10.1109/access.2019.2952738
|View full text |Cite
|
Sign up to set email alerts
|

Improved Prediction of Cell-Penetrating Peptides via Effective Orchestrating Amino Acid Composition Feature Representation

Abstract: Cell-penetrating peptides (CPPs) promote the transport of pharmacologically active molecules, such as nanoparticles, plasmid DNA and short interfering RNA. Accurate prediction of new CPPs is a prerequisite for in-depth study of such molecules. Biological experimental predictions can provide an accurate description of the penetrating properties of CPPs. However, predicting CPPs by wet laboratory experiments is both resource-intensive and time-consuming. Therefore, the development of effective calculation method… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 16 publications
(10 citation statements)
references
References 97 publications
0
10
0
Order By: Relevance
“…In order to distinguish between thermophilic and nonthermophilic proteins, SVM (Ding et al, 2016a,b;He et al, 2018;Qiao et al, 2018;Wei et al, 2018;Fu et al, 2019;Wang et al, 2019b), random forest [RF, (Ding et al, 2017;Wang et al, 2019a)], decision tree (Mohasseb et al, 2018;, and naïve Bayes [NB, (Rajaraman and Chokkalingam, 2014)] methods were used in our experiment. The first two methods were implemented and optimized in the python 3.7 environment with our edited code.…”
Section: Machine Learning Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…In order to distinguish between thermophilic and nonthermophilic proteins, SVM (Ding et al, 2016a,b;He et al, 2018;Qiao et al, 2018;Wei et al, 2018;Fu et al, 2019;Wang et al, 2019b), random forest [RF, (Ding et al, 2017;Wang et al, 2019a)], decision tree (Mohasseb et al, 2018;, and naïve Bayes [NB, (Rajaraman and Chokkalingam, 2014)] methods were used in our experiment. The first two methods were implemented and optimized in the python 3.7 environment with our edited code.…”
Section: Machine Learning Methodsmentioning
confidence: 99%
“…To quantitatively identify proteins, the physicochemical characteristics were obtained using a method (temporarily called 188d), which could extract sequence information and amino acid properties (Song et al, 2014;Xu et al, 2014Xu et al, , 2018Fu et al, 2019;Liu, 2019;Zhu et al, 2019). The first 20 elements in the results of this method denoted the frequency of the 20 original amino acids (Zhu et al, 2019); the next 24 features reflected the group proportion corresponding to three groups (Qu et al, 2019); the following 120 dimensions were the distributions of three groups in five local positions (Cai et al, 2003); the last 24 features were the numbers of three types of dipeptides.…”
Section: Physicochemical Characteristicsmentioning
confidence: 99%
“…In our experiment, we used the following four indicators to evaluate the predictive performance of our proposed model, including Accuracy (ACC), Sensitivity (SN), Specificity (SP), and Mathew's Correlation Coefficient (MCC). They are the four commonly used indicators for classifier performance evaluation in other Bioinformatics fields (Zhang et al, 2008(Zhang et al, , 2018a(Zhang et al, ,b,c, 2019bWei et al, 2017bWei et al, , 2019bZeng et al, 2017bZeng et al, , 2019cChen et al, 2018;Lu et al, 2018a,b;Fu et al, 2019;Gong et al, 2019;Jin et al, 2019;Liu and Li, 2019;Liu et al, 2019c,d;Manavalan et al, 2019a,b,c,d;Basith et al, 2020). Their calculation formulas are as follows:…”
Section: Performance Indicatorsmentioning
confidence: 99%
“…Since safety, stability, and specificity of peptides for various utilizations is of great value, there is always the need to introduce novel natural or synthetic CPPs. As protein derived penetrating peptides constitute about 80–90% of CPPs, it is arguable that huge quantities of CPPs are hidden in protein sequences ( Fu et al, 2019 ). Screening the proteome of the Coronaviridae family for this purpose was somehow neglected until the recent pandemic.…”
Section: Introductionmentioning
confidence: 99%
“…There are various indicators such as sensitivity, specificity, accuracy, receiver operating characteristic, and Matthew's correlation coefficient to assess the performance of the models. Then statistical methods such as jackknife test, k-fold cross validation, and independent test are used to verify indicators in different predictor models ( Fu et al, 2019 ).…”
Section: Introductionmentioning
confidence: 99%