2019
DOI: 10.2174/1389202920666190809095206

iMethylK-PseAAC: Improving Accuracy of Lysine Methylation Sites Identification by Incorporating Statistical Moments and Position Relative Features into General PseAAC via Chou’s 5-steps Rule

Abstract: Background: Methylation is one of the most important post-translational modifications in the human body, and it usually arises on lysine, among the most intensely modified residues. It plays a dynamic role in numerous biological processes, such as regulation of gene expression, regulation of protein function and RNA processing. Identifying lysine methylation sites is therefore an important challenge, as some experimental procedures are time-consuming. Objective: Herein, we propose a computational predictor …

Cited by 45 publications (11 citation statements)
References 170 publications (144 reference statements)
“…60% cutoff depicts that all sequences, which showed similarity more than 60% were excluded from the dataset to reduce redundancy in dataset and overfitting of the model. The reason for choosing this threshold was that it is supported by various previously reported studies (Shao and Chou 2020; Shao et al 2020; Khan et al 2019a, b; Jia et al 2019; Ilyas et al 2019; Hussain et al 2019a, b; Feng et al 2019; Cui et al 2019; Awais et al 2019). Clusters were formed comprising of 5101 non-redundant DNA replication proteins and 5227 non-redundant non-DNA replication proteins.…”
Section: Methods (mentioning)
confidence: 90%
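The redundancy filtering described in this statement can be sketched roughly as follows. The statement does not name the clustering tool or the similarity measure, so this is only an illustration under stated assumptions: `similarity` uses Python's `difflib.SequenceMatcher` ratio as a stand-in for a real sequence-identity score, and `filter_redundant` is a hypothetical greedy procedure, not the method used in the cited study.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Stand-in similarity in [0, 1]; an assumption, not a true sequence identity."""
    return SequenceMatcher(None, a, b, autojunk=False).ratio()


def filter_redundant(seqs, cutoff=0.60):
    """Greedily keep sequences whose similarity to every kept one is <= cutoff."""
    kept = []
    # Visit longer sequences first so they become the cluster representatives.
    for s in sorted(seqs, key=len, reverse=True):
        if all(similarity(s, rep) <= cutoff for rep in kept):
            kept.append(s)
    return kept


if __name__ == "__main__":
    toy = ["MKTAYIAKQR", "MKTAYIAKQK", "GGGGSSSSPP"]  # toy data, not from the paper
    print(filter_redundant(toy))
```

With a 60% cutoff, the second toy sequence is dropped because it differs from the first by a single residue, while the unrelated third sequence is kept.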
“…Further, statistical moments are computed from the obtained square matrix for dimensionality reduction and forming fixed-size feature vectors [27, 28]. As discussed previously, the three moments deployed in this study are Hahn, central and raw moments.…”
Section: Methods (mentioning)
confidence: 99%
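As a minimal sketch of how moments can turn a variable-length sequence into a fixed-size feature vector, the code below maps a sequence onto a square matrix and computes raw and central moments up to a chosen order. Several details are assumptions made for illustration and are not taken from the statement: the integer encoding in `CODE`, zero padding of unused cells, 1-based coordinates, and the default order of 3. Hahn moments are omitted here because they need a separate discrete orthogonal polynomial basis.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# Hypothetical encoding: each residue maps to its 1-based index in AMINO_ACIDS.
CODE = {aa: i + 1 for i, aa in enumerate(AMINO_ACIDS)}


def to_square_matrix(seq):
    """Fill an n x n matrix row by row, n = ceil(sqrt(len(seq))), zero padded."""
    if not seq:
        raise ValueError("sequence must be non-empty")
    values = [CODE[aa] for aa in seq]
    n = math.isqrt(len(values) - 1) + 1  # integer ceil(sqrt(k)) for k >= 1
    values += [0] * (n * n - len(values))
    return [values[r * n:(r + 1) * n] for r in range(n)]


def raw_moment(m, i, j):
    """Raw moment: sum over cells of p^i * q^j * value, with 1-based p, q."""
    return sum((p ** i) * (q ** j) * v
               for p, row in enumerate(m, start=1)
               for q, v in enumerate(row, start=1))


def central_moment(m, i, j):
    """Central moment: same sum, with coordinates shifted to the centroid."""
    m00 = raw_moment(m, 0, 0)
    xbar = raw_moment(m, 1, 0) / m00
    ybar = raw_moment(m, 0, 1) / m00
    return sum(((p - xbar) ** i) * ((q - ybar) ** j) * v
               for p, row in enumerate(m, start=1)
               for q, v in enumerate(row, start=1))


def moment_features(seq, order=3):
    """Concatenate raw and central moments for all i + j <= order."""
    m = to_square_matrix(seq)
    pairs = [(i, j) for i in range(order + 1)
             for j in range(order + 1) if i + j <= order]
    return ([raw_moment(m, i, j) for i, j in pairs]
            + [central_moment(m, i, j) for i, j in pairs])
```

Because the number of (i, j) pairs depends only on `order`, sequences of any length yield vectors of the same size, which is the dimensionality-reduction effect the statement refers to.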
“…Thus, to represent sequences as vectors, the pseudo amino acid composition (PseAAC) was proposed [17]. The idea of PseAAC is popular in bioinformatics research [18, 19] and has been used in numerous bio-medicine and medication improvement studies [20, 21] as well as other disciplines of computational proteomics. An extensive rundown of references is provided in a survey paper [22].…”
Section: Methods (mentioning)
confidence: 99%
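To illustrate the idea of representing a sequence as a vector, here is a rough sketch of the classic PseAAC form: 20 composition terms plus `lam` sequence-order correlation terms. It is a simplification under stated assumptions, not the feature set of the cited paper: it uses a single physicochemical property (the Kyte-Doolittle hydropathy scale, swapped in for the several properties of the original formulation), and the defaults `lam=3` and `w=0.05` are illustrative choices.

```python
import statistics

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Kyte-Doolittle hydropathy values (assumed here as the only property used).
KD = {"A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5, "Q": -3.5,
      "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5, "L": 3.8, "K": -3.9,
      "M": 1.9, "F": 2.8, "P": -1.6, "S": -0.8, "T": -0.7, "W": -0.9,
      "Y": -1.3, "V": 4.2}

# Standardize the property to zero mean and unit standard deviation.
_MEAN = statistics.fmean(KD.values())
_SD = statistics.pstdev(KD.values())
H = {aa: (v - _MEAN) / _SD for aa, v in KD.items()}


def pseaac(seq, lam=3, w=0.05):
    """Return a (20 + lam)-dimensional vector whose components sum to 1."""
    length = len(seq)
    if lam >= length:
        raise ValueError("lam must be smaller than the sequence length")
    # Composition part: normalized occurrence frequency of each residue type.
    freqs = [seq.count(aa) / length for aa in AMINO_ACIDS]
    # Correlation part: mean squared property difference at lag k = 1..lam.
    thetas = [
        sum((H[seq[i]] - H[seq[i + k]]) ** 2 for i in range(length - k))
        / (length - k)
        for k in range(1, lam + 1)
    ]
    denom = sum(freqs) + w * sum(thetas)
    return [f / denom for f in freqs] + [w * t / denom for t in thetas]
```

The first 20 components capture composition, and the last `lam` components retain some sequence-order information that plain amino acid composition discards.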