2020
DOI: 10.1039/d0mo00025f
|View full text |Cite
|
Sign up to set email alerts
|

DeepRMethylSite: a deep learning based approach for prediction of arginine methylation sites in proteins

Abstract: DeepRMethylSite is an ensemble-based deep learning model that takes protein sequences as input and predicts sites of Arginine methylation. The implementation and source code are provided at https://github.com/dukkakc/DeepRMethylSite.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
28
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
7
2

Relationship

2
7

Authors

Journals

citations
Cited by 26 publications
(28 citation statements)
references
References 29 publications
0
28
0
Order By: Relevance
“…For example, Alanine (A) is represented as 100000000000000000000, Arginine (R) is represented as 010000000000000000000, and so on. However, PTM classification models such as DeepSuccinylSite 12 , DL-Malosite 13 , and DeepRMethylSite 14 implemented an embedded encoding scheme 15 with better performance metrics than one-hot encoding. In this study, we used an embedding layer for the encoding of protein sequences.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…For example, Alanine (A) is represented as 100000000000000000000, Arginine (R) is represented as 010000000000000000000, and so on. However, PTM classification models such as DeepSuccinylSite 12 , DL-Malosite 13 , and DeepRMethylSite 14 implemented an embedded encoding scheme 15 with better performance metrics than one-hot encoding. In this study, we used an embedding layer for the encoding of protein sequences.…”
Section: Methodsmentioning
confidence: 99%
“…Both MusiteDeep and DeepPhos employ binary encoding, which is static in nature. Our previous DL-based predictors for succinylation 12 , malonylation 13 , and methylation 14 instead utilize embedding 15 for encoding, demonstrating significantly improved model performance compared to binary encoding.…”
Section: Introductionmentioning
confidence: 99%
“…Any redundant sequences within and between the positive and negative sites were removed to obtain a non-redundant set. Similar to our previous studies ( Chaudhari et al, 2020 ; Thapa et al, 2020 ), we used an under-sampling strategy to balance the dataset, which had more negative sites than positive sites prior to balancing ( Aridas GLitaFNaCK, 2017 ). Under-sampling allows random selection of negative sequences to make the number of negative sites equal to the number of positive sequences, thus balancing the dataset.…”
Section: Methodsmentioning
confidence: 99%
“…In recent years, deep learning (DL) based methods have been used to predict the PTM sites in cellular proteins. Typical applications include DeepSuccinylSite [54], MusiteDeep [55], DeepRMethylSite [56], and DeepPhos [57]. In DL, a suitable raw vector is given to the architecture and transformed into highly abstract features by propagating through whole model.…”
Section: Introductionmentioning
confidence: 99%