2019
DOI: 10.1016/j.compbiolchem.2019.107088
|View full text |Cite
|
Sign up to set email alerts
|

DM-RPIs: Predicting ncRNA-protein interactions using stacked ensembling strategy

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
9
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
7
2

Relationship

1
8

Authors

Journals

citations
Cited by 20 publications
(10 citation statements)
references
References 30 publications
1
9
0
Order By: Relevance
“…2, the k-mer aptamer frequency implies that the k-mer usage is an essential factor for APIs. This finding is justified by the previous studies [87][88][89][90][91][92] which proved k-mer frequency plays an important role in interaction related to riboswitch, DNA, RNA, ncRNA, lncRNA, etc. This may be due to aptamers in this study were considered as the type of RNA and DNA.…”
Section: Discussionsupporting
confidence: 53%
“…2, the k-mer aptamer frequency implies that the k-mer usage is an essential factor for APIs. This finding is justified by the previous studies [87][88][89][90][91][92] which proved k-mer frequency plays an important role in interaction related to riboswitch, DNA, RNA, ncRNA, lncRNA, etc. This may be due to aptamers in this study were considered as the type of RNA and DNA.…”
Section: Discussionsupporting
confidence: 53%
“…Most existing studies extracted ncRNA and protein sequence features by using a simple k-mer: 3-mer frequency feature for protein and 4-mer frequency feature for ncRNA [22,24,26,29,33]. For protein, 20 amino acids can be classified into seven groups based on their dipole moments and side-chain volume: 1 =AA, G, V,, 2 =AI, L, F, P,, =AY, M, T, S,, 4 =AH, N, Q, W,, 5 =AR, K,, 6 =AD, E, and 7 ={C} [33].…”
Section: Sequence Codingmentioning
confidence: 99%
“…Besides, Wang et al utilized the deep convolutional neural network (CNN) to learn high-level features from the RNA and protein sequences, further feeding them into an extreme learning machine (ELM) for classification [23]. Furthermore, our group designed DM-RPIs, a classifier integrated SVM, RF, and CNN to classify ncRPIs by learning the discriminative features from 3-mer and 4-mer frequency of proteins and ncRNAs, respectively [24]. In addition, LightGBM, rpiCOOL, RPIFSE, RPI-SAN, and LPI-CNNCP also made ncRNA-protein interaction(ncRPI) predictions based on primary sequence [25][26][27][28][29].…”
Section: Introductionmentioning
confidence: 99%
“…• each protein sequence is treated as a sentence, made by overlapping words (k-mers) to incorporate some context-order information in the resulting distributed representation; • the word size is 3, which seems to work properly to embed amino acid sequences for biological tasks (S. Cheng et al, 2019, Yi et al (2020); • the sequence vector is defined as the arithmetic mean of all its word vectors.…”
Section: Continuous Distributed Representations For Protein Sequencesmentioning
confidence: 99%