2023
DOI: 10.1101/2023.01.30.526378
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A random forest model for predicting exosomal proteins using evolutionary information and motifs

Abstract: Identification of secretory proteins in body fluids is one of the key challenges in the development of non-invasive diagnostics. It has been shown in the part that a significant number of proteins are secreted by cells via exosomes called exosomal proteins. In this study, an attempt has been made to build a model that can predict exosomal proteins with high precision. All models are trained, tested, and evaluated on a non-redundant dataset comprising 2831 exosomal and 2831 non-exosomal proteins, where no two p… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(10 citation statements)
references
References 44 publications
0
10
0
Order By: Relevance
“…The protein’s evolutionary features are known to provide additional crucial information about proteins [59,60]. The evolutionary information of proteins was retrieved from a position-specific scoring matrix (PSSM) profile generated using Position-Specific Iterated BLAST (PSI-BLAST) [61].…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…The protein’s evolutionary features are known to provide additional crucial information about proteins [59,60]. The evolutionary information of proteins was retrieved from a position-specific scoring matrix (PSSM) profile generated using Position-Specific Iterated BLAST (PSI-BLAST) [61].…”
Section: Methodsmentioning
confidence: 99%
“…It has been shown in number of studies in the past that evolutionary information based models perform better than single sequence models [72][73][74][75]. Thus, this study also explores the potential of evolutionary information in predicting AFPs.…”
Section: Pssm Profilesmentioning
confidence: 97%
See 2 more Smart Citations
“…For eg., if a query sequence was hit against the hormonal peptide sequence, it was classified as hormone otherwise it was classified as non-hormone. Several researchers have used and thoroughly annotated this methodology [24,26]. The BLAST database was created using all sequences present in the training dataset.…”
Section: Blast For Similarity Searchmentioning
confidence: 99%