2017
DOI: 10.1002/pmic.201700262
|View full text |Cite
|
Sign up to set email alerts
|

HPSLPred: An Ensemble Multi‐Label Classifier for Human Protein Subcellular Location Prediction with Imbalanced Source

Abstract: Protein subcellular localization prediction is an important and challenging problem. The traditional biology experiments are expensive and time-consuming, so more and more research interests tend to a series of machine learning approaches for predicting protein subcellular location. There are two main difficult problems among the existing state-of-the-art methods. First, most of the existing techniques are designed to deal with the multi-class but not the multi-label classification, which ignores the connectio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
57
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 107 publications
(57 citation statements)
references
References 65 publications
0
57
0
Order By: Relevance
“…Determination of protein subcellular location is significant, especially for target identification [35]. Furthermore, location prediction can give an idea on the role of a query protein and whether it is categorized as a cytoplasmic, membrane, or secretory protein.…”
Section: Subcellular Localization and Secretome Analysesmentioning
confidence: 99%
“…Determination of protein subcellular location is significant, especially for target identification [35]. Furthermore, location prediction can give an idea on the role of a query protein and whether it is categorized as a cytoplasmic, membrane, or secretory protein.…”
Section: Subcellular Localization and Secretome Analysesmentioning
confidence: 99%
“…Synthetic minority oversampling technique (SMOTE) [23] is an oversampling approach in which new synthetic samples are generated from the minority class on the basis of interpolation and succeeded in several bioinformatics problems. [24] The k nearest neighbors were identified for each sample under consideration in minority class by Euclidean distance. Synthetic samples were introduced along the line segments from the consideration sample to the selected nearest neighbors.…”
Section: Synthetic Minority Oversampling Techniquementioning
confidence: 99%
“…Protein function prediction is usually treated as a multi-label classification problem. Researchers have tried different computation methods in the last few decades for this problem [ 6 , 7 , 8 , 9 , 10 , 11 , 12 ]. In general, the following methods are used for protein function prediction.…”
Section: Introductionmentioning
confidence: 99%