2021
DOI: 10.1093/bioinformatics/btab811
|View full text |Cite
|
Sign up to set email alerts
|

Predicting the multi-label protein subcellular localization through multi-information fusion and MLSI dimensionality reduction based on MLFE classifier

Abstract: Motivation Multi-label protein subcellular localization (SCL) is an indispensable way to study protein function. It can locate a certain protein (such as the human transmembrane protein that promotes the invasion of the SARS-CoV-2) or expression product at a specific location in a cell, which can provide a reference for clinical treatment of diseases such as COVID-19. Results The paper proposes a novel method named ML-locMLFE… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 17 publications
(5 citation statements)
references
References 63 publications
0
4
0
Order By: Relevance
“…But knowledge data are limited and only applicable to wellcurated proteins, which limits the predictive power of this kind of method for novel or newly discovered proteins. In recent studies [75][76][77], different kinds of information are fused together for better model performance, given that computational methods excel with high dimensional data as inputs.…”
Section: Knowledge-based Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…But knowledge data are limited and only applicable to wellcurated proteins, which limits the predictive power of this kind of method for novel or newly discovered proteins. In recent studies [75][76][77], different kinds of information are fused together for better model performance, given that computational methods excel with high dimensional data as inputs.…”
Section: Knowledge-based Methodsmentioning
confidence: 99%
“…The fusion methods can basically be divided into two categories: feature-level fusion [77,104,105] and decision-level fusion [106]. Feature-level fusion is mostly based on average pooling, weighted combination [107], serial combination, or concatenation of selected values.…”
Section: Knowledge-based Ai Approachesmentioning
confidence: 99%
See 1 more Smart Citation
“…But knowledge data are limited and only applicable to wellcurated proteins, which limits the predictive power of this kind of method for novel or newly discovered proteins. In recent studies [75][76][77], different kinds of information are fused together for better model performance, given that computational methods excel with high dimensional data as inputs.…”
Section: Knowledge-based Methodsmentioning
confidence: 99%
“…Most of them use general sequence features rather than hand-crafted features related to specific sorting signals and claim to be able to address the problem of proteins localized at multiple sites (i.e., multi-labeled proteins), though there still remains the problem that their training data do not seem to have been annotated with a uniform criterion (see below). Some of them proposed extensions of existing sequence features, such as the k -mer compositions ( Li et al, 2019 ; Yao et al, 2019 ; Sahu et al, 2020 ), while some imported external information, such as Gene Ontology and protein-protein interactions ( Chen et al, 2021 ; Liu et al, 2021 ; Zhang et al, 2021 ). One method employed an ensemble approach of multiple classifiers with voting, claiming that the approach is effective in addressing the problem of imbalanced sizes of training data between different localization sites ( Wattanapornprom et al, 2021 ).…”
Section: Miscellaneous Algorithmsmentioning
confidence: 99%