2013
DOI: 10.1109/tcbb.2013.111
|View full text |Cite|
|
Sign up to set email alerts
|

Protein Function Prediction Using Multilabel Ensemble Classification

Abstract: Abstract-High-throughput experimental techniques produce several kinds of heterogeneous proteomic and genomic datasets. To computationally annotate proteins, it is necessary and promising to integrate these heterogeneous data sources. Some methods transform these data sources into different kernels or feature representations. Next, these kernels are linearly (or non-linearly) combined into a composite kernel. The composite kernel is utilized to develop a predictive model to infer the function of proteins. A pr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

0
4
0

Year Published

2014
2014
2021
2021

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 49 publications
(4 citation statements)
references
References 34 publications
0
4
0
Order By: Relevance
“…These predictors have been immensely successful in producing accurate predictions for many biomedical prediction tasks (Yang et al (2010); Altmann et al (2008); Liu et al (2012); Khan et al (2010); Pandey et al (2010)), including protein function prediction (Valentini (2014); Yu et al (2013); Guan et al (2008)). The success of these methods is attributed to their ability to reinforce accurate predictions as well as correct errors across many diverse base predictors (Tumer and Ghosh (1996)).…”
Section: Introductionmentioning
confidence: 99%
“…These predictors have been immensely successful in producing accurate predictions for many biomedical prediction tasks (Yang et al (2010); Altmann et al (2008); Liu et al (2012); Khan et al (2010); Pandey et al (2010)), including protein function prediction (Valentini (2014); Yu et al (2013); Guan et al (2008)). The success of these methods is attributed to their ability to reinforce accurate predictions as well as correct errors across many diverse base predictors (Tumer and Ghosh (1996)).…”
Section: Introductionmentioning
confidence: 99%
“…Moreover, they are not robust to noisy links and dense structures often present in certain network types (e.g., large blocks in gene co-expression networks or large hubs in protein-protein interaction networks); in this case, the resulting network is obscured by links from the noisy network types and can significantly impair the classification performance. To overcome these problems, some approaches train individual classifiers on these networks and then use ensemble learning methods to combine their predictions [35,26,32]. Lastly, such methods do not typically take into account correlations between different data sources, and often suffer from learning time and memory constraints.…”
Section: Introductionmentioning
confidence: 99%
“…In this way, each data source can be transformed into a network (or kernel). To leverage the networks derived from heterogeneous data sources to predict protein functions, some approaches first train individual classifiers on these networks and then use ensemble learning techniques to combine these classifiers [ 7 , 9 , 11 , 18 ]. Another set of algorithms first integrate these networks into a composite network and then train network-based learning methods [ 5 , 14 - 16 ].…”
Section: Introductionmentioning
confidence: 99%