Meta-predictors make predictions by organizing and processing the predictions produced by several other predictors in a defined problem domain. A proficient meta-predictor not only offers better predicting performance than the individual predictors from which it is constructed, but it also relieves experimentally researchers from making difficult judgments when faced with conflicting results made by multiple prediction programs. As increasing numbers of predicting programs are being developed in a large number of fields of life sciences, there is an urgent need for effective meta-prediction strategies to be investigated. We compiled four unbiased phosphorylation site datasets, each for one of the four major serine/threonine (S/T) protein kinase families—CDK, CK2, PKA and PKC. Using these datasets, we examined several meta-predicting strategies with 15 phosphorylation site predictors from six predicting programs: GPS, KinasePhos, NetPhosK, PPSP, PredPhospho and Scansite. Meta-predictors constructed with a generalized weighted voting meta-predicting strategy with parameters determined by restricted grid search possess the best performance, exceeding that of all individual predictors in predicting phosphorylation sites of all four kinase families. Our results demonstrate a useful decision-making tool for analysing the predictions of the various S/T phosphorylation site predictors. An implementation of these meta-predictors is available on the web at: http://MetaPred.umn.edu/MetaPredPS/.
Phosphoprotein-binding domains (PPBDs) mediate many important cellular and molecular processes. Ten PPBDs have been known to exist in the human proteome, namely, 14-3-3, BRCT, C2, FHA, MH2, PBD, PTB, SH2, WD-40 and WW. PepCyber:P∼PEP is a newly constructed database specialized in documenting human PPBD-containing proteins and PPBD-mediated interactions. Our motivation is to provide the research community with a rich information source emphasizing the reported, experimentally validated data for specific PPBD–PPEP interactions. This information is not only useful for designing, comparing and validating the relevant experiments, but it also serves as a knowledge-base for computationally constructing systems signaling pathways and networks. PepCyber:P∼PEP is accessible through the URL, http://www.pepcyber.org/PPEP/. The current release of the database contains 7044 PPBD-mediated interactions involving 337 PPBD-containing proteins and 1123 substrate proteins.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.