Functional switches are often regulated by dynamic protein modifications. Assessing protein functions, in vivo, and their functional switches remains still a great challenge in this age of development. An alternative methodology based on in silico procedures may facilitate assessing the multifunctionality of proteins and, in addition, allow predicting functions of those proteins that exhibit their functionality through transitory modifications. Extensive research is ongoing to predict the sequence of protein modification sites and analyze their dynamic nature. This study reports the analysis performed on phosphorylation, Phospho.ELM (version 3.0) and glycosylation, OGlycBase (version 6.0) data for mining association patterns utilizing a newly developed algorithm, MAPRes. This method, MAPRes (Mining Association Patterns among preferred amino acid residues in the vicinity of amino acids targeted for post-translational modifications), is based on mining association among significantly preferred amino acids of neighboring sequence environment and modification sites themselves. Association patterns arrived at by association pattern/rule mining were in significant conformity with the results of different approaches. However, attempts to analyze substrate sequence environment of phosphorylation sites catalyzed for Tyr kinases and the sequence data for O-GlcNAc modification were not successful, due to the limited data available. Using the MAPRes algorithm for developing an association among PTM site with its vicinal amino acids is a valid method with many potential uses: this is indeed the first method ever to apply the association pattern mining technique to protein post-translational modification data.
Phosphorylation, one of the most common protein post-translational modifications (PTMs) on hydroxyl groups of S/T/Y is catalyzed by kinases and involves the presence or absence of certain amino acid residues in the vicinity of the phosphorylation sites. Using MAPRes, we have analyzed the substrate proteins of Phospho.ELM 7.0 and found that there are both general and specific requirements for the presence or absence of particular amino acids in the vicinity of phosphorylated S/T/Y for both of the phosphorylation data, whether or not kinase information was taken into account. Patterns extracted by MAPRes for kinase-specific data have been utilized to find the consensus sequence motifs for various kinases required to catalyze the process of phosphorylation on S/T/Y. These consensus sequences for different kinase groups, families, and individual members are consistent with those described earlier with some novel consensus reported for the first time. A comparison study for the patterns mined by MAPRes with the results of existing prediction methods was performed by searching for these patterns in the vicinity of phosphorylation sites predicted by different available method. This comparison resulted in 87-98% conformity with the results of the predictions by available methods. Additionally, the patterns mined by MAPRes for substrate sites included 61 kinases, the highest number analyzed so far.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.