As one of the most important reversible protein post-translation modifications, ubiquitination has been reported to be involved in lots of biological processes and closely implicated with various diseases. To fully decipher the molecular mechanisms of ubiquitination-related biological processes, an initial but crucial step is the recognition of ubiquitylated substrates and the corresponding ubiquitination sites. Here, a new bioinformatics tool named CKSAAP_UbSite was developed to predict ubiquitination sites from protein sequences. With the assistance of Support Vector Machine (SVM), the highlight of CKSAAP_UbSite is to employ the composition of k-spaced amino acid pairs surrounding a query site (i.e. any lysine in a query sequence) as input. When trained and tested in the dataset of yeast ubiquitination sites (Radivojac et al, Proteins, 2010, 78: 365–380), a 100-fold cross-validation on a 1∶1 ratio of positive and negative samples revealed that the accuracy and MCC of CKSAAP_UbSite reached 73.40% and 0.4694, respectively. The proposed CKSAAP_UbSite has also been intensively benchmarked to exhibit better performance than some existing predictors, suggesting that it can be served as a useful tool to the community. Currently, CKSAAP_UbSite is freely accessible at http://protein.cau.edu.cn/cksaap_ubsite/. Moreover, we also found that the sequence patterns around ubiquitination sites are not conserved across different species. To ensure a reasonable prediction performance, the application of the current CKSAAP_UbSite should be limited to the proteome of yeast.
Background: As one of the most common protein post-translational modifications, glycosylation is involved in a variety of important biological processes. Computational identification of glycosylation sites in protein sequences becomes increasingly important in the post-genomic era. A new encoding scheme was employed to improve the prediction of mucin-type O-glycosylation sites in mammalian proteins.
Sequence-based analysis and prediction are fundamental bioinformatic tasks that facilitate understanding of the sequence(-structure)-function paradigm for DNAs, RNAs and proteins. Rapid accumulation of sequences requires equally pervasive development of new predictive models, which depends on the availability of effective tools that support these efforts. We introduce iLearnPlus, the first machine-learning platform with graphical- and web-based interfaces for the construction of machine-learning pipelines for analysis and predictions using nucleic acid and protein sequences. iLearnPlus provides a comprehensive set of algorithms and automates sequence-based feature extraction and analysis, construction and deployment of models, assessment of predictive performance, statistical analysis, and data visualization; all without programming. iLearnPlus includes a wide range of feature sets which encode information from the input sequences and over twenty machine-learning algorithms that cover several deep-learning approaches, outnumbering the current solutions by a wide margin. Our solution caters to experienced bioinformaticians, given the broad range of options, and biologists with no programming background, given the point-and-click interface and easy-to-follow design process. We showcase iLearnPlus with two case studies concerning prediction of long noncoding RNAs (lncRNAs) from RNA transcripts and prediction of crotonylation sites in protein chains. iLearnPlus is an open-source platform available at https://github.com/Superzchen/iLearnPlus/ with the webserver at http://ilearnplus.erc.monash.edu/.
Sumoylation is one of the most essential mechanisms of reversible protein post-translational modifications and is a crucial biochemical process in the regulation of a variety of important biological functions. Sumoylation is also closely involved in various human diseases. The accurate computational identification of sumoylation sites in protein sequences aids in experimental design and mechanistic research in cellular biology. In this study, we introduced amino acid hydrophobicity as a parameter into a traditional binary encoding scheme and developed a novel sumoylation site prediction tool termed SUMOhydro. With the assistance of a support vector machine, the proposed method was trained and tested using a stringent non-redundant sumoylation dataset. In a leave-one-out cross-validation, the proposed method yielded an excellent performance with a correlation coefficient, specificity, sensitivity and accuracy equal to 0.690, 98.6%, 71.1% and 97.5%, respectively. In addition, SUMOhydro has been benchmarked against previously described predictors based on an independent dataset, thereby suggesting that the introduction of hydrophobicity as an additional parameter could assist in the prediction of sumoylation sites. Currently, SUMOhydro is freely accessible at http://protein.cau.edu.cn/others/SUMOhydro/.
AimTo investigate the role of neurotensin (NTS) in hepatocellular carcinoma (HCC) sub- grouping and the clinical and pathological significance of activation of NTS/IL-8 pathway in HCC.MethodsThe genome-wide gene expression profiling were conducted in 10 pairs of cancer tissues and corresponding normal adjacent tissues samples using Affymetrix GeneChip® Human Genome U133 Plus 2.0 microarray to screen differentially expressing genes and enrich dysfunctional activated pathways among different HCC subgroups. The levels of NTS protein and multiple inflammation and epithelial mesenchymal transition (EMT) related proteins, including IL-8, VEGF, MMP9, CD68, E-Cadherin, β-Catenin and Vimentin were examined in 64 cases of paraffin-embedded HCC samples using immunohistochemistry (IHC) staining method. The clinical outcome and overall survival (OS) were compared.ResultsA subgroup of HCC characterized by up-regulated NTS expression was accompanied by up-regulated inflammatory responses and EMT. The direct interaction between NTS and IL-8 was identified by pathway enrichment analysis. Significantly increased IL-8 protein was confirmed in 90.91% of NTS+ HCC samples and significantly positively correlated to the levels of NTS protein in cancer tissues (P = 0.036), which implied activation of NTS/IL-8 pathway in HCC. The levels of VEGF and MMP9 correlated with co-expression of NTS and IL-8. Increased infiltration of CD68+ macrophages and more cancer cells displaying EMT features were found in NTS+IL-8+ samples. The co-expression of NTS and IL-8 in cancer significantly correlated with the clinical outcomes, as the mortality rate of NTS+IL-8+ HCC patients is 2.5-fold higher than the others after the surgery (P = 0.022). Accordingly, the OS of NTS+IL-8+ HCC patients significantly decreased who are under a higher hazard of death at an expected hazard ratio (HR) of 3.457.ConclusionDysfunctional activation of the NTS/IL-8 pathway was detected in HCC which is associated with increased inflammatory response in microenvironment, enhanced EMT in cancer, and worse prognosis in HCC patients.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.