Breast cancer is a heterogeneous disease and one of the most common cancers among women. Recently, microRNAs (miRNAs) have been used as biomarkers due to their effective role in cancer diagnosis. This study proposes a support vector machine (SVM)-based classifier SVM-BRC to categorize patients with breast cancer into early and advanced stages. SVM-BRC uses an optimal feature selection method, inheritable bi-objective combinatorial genetic algorithm, to identify a miRNA signature which is a small set of informative miRNAs while maximizing prediction accuracy. MiRNA expression profiles of a 386-patient cohort of breast cancer were retrieved from The Cancer Genome Atlas. SVM-BRC identified 34 of 503 miRNAs as a signature and achieved a 10-fold cross-validation mean accuracy, sensitivity, specificity, and Matthews correlation coefficient of 80.38%, 0.79, 0.81, and 0.60, respectively. Functional enrichment of the 10 highest ranked miRNAs was analysed in terms of Kyoto Encyclopedia of Genes and Genomes and Gene Ontology annotations. Kaplan-Meier survival analysis of the highest ranked miRNAs revealed that four miRNAs, hsa-miR-503, hsa-miR-1307, hsa-miR-212 and hsa-miR-592, were significantly associated with the prognosis of patients with breast cancer.
Lung adenocarcinoma is a multifactorial disease. MicroRNA (miRNA) expression profiles are extensively used for discovering potential theranostic biomarkers of lung cancer. This work proposes an optimized support vector regression (SVR) method called SVR-LUAD to simultaneously identify a set of miRNAs referred to the miRNA signature for estimating the survival time of lung adenocarcinoma patients using their miRNA expression profiles. SVR-LUAD uses an inheritable bi-objective combinatorial genetic algorithm to identify a small set of informative miRNAs cooperating with SVR by maximizing estimation accuracy. SVR-LUAD identified 18 out of 332 miRNAs using 10-fold cross-validation and achieved a correlation coefficient of 0.88 ± 0.01 and mean absolute error of 0.56 ± 0.03 year between real and estimated survival time. SVR-LUAD performs well compared to some well-recognized regression methods. The miRNA signature consists of the 18 miRNAs which strongly correlates with lung adenocarcinoma: hsa-let-7f-1, hsa-miR-16-1, hsa-miR-152, hsa-miR-217, hsa-miR-18a, hsa-miR-193b, hsa-miR-3136, hsa-let-7g, hsa-miR-155, hsa-miR-3199-1, hsa-miR-219-2, hsa-miR-1254, hsa-miR-1291, hsa-miR-192, hsa-miR-3653, hsa-miR-3934, hsa-miR-342, and hsa-miR-141. Gene ontology annotation and pathway analysis of the miRNA signature revealed its biological significance in cancer and cellular pathways. This miRNA signature could aid in the development of novel therapeutic approaches to the treatment of lung adenocarcinoma.
BackgroundHeme binding proteins (HBPs) are metalloproteins that contain a heme ligand (an iron-porphyrin complex) as the prosthetic group. Several computational methods have been proposed to predict heme binding residues and thereby to understand the interactions between heme and its host proteins. However, few in silico methods for identifying HBPs have been proposed.ResultsThis work proposes a scoring card method (SCM) based method (named SCMHBP) for predicting and analyzing HBPs from sequences. A balanced dataset of 747 HBPs (selected using a Gene Ontology term GO:0020037) and 747 non-HBPs (selected from 91,414 putative non-HBPs) with an identity of 25% was firstly established. Consequently, a set of scores that quantified the propensity of amino acids and dipeptides to be HBPs is estimated using SCM to maximize the predictive accuracy of SCMHBP. Finally, the informative physicochemical properties of 20 amino acids are identified by utilizing the estimated propensity scores to be used to categorize HBPs. The training and mean test accuracies of SCMHBP applied to three independent test datasets are 85.90% and 71.57%, respectively. SCMHBP performs well relative to comparison with such methods as support vector machine (SVM), decision tree J48, and Bayes classifiers. The putative non-HBPs with high sequence propensity scores are potential HBPs, which can be further validated by experimental confirmation. The propensity scores of individual amino acids and dipeptides are examined to elucidate the interactions between heme and its host proteins. The following characteristics of HBPs are derived from the propensity scores: 1) aromatic side chains are important to the effectiveness of specific HBP functions; 2) a hydrophobic environment is important in the interaction between heme and binding sites; and 3) the whole HBP has low flexibility whereas the heme binding residues are relatively flexible.ConclusionsSCMHBP yields knowledge that improves our understanding of HBPs rather than merely improves the prediction accuracy in predicting HBPs.
BackgroundThough glioblastoma multiforme (GBM) is the most frequently occurring brain malignancy in adults, clinical treatment still faces challenges due to poor prognoses and tumor relapses. Recently, microRNAs (miRNAs) have been extensively used with the aim of developing accurate molecular therapies, because of their emerging role in the regulation of cancer-related genes. This work aims to identify the miRNA signatures related to survival of GBM patients for developing molecular therapies.ResultsThis work proposes a support vector regression (SVR)-based estimator, called SVR-GBM, to estimate the survival time in patients with GBM using their miRNA expression profiles. SVR-GBM identified 24 out of 470 miRNAs that were significantly associated with survival of GBM patients. SVR-GBM had a mean absolute error of 0.63 years and a correlation coefficient of 0.76 between the real and predicted survival time. The 10 top-ranked miRNAs according to prediction contribution are as follows: hsa-miR-222, hsa-miR-345, hsa-miR-587, hsa-miR-526a, hsa-miR-335, hsa-miR-122, hsa-miR-24, hsa-miR-433, hsa-miR-574 and hsa-miR-320. Biological analysis using the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway on the identified miRNAs revealed their influence in GBM cancer.ConclusionThe proposed SVR-GBM using an optimal feature selection algorithm and an optimized SVR to identify the 24 miRNA signatures associated with survival of GBM patients. These miRNA signatures are helpful to uncover the individual role of miRNAs in GBM prognosis and develop miRNA-based therapies.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-016-3321-y) contains supplementary material, which is available to authorized users.
Hepatocellular carcinoma (HCC) is one of the leading causes of cancer deaths worldwide. Recently, microRNAs (miRNAs) are reported to be altered and act as potential biomarkers in various cancers. However, miRNA biomarkers for predicting the stage of HCC are limitedly discovered. Hence, we sought to identify a novel miRNA signature associated with cancer stage in HCC. We proposed a support vector machine (SVM)-based cancer stage prediction method, SVM-HCC, which uses an inheritable bi-objective combinatorial genetic algorithm for selecting a minimal set of miRNA biomarkers while maximizing the accuracy of predicting the early and advanced stages of HCC. SVM-HCC identified a 23-miRNA signature that is associated with cancer stages in patients with HCC and achieved a 10-fold cross-validation accuracy, sensitivity, specificity, Matthews correlation coefficient, and area under the receiver operating characteristic curve (AUC) of 92.59%, 0.98, 0.74, 0.80, and 0.86, respectively; and test accuracy and test AUC of 74.28% and 0.73, respectively. We prioritized the miRNAs in the signature based on their contributions to predictive performance, and validated the prognostic power of the prioritized miRNAs using Kaplan–Meier survival curves. The results showed that seven miRNAs were significantly associated with prognosis in HCC patients. Correlation analysis of the miRNA signature and its co-expressed miRNAs revealed that hsa-let-7i and its 13 co-expressed miRNAs are significantly involved in the hepatitis B pathway. In clinical practice, a prediction model using the identified 23-miRNA signature could be valuable for early-stage detection, and could also help to develop miRNA-based therapeutic strategies for HCC.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.