International initiatives such as the Cancer Genome Atlas (TCGA) and the International Cancer Genome Consortium (ICGC) are collecting multiple datasets at different genome-scales with the aim of identifying novel cancer biomarkers and predicting survival of patients. To analyze such data, several statistical methods have been applied, among them Cox regression models. Although these models provide a good statistical framework to analyze omic data, there is still a lack of studies that illustrate advantages and drawbacks in integrating biological information and selecting groups of biomarkers. In fact, classical Cox regression algorithms focus on the selection of a single biomarker, without taking into account the strong correlation between genes. Even though network-based Cox regression algorithms overcome such drawbacks, such network-based approaches are less widely used within the life science community. In this article, we aim to provide a clear methodological framework on the use of such approaches in order to turn cancer research results into clinical applications. Therefore, we first discuss the rationale and the practical usage of three recently proposed network-based Cox regression algorithms (i.e., Net-Cox, AdaLnet, and fastcox). Then, we show how to combine existing biological knowledge and available data with such algorithms to identify networks of cancer biomarkers and to estimate survival of patients. Finally, we describe in detail a new permutation-based approach to better validate the significance of the selection in terms of cancer gene signatures and pathway/networks identification. We illustrate the proposed methodology by means of both simulations and real case studies. Overall, the aim of our work is two-fold. Firstly, to show how network-based Cox regression models can be used to integrate biological knowledge (e.g., multi-omics data) for the analysis of survival data. Secondly, to provide a clear methodological and computational approach for investigating cancers regulatory networks.
Breast cancer is one of the most common invasive tumors causing high mortality among women. It is characterized by high heterogeneity regarding its biological and clinical characteristics. Several high-throughput assays have been used to collect genome-wide information for many patients in large collaborative studies. This knowledge has improved our understanding of its biology and led to new methods of diagnosing and treating the disease. In particular, system biology has become a valid approach to obtain better insights into breast cancer biological mechanisms. A crucial component of current research lies in identifying novel biomarkers that can be predictive for breast cancer patient prognosis on the basis of the molecular signature of the tumor sample. However, the high dimension and low sample size of data greatly increase the difficulty of cancer survival analysis demanding for the development of ad-hoc statistical methods. In this work, we propose novel screening-network methods that predict patient survival outcome by screening key survival-related genes and we assess the capability of the proposed approaches using METABRIC dataset. In particular, we first identify a subset of genes by using variable screening techniques on gene expression data. Then, we perform Cox regression analysis by incorporating network information associated with the selected subset of genes. The novelty of this work consists in the improved prediction of survival responses due to the different types of screenings (i.e., a biomedical-driven, data-driven and a combination of the two) before building the network-penalized model. Indeed, the combination of the two screening approaches allows us to use the available biological knowledge on breast cancer and complement it with additional information emerging from the data used for the analysis. Moreover, we also illustrate how to extend the proposed approaches to integrate an additional omic layer, such as copy number aberrations, and we show that such strategies can further improve our prediction capabilities. In conclusion, our approaches allow to discriminate patients in high-and low-risk groups using few potential biomarkers and therefore, can help clinicians to provide more precise prognoses and to facilitate the subsequent clinical management of patients at risk of disease.
Duchenne muscular dystrophy (DMD) affects myofibers and muscle stem cells, causing progressive muscle degeneration and repair defects. It was unknown whether dystrophic myoblasts—the effector cells of muscle growth and regeneration—are affected. Using transcriptomic, genome-scale metabolic modelling and functional analyses, we demonstrate, for the first time, convergent abnormalities in primary mouse and human dystrophic myoblasts. In Dmdmdx myoblasts lacking full-length dystrophin, the expression of 170 genes was significantly altered. Myod1 and key genes controlled by MyoD (Myog, Mymk, Mymx, epigenetic regulators, ECM interactors, calcium signalling and fibrosis genes) were significantly downregulated. Gene ontology analysis indicated enrichment in genes involved in muscle development and function. Functionally, we found increased myoblast proliferation, reduced chemotaxis and accelerated differentiation, which are all essential for myoregeneration. The defects were caused by the loss of expression of full-length dystrophin, as similar and not exacerbated alterations were observed in dystrophin-null Dmdmdx-βgeo myoblasts. Corresponding abnormalities were identified in human DMD primary myoblasts and a dystrophic mouse muscle cell line, confirming the cross-species and cell-autonomous nature of these defects. The genome-scale metabolic analysis in human DMD myoblasts showed alterations in the rate of glycolysis/gluconeogenesis, leukotriene metabolism, and mitochondrial beta-oxidation of various fatty acids. These results reveal the disease continuum: DMD defects in satellite cells, the myoblast dysfunction affecting muscle regeneration, which is insufficient to counteract muscle loss due to myofiber instability. Contrary to the established belief, our data demonstrate that DMD abnormalities occur in myoblasts, making these cells a novel therapeutic target for the treatment of this lethal disease.
Ductal carcinoma is one of the most common cancers among women, and the main cause of death is the formation of metastases. The development of metastases is caused by cancer cells that migrate from the primary tumour site (the mammary duct) through the blood vessels and extravasating they initiate metastasis. Here, we propose a multi-compartment model which mimics the dynamics of tumoural cells in the mammary duct, in the circulatory system and in the bone. Through a branching process model, we describe the relation between the survival times and the four markers mainly involved in metastatic breast cancer (EPCAM, CD47, CD44 and MET). In particular, the model takes into account the gene expression profile of circulating tumour cells to predict personalised survival probability. We also include the administration of drugs as bisphosphonates, which reduce the formation of circulating tumour cells and their survival in the blood vessels, in order to analyse the dynamic changes induced by the therapy.We analyse the effects of circulating tumour cells on the progression of the disease providing a quantitative measure of the cell driver mutations needed for invading the bone tissue. Our model allows to design intervention scenarios that alter the patient-specific survival probability by modifying the populations of circulating tumour cells and it could be extended to other cancer metastasis dynamics.
This paper is concerned with the complex behavior arising in satisfiability problems. We present a new statistical physics-based characterization of the satisfiability problem. Specifically, we design an algorithm that is able to produce graphs starting from a k-SAT instance, in order to analyze them and show whether a Bose-Einstein condensation occurs. We observe that, analogously to complex networks, the networks of k-SAT instances follow Bose statistics and can undergo Bose-Einstein condensation. In particular, k-SAT instances move from a fit-get-rich network to a winner-takes-all network as the ratio of clauses to variables decreases, and the phase transition of k-SAT approximates the critical temperature for the Bose-Einstein condensation. Finally, we employ the fitness-based classification to enhance SAT solvers (e.g., ChainSAT) and obtain the consistently highest performing SAT solver for CNF formulas, and therefore a new class of efficient hardware and software verification tools.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.