BackgroundCoordinated efforts to collect large-scale data sets provide a basis for systems level understanding of complex diseases. In order to translate these fragmented and heterogeneous data sets into knowledge and medical benefits, advanced computational methods for data analysis, integration and visualization are needed.MethodsWe introduce a novel data integration framework, Anduril, for translating fragmented large-scale data into testable predictions. The Anduril framework allows rapid integration of heterogeneous data with state-of-the-art computational methods and existing knowledge in bio-databases. Anduril automatically generates thorough summary reports and a website that shows the most relevant features of each gene at a glance, allows sorting of data based on different parameters, and provides direct links to more detailed data on genes, transcripts or genomic regions. Anduril is open-source; all methods and documentation are freely available.ResultsWe have integrated multidimensional molecular and clinical data from 338 subjects having glioblastoma multiforme, one of the deadliest and most poorly understood cancers, using Anduril. The central objective of our approach is to identify genetic loci and genes that have significant survival effect. Our results suggest several novel genetic alterations linked to glioblastoma multiforme progression and, more specifically, reveal Moesin as a novel glioblastoma multiforme-associated gene that has a strong survival effect and whose depletion in vitro significantly inhibited cell proliferation. All analysis results are available as a comprehensive website.ConclusionsOur results demonstrate that integrated analysis and visualization of multidimensional and heterogeneous data by Anduril enables drawing conclusions on functional consequences of large-scale molecular data. Many of the identified genetic loci and genes having significant survival effect have not been reported earlier in the context of glioblastoma multiforme. Thus, in addition to generally applicable novel methodology, our results provide several glioblastoma multiforme candidate genes for further studies.Anduril is available at http://csbi.ltdk.helsinki.fi/anduril/The glioblastoma multiforme analysis results are available at http://csbi.ltdk.helsinki.fi/anduril/tcga-gbm/
The development of therapeutic strategies to combat immune-associated diseases requires the molecular mechanisms of human Th17 cell differentiation to be fully identified and understood. To investigate transcriptional control of Th17 cell differentiation, we used primary human CD4 T cells in small interfering RNA (siRNA)-mediated gene silencing and chromatin immunoprecipitation followed by massive parallel sequencing (ChIP-seq) to identify both the early direct and indirect targets of STAT3. The integrated dataset presented in this study confirms that STAT3 is critical for transcriptional regulation of early human Th17 cell differentiation. Additionally, we found that a number of SNPs from loci associated with immune-mediated disorders were located at sites where STAT3 binds to induce Th17 cell specification. Importantly, introduction of such SNPs alters STAT3 binding in DNA affinity precipitation assays. Overall, our study provides important insights for modulating Th17-mediated pathogenic immune responses in humans.
Defects in the mismatch repair system lead to microsatellite instability (MSI), a feature observed in~15% of all colorectal cancers (CRCs). Microsatellite mutations that drive tumourigenesis, typically inactivation of tumour suppressors, are selected for and are frequently detected in MSI cancers. Here, we evaluated somatic mutations in microsatellite repeats of 790 genes chosen based on reduced expression in MSI CRC and existence of a coding mononucleotide repeat of 6-10 bp in length. All the repeats were initially sequenced in 30 primary MSI CRC samples and whenever frameshift mutations were identified in >20%, additional 70 samples were sequenced. To distinguish driver mutations from passengers, we similarly analyzed the occurrence of frameshift mutations in 121 intronic control repeats and utilized a statistical regression model to determine cutoff mutation frequencies for repeats of all types (A/T and C/G, 6-10 bp). Along with several know target genes, including TGFBR2, ACVR2, and MSH3, six novel candidate driver genes emerged that harbored significantly more mutations than identical control repeats. The mutation frequencies in 100 MSI CRC samples were 51% in G8 of GLYR1, 47% in T9 of ABCC5, 43% in G8 of WDTC1, 33% in A8 of ROCK1, 30% in T8 of OR51E2, and 28% in A8 of TCEB3. Immunohistochemical staining of GLYR1 revealed defective protein expression in tumors carrying biallelic mutations, supporting a loss of function hypothesis. This is a large scale, unbiased effort to identify genes that when mutated are likely to contribute to MSI CRC development.DNA mismatch repair (MMR) system recognizes and removes misincorporations and slippage errors occurring in normal DNA replication. Defects in the MMR system lead to genetic instability referred to as microsatellite instability (MSI). MSI occurs as a consequence of a germline defect and a subsequent somatic inactivation of the wild-type allele of one of the key genes involved in this system, MLH1, MSH2, MSH6, and PMS2. 1 In sporadic CRC, MSI is typically caused
A common single nucleotide polymorphism (SNP), rs6983267, at 8q24.21 has recently been shown to associate with colorectal cancer (CRC). Three independent SNP association studies showed that rs6983267 contributes to CRC with odds ratios (OR) of 1.17 to 1.22. Here, we genotyped a populationbased series of 1,042 patients with CRC and 1,012 healthy controls for rs6983267 and determined the contribution of SNP to CRC in Finland, using germ line DNA, as well as the respective cancer DNA in heterozygous patients. The comprehensive clinical data available from the 1,042 patients and their first-degree relatives enabled us to thoroughly examine the possible association of this variant with different clinical features. As expected, a significant association between the G allele of rs6983267 and CRC [OR, 1.22; 95% confidence interval (CI), 1.08-1.38; P = 0.0018] was found, confirming the previous observations. A trend towards association of the G allele with microsatellite-stable cancer (OR, 1.37; 95% CI, 1.02-1.85; P = 0.04) and family history of cancers other than CRC was seen (OR, 1.20; 95% CI, 1-1.43; P = 0.05). Four hundred and sixty-six GT heterozygotes identified in this study were analyzed for allelic imbalance at rs6983267 in the respective cancer DNA. One hundred and one tumors showed allelic imbalance (22%). The risk allele G was favored in 67 versus 34 tumors (P = 0.0007). This finding implicates that the underlying germ line genetic defect in 8q24.21 is a target in the somatic evolution of CRC.
The circadian clock regulates daily variations in physiologic processes. CLOCK acts as a regulator in the circadian apparatus controlling the expression of other clock genes, including PER1. Clock genes have been implicated in cancer-related functions; in this work, we investigated CLOCK as a possible target of somatic mutations in microsatellite unstable colorectal cancers. Combining microarray gene expression data and public gene sequence information, we identified CLOCK as 1 of 790 putative novel microsatellite instability (MSI) target genes. A total of 101 MSI colorectal carcinomas (CRC) were sequenced for a coding microsatellite in CLOCK. The effect of restoring CLOCK expression was studied in LS180 cells lacking wild-type CLOCK by stably expressing GST-CLOCK or glutathione S-transferase empty vector and testing the effects of UV-induced apoptosis and radiation by DNA content analysis using flow cytometry. Putative novel CLOCK target genes were searched by using ChIP-seq. CLOCK mutations occurred in 53% of MSI CRCs. Restoring CLOCK expression in cells with biallelic CLOCK inactivation resulted in protection against UV-induced apoptosis and decreased G 2 -M arrest in response to ionizing radiation. Using ChIP-Seq, novel CLOCK-binding elements were identified near DNA damage genes p21, NBR1, BRCA1, and RAD50. CLOCK is shown to be mutated in cancer, and altered response to DNA damage provides one plausible mechanism of tumorigenesis. Mol Cancer Res; 8(7); 952-60. ©2010 AACR.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.