Ancient retroposon insertions can be used as virtually homoplasy-free markers to reconstruct the phylogenetic history of species. Inherited, orthologous insertions in related species offer reliable signals of a common origin of the given species. One prerequisite for such a phylogenetically informative insertion is that the inserted element was fixed in the ancestral population before speciation; if not, polymorphically inserted elements may lead to random distributions of presence/absence states during speciation and possibly to apparently conflicting reconstructions of their ancestry. Fortunately, such misleading fixed cases are relatively rare, but they nevertheless need to be considered. Here, we present novel, comprehensive statistical models applicable to (1) analyzing any pattern of rare genomic changes, (2) testing and differentiating conflicting phylogenetic reconstructions based on rare genomic changes caused by incomplete lineage sorting and/or ancestral hybridization, and (3) differentiating between search strategies involving genome information from one or several lineages. When the new statistics are applied in non-conflicting cases, a minimum of three elements present in both of two species and absent in a third group is considered significant support (p < 0.05) for the branching of the third from the other two, provided that all three of the given species are screened equally for genome or experimental data. Five elements are necessary for significant support (p < 0.05) if a diagnostic locus derived from only one of the three species is screened and no conflicting markers are detected. Most potentially conflicting patterns can be evaluated for their significance, and ancestral hybridization can be distinguished from incomplete lineage sorting by considering the symmetric or asymmetric distribution of rare genomic changes among the possible tree configurations.
Additionally, we provide an R-application to make the new KKSC insertion significance test available for the scientific community at http://retrogenomics.uni-muenster.de:3838/KKSC_significance_test/.
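The thresholds of three and five markers quoted above can be illustrated with a deliberately simplified null model. The sketch below is not the KKSC test itself. It assumes (our assumption, not stated in the abstract) that under the null each marker independently supports any of the detectable tree topologies with equal probability: three topologies when all three species are screened, and two when loci are derived from only one species. The function names `p_all_agree` and `min_markers` are hypothetical.

```python
from fractions import Fraction


def p_all_agree(n_markers: int, n_detectable_trees: int) -> Fraction:
    """Probability that n_markers all support one given topology when each
    marker picks among n_detectable_trees topologies uniformly at random
    (simplified null model, assumed for illustration only)."""
    return Fraction(1, n_detectable_trees) ** n_markers


def min_markers(n_detectable_trees: int, alpha: Fraction = Fraction(1, 20)) -> int:
    """Smallest number of non-conflicting markers whose probability under
    the simplified null falls below alpha (default 0.05)."""
    n = 1
    while p_all_agree(n, n_detectable_trees) >= alpha:
        n += 1
    return n


# All three species screened equally: three topologies are detectable.
print(min_markers(3))  # 3, since (1/3)**3 = 1/27, about 0.037

# Loci derived from one species only: two topologies are detectable.
print(min_markers(2))  # 5, since (1/2)**5 = 1/32, about 0.031
```

Under these assumptions the sketch reproduces the two thresholds given in the abstract. Conflicting markers and the distinction between incomplete lineage sorting and hybridization require the full models and are not covered here.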
Ribosome profiling revealed widespread translational activity at upstream open reading frames (uORFs) and validated uORF-mediated translational control as a commonly repressive mechanism of gene expression. Translational activation of proto-oncogenes through loss-of-uORF mutations has been demonstrated, yet a systematic search for cancer-associated genetic alterations in uORFs is lacking. Here, we applied a PCR-based, multiplex identifier-tagged deep sequencing approach to screen 404 uORF translation initiation sites of 83 human tyrosine kinases and 49 other proto-oncogenes in 308 human malignancies. We identified loss-of-function uORF mutations in EPHB1 in two samples derived from breast and colon cancer, and in MAP2K6 in a sample of colon adenocarcinoma. Both mutations were associated with enhanced translation, suggesting that loss-of-uORF-mediated translational induction of the downstream main protein coding sequence may have contributed to carcinogenesis. Computational analysis of whole exome sequencing datasets of 464 colon adenocarcinomas subsequently revealed another 53 non-recurrent somatic mutations functionally deleting 22 uORF initiation and 31 uORF termination codons, respectively. These data provide evidence for somatic mutations affecting uORF initiation and termination codons in human cancer. The insufficient coverage of uORF regions in current whole exome sequencing datasets calls for future genome-wide analyses to ultimately define the contribution of uORF-mediated translational deregulation to oncogenesis.
Background: The rapid progress of third-generation long-read sequencing technologies will soon bring the biological and medical sciences into a new era of research. The technique and experimental procedures are becoming more straightforward and available to biologists from diverse fields, even those without profound experience in DNA sequencing. Thus, the introduction of the MinION device by Oxford Nanopore Technologies promises to “bring sequencing technology to the masses” and also allows quick, operative analysis in field studies. However, the convenience of this sequencing technology contrasts dramatically with the available analysis tools, which may significantly reduce the enthusiasm of a “regular” user. To really bring the sequencing technology to every biologist, we need a set of user-friendly tools that can perform a powerful analysis automatically. Findings: NanoPipe was developed with the specifics of the MinION sequencing technology in mind and provides correspondingly adjusted alignment parameters. The range of target species/sequences for the alignment is not limited, and the descriptive usage page of NanoPipe helps users succeed with their analysis. The results contain alignment statistics, a consensus sequence, polymorphism data, and a visualization of the alignment. Several test cases are used to demonstrate the efficiency of the tool. Conclusions: The freely available NanoPipe software allows effortless and reliable analysis of MinION sequencing data for experienced bioinformaticians as well as for wet-lab biologists with minimal bioinformatics knowledge. Moreover, for the latter group, we describe the basic algorithm necessary for MinION sequencing analysis from the first to the last step.
Amoebozoans are in many respects interesting research objects, as they combine features of single-cell organisms with complex signaling and defense systems comparable to those of multicellular organisms. Acanthamoeba castellanii is a cosmopolitan species that has developed diverse feeding abilities and strong anti-bacterial resistance; Entamoeba histolytica is a parasitic amoeba that underwent massive gene loss and whose genome is nearly half the size of that of A. castellanii. Nevertheless, both species prosper, demonstrating fitness to their specific environments. Here we compare the transcriptomes of A. castellanii and E. histolytica using ortholog searches and gene ontology to learn how different life strategies influence genome evolution and the restructuring of physiology. A. castellanii demonstrates great metabolic activity and plasticity, while E. histolytica reveals several interesting features in its translational machinery, cytoskeleton, antioxidant protection, and nutritional behavior. In addition, we suggest new features in E. histolytica physiology that may explain its successful colonization of the human colon and may facilitate medical research.
Upstream open reading frame (uORF)-mediated translational control has emerged as an important regulatory mechanism in human health and disease. However, a systematic search for cancer-associated somatic uORF mutations has not been performed. Here, we analyzed the genetic variability at canonical (uAUG) and alternative translational initiation sites (aTISs), as well as the associated upstream termination codons (uStops) in 3394 whole-exome-sequencing datasets from patient samples of breast, colon, lung, prostate, and skin cancer and of acute myeloid leukemia, provided by The Cancer Genome Atlas research network. We found that 66.5% of patient samples were affected by at least one of 5277 recurrent uORF-associated somatic single nucleotide variants altering 446 uAUG, 347 uStop, and 4733 aTIS codons. While 12 uORF variants were detected in all entities, 17 variants occurred in all five types of solid cancer analyzed here. The highest frequencies of individual somatic variants in the transcript leader sequences (TLSs) of NBPF20 and CHCHD2 reached 10.1% among acute myeloid leukemia (LAML) patients and 8.1% among skin cancer patients, respectively. Functional evaluation by dual luciferase reporter assays identified 19 uORF variants causing significant translational deregulation of the associated main coding sequence, ranging from 1.73-fold induction for an AUG.1 > UUG variant in SETD4 to 0.006-fold repression for a CUG.6 > GUG variant in HLA-DRB1. These data suggest that somatic uORF mutations are highly prevalent in human malignancies and that defective translational regulation of protein expression may contribute to the onset or progression of cancer.
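As a rough illustration of what "functionally deleting" a uORF initiation or termination codon means at the sequence level, the following minimal sketch classifies a single codon change. It is not the authors' analysis pipeline. The function name `classify_uorf_codon_change` and its labels are hypothetical, and the sketch assumes that any change away from AUG counts as a uAUG loss (even though near-cognate codons such as UUG may retain some initiation activity). Alternative initiation sites (aTISs) are not handled.

```python
from typing import Optional

# Standard stop codons in RNA notation.
STOP_CODONS = {"UAA", "UAG", "UGA"}


def classify_uorf_codon_change(ref: str, alt: str) -> Optional[str]:
    """Classify a codon substitution at a uORF boundary.

    Returns "uAUG loss" if a canonical start codon is destroyed,
    "uStop loss" if a stop codon is destroyed, and None otherwise.
    DNA input (T) is accepted and converted to RNA notation (U).
    """
    ref = ref.upper().replace("T", "U")
    alt = alt.upper().replace("T", "U")
    if len(ref) != 3 or len(alt) != 3:
        raise ValueError("codons must be exactly three nucleotides long")
    if ref == "AUG" and alt != "AUG":
        return "uAUG loss"
    if ref in STOP_CODONS and alt not in STOP_CODONS:
        return "uStop loss"
    return None


# AUG > UUG, the substitution type reported above for SETD4.
print(classify_uorf_codon_change("AUG", "UUG"))  # uAUG loss
# A stop codon changed into a sense codon.
print(classify_uorf_codon_change("TAG", "TGG"))  # uStop loss
# One stop codon replaced by another: the uORF still terminates.
print(classify_uorf_codon_change("UAA", "UAG"))  # None
```

A real screen would additionally need transcript annotation, strand handling, and reading-frame information, none of which is modeled here.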