Stochastic simulation is a key tool in population genetics, since the models involved are often analytically intractable and simulation is usually the only way of obtaining ground-truth data to evaluate inferences. Because of this, a large number of specialized simulation programs have been developed, each filling a particular niche, but with largely overlapping functionality and a substantial duplication of effort. Here, we introduce msprime version 1.0, which efficiently implements ancestry and mutation simulations based on the succinct tree sequence data structure and the tskit library. We summarize msprime’s many features, and show that its performance is excellent, often many times faster and more memory efficient than specialized alternatives. These high-performance features have been thoroughly tested and validated, and built using a collaborative, open source development model, which reduces duplication of effort and promotes software quality via community engagement.
We reported HIVID (high-throughput Viral Integration Detection), a novel experimental and computational method to detect the location of Hepatitis B Virus (HBV) integration breakpoints in Hepatocellular Carcinoma (HCC) genome. In this method, the fragments with HBV sequence were enriched by a set of HBV probes and then processed to high-throughput sequencing. In order to evaluate the performance of HIVID, we compared the results of HIVID with that of whole genome sequencing method (WGS) in 28 HCC tumors. We detected a total of 246 HBV integration breakpoints in HCC genome, 113 out of which were within 400bp upstream or downstream of 125 breakpoints identified by WGS method, covering 89.3% (125/140) of total breakpoints. The integration was located in the gene TERT, MLL4, and CCNE1. In addition, we discovered 133 novel breakpoints missed by WGS method, with 66.7% (10/15) of validation rate. Our study shows HIVID is a cost-effective methodology with high specificity and sensitivity to identify viral integration in human genome.
Stochastic simulation is a key tool in population genetics, since the models involved are often analytically intractable and simulation is usually the only way of obtaining ground-truth data to evaluate inferences. Because of this necessity, a large number of specialised simulation programs have been developed, each filling a particular niche, but with largely overlapping functionality and a substantial duplication of effort. Here, we introduce msprime version 1.0, which efficiently implements ancestry and mutation simulations based on the succinct tree sequence data structure and tskit library. We summarise msprime's many features, and show that its performance is excellent, often many times faster and more memory efficient than specialised alternatives. These high-performance features have been thoroughly tested and validated, and built using a collaborative, open source development model, which reduces duplication of effort and promotes software quality via community engagement.
Melanisation has been considered to be an important virulence factor of Fonsecaea monophora. However, the biosynthetic mechanisms of melanisation remain unknown. We therefore used next generation sequencing technology to investigate the transcriptome and digital gene expression data, which are valuable resources to better understand the molecular and biological mechanisms regulating melanisation in F. monophora. We performed de novo transcriptome assembly and digital gene expression (DGE) profiling analyses of parent (CBS 122845) and albino (CBS 125194) strains using the Illumina RNA-seq system. A total of 17 352 annotated unigenes were found by BLAST search of NR, Swiss-Prot, Gene Ontology, Clusters of Orthologous Groups and Kyoto Encyclopedia of Genes and Genomes (KEGG) (E-value <1e‒5). A total of 2 283 unigenes were judged to be the differentially expressed between the two genotypes. We identified most of the genes coding for key enzymes involved in melanin biosynthesis pathways, including polyketide synthase (pks), multicopper oxidase (mco), laccase, tyrosinase and homogentisate 1,2-dioxygenase (hmgA). DEG analysis showed extensive down-regulation of key genes in the DHN pathway, while up-regulation was noted in the DOPA pathway of the albino mutant. The transcript levels of partial genes were confirmed by real time RT-PCR, while the crucial role of key enzymes was confirmed by either inhibitor or substrate tests in vitro. Meanwhile, numbers of genes involved in light sensing, cell wall synthesis, morphology and environmental stress were identified in the transcriptome of F. monophora. In addition, 3 353 SSRs (Simple Sequence Repeats) markers were identified from 21 600 consensus sequences. Blocking of the DNH pathway is the most likely reason of melanin deficiency in the albino strain, while the production of pheomelanin and pyomelanin were probably regulated by unknown transcription factors on upstream of both pathways. Most of genes involved in environmental tolerance to oxidants, irradiation and extreme temperatures were also assembled and annotated in transcriptomes of F. monophora. In addition, thousands of identified cSSR (combined SSR) markers will favour further genetic linkage studies. In conclusion, these data will contribute to understanding the regulation of melanin biosynthesis and help to improve the studies of pathogenicity of F. monophora.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.