Quality control (QC) and preprocessing are essential steps for sequencing data analysis to ensure the accuracy of results. However, existing tools cannot provide a satisfying solution with integrated comprehensive functions, proper architectures, and highly scalable acceleration. In this article, we demonstrate SOAPnuke as a tool with abundant functions for a “QC-Preprocess-QC” workflow and MapReduce acceleration framework. Four modules with different preprocessing functions are designed for processing datasets from genomic, small RNA, Digital Gene Expression, and metagenomic experiments, respectively. As a workflow-like tool, SOAPnuke centralizes processing functions into 1 executable and predefines their order to avoid the necessity of reformatting different files when switching tools. Furthermore, the MapReduce framework enables large scalability to distribute all the processing works to an entire compute cluster.We conducted a benchmarking where SOAPnuke and other tools are used to preprocess a ∼30× NA12878 dataset published by GIAB. The standalone operation of SOAPnuke struck a balance between resource occupancy and performance. When accelerated on 16 working nodes with MapReduce, SOAPnuke achieved ∼5.7 times the fastest speed of other tools.
Species of the Bos genus, including taurine cattle, zebu, gayal, gaur, banteng, yak, wisent and bison, have been domesticated at least four times and have been an important source of meat, milk and power for many human cultures. We sequence the genomes of gayal, gaur, banteng, wisent and bison, and provide population genomic sequencing of an additional 98 individuals. We use these data to determine the phylogeny and evolutionary history of these species and show that the threatened gayal is an independent species or subspecies. We show that there has been pronounced introgression among different members of this genus, and that it in many cases has involved genes of considerable adaptive importance. For example, genes under domestication selection in cattle (for example, MITF) were introgressed from domestic cattle to yak. Also, genes in the response-to-hypoxia pathway (for example, EGLN1, EGLN2 and HIF3a) have been introgressed from yak to Tibetan cattle, probably facilitating their adaptation to high altitude. We also validate that there is an association between the introgressed EGLN1 allele and haemoglobin and red blood cell concentration. Our results illustrate the importance of introgression as a source of adaptive variation and during domestication, and suggest that the Bos genus evolves as a complex of genetically interconnected species with shared evolutionary trajectories.
Determining the composition and function of subgingival dental plaque is crucial to understanding human periodontal health and disease, but it is challenging because of the complexity of the interactions between human microbiomes and human body. Here, we examined the phylogenetic and functional gene differences between periodontal and healthy individuals using MiSeq sequencing of 16S rRNA gene amplicons and a specific functional gene array (a combination of GeoChip 4.0 for biogeochemical processes and HuMiChip 1.0 for human microbiomes). Our analyses indicated that the phylogenetic and functional gene structure of the oral microbiomes were distinctly different between periodontal and healthy groups. Also, 16S rRNA gene sequencing analysis indicated that 39 genera were significantly different between healthy and periodontitis groups, and Fusobacterium, Porphyromonas, Treponema, Filifactor, Eubacterium, Tannerella, Hallella, Parvimonas, Peptostreptococcus and Catonella showed higher relative abundances in the periodontitis group. In addition, functional gene array data showed that a lower gene number but higher signal intensity of major genes existed in periodontitis, and a variety of genes involved in virulence factors, amino acid metabolism and glycosaminoglycan and pyrimidine degradation were enriched in periodontitis, suggesting their potential importance in periodontal pathogenesis. However, the genes involved in amino acid synthesis and pyrimidine synthesis exhibited a significantly lower relative abundance compared with healthy group. Overall, this study provides new insights into our understanding of phylogenetic and functional gene structure of subgingival microbial communities of periodontal patients and their importance in pathogenesis of periodontitis.
Active DNA demethylation in plants occurs through base excision repair, beginning with removal of methylated cytosine by the ROS1/DME subfamily of 5-methylcytosine DNA glycosylases. Active DNA demethylation in animals requires the DNA glycosylase TDG or MBD4, which functions after oxidation or deamination of 5-methylcytosine, respectively. However, little is known about the steps following DNA glycosylase action in the active DNA demethylation pathways in plants and animals. We show here that the Arabidopsis APE1L protein has apurinic/apyrimidinic endonuclease activities and functions downstream of ROS1 and DME. APE1L and ROS1 interact in vitro and co-localize in vivo. Whole genome bisulfite sequencing of ape1l mutant plants revealed widespread alterations in DNA methylation. We show that the ape1l/zdp double mutant displays embryonic lethality. Notably, the ape1l+/−zdp−/− mutant shows a maternal-effect lethality phenotype. APE1L and the DNA phosphatase ZDP are required for FWA and MEA gene imprinting in the endosperm and are important for seed development. Thus, APE1L is a new component of the active DNA demethylation pathway and, together with ZDP, regulates gene imprinting in Arabidopsis.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.