BackgroundWhole genome sequencing (WGS) is becoming increasingly prevalent for molecular diagnosis, staging and prognosis because of its declining costs and the ability to detect nearly all genes associated with a patient’s disease. The currently widely accepted variant calling pipeline, GATK, is limited in terms of its computational speed and efficiency, which cannot meet the growing analysis needs.MethodsIn this study, we propose a fast and accurate DNASeq variant calling workflow that is purely composed of tools from LUSH toolkit. The LUSH pipeline is highly optimized for the WGS pipeline based on SOAPnuke, BWA and GATK which can be deployed on any general-purpose CPU-based computing system. We validated the accuracy, speed and scalability of the LUSH pipeline on several standard WGS datasets.ResultsOur test results show that the LUSH pipeline and the GATK pipeline are highly consistent in terms of accuracy, achieving over 99% precision and recall on NA12878. For speed, the LUSH pipeline completes 30x WGS data in 1.6 hours, which is about 17x faster than the GATK pipeline. From BAM to VCF, LUSH_HC even takes only 12 minutes, about 76x faster than GATK. Moreover, the LUSH pipeline shows favorable scalability in terms of thread and sequencing depth.ConclusionThe LUSH pipeline provides far superior computational speed to GATK while maintaining a high level of accuracy comparable to that of GATK, which greatly facilitates bedside analysis of acute patients, large-scale cohort data analysis, and variant calling in crop breeding programs.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.