Next-generation sequencing (NGS) technology has shown promise for the detection of human pathogens from clinical samples. However, one of the major obstacles to the use of NGS in diagnostic microbiology is the low ratio of pathogen DNA to human DNA in most clinical specimens. In this study, we aimed to develop a specimen-processing protocol to remove human DNA and enrich specimens for bacterial and viral DNA for shotgun metagenomic sequencing. Cerebrospinal fluid (CSF) and nasopharyngeal aspirate (NPA) specimens, spiked with control bacterial and viral pathogens, were processed using either a commercially available kit (MolYsis) or various detergents followed by DNase prior to the extraction of DNA. Relative quantities of human DNA and pathogen DNA were determined by real-time PCR. The MolYsis kit did not improve the pathogen-to-human DNA ratio, but significant reductions (>95%; P < 0.001) in human DNA with minimal effect on pathogen DNA were achieved in samples that were treated with 0.025% saponin, a nonionic surfactant. Specimen preprocessing significantly decreased NGS reads mapped to the human genome (P < 0.05) and improved the sensitivity of pathogen detection (P < 0.01), with a 20-to 650-fold increase in the ratio of microbial reads to human reads. Preprocessing also permitted the detection of pathogens that were undetectable in the unprocessed samples. Our results demonstrate a simple method for the reduction of background human DNA for metagenomic detection for a broad range of pathogens in clinical samples.
Clinical microbiology is one of the most rapidly changing areas of laboratory medicine today due to the introduction of new technologies and automation (1). Molecular testing, such as PCR, is becoming the de facto gold standard for the detection of pathogens that are difficult to culture by offering high sensitivity and specificity and a rapid turnaround time (2). Syndrome-based multiplex molecular assays can detect up to 30 of the most common pathogens associated with respiratory infections, gastroenteritis, and central nervous system (CNS) infections (3-6). However, the complete list of infectious agents associated with these infections greatly exceeds the capabilities of even the best multiplex assays. These less common organisms, for which tests are not readily available, are likely responsible for many cases of undiagnosed illness, particularly in critically ill patients and those with compromised immunity (7,8). Therefore, there is increasing interest in the application of novel technologies, such as next-generation sequencing (NGS), for unbiased detection of pathogens in clinical samples.Among the various challenges with the implementation of NGS for routine pathogen detection using metagenomics, the presence of an overwhelming amount of host DNA is one of the most important problems to be addressed. A previous metagenomic study on nasopharyngeal aspirate samples from patients with acute lower respiratory tract infections revealed that up to ϳ95% of raw NGS reads were of human DNA (9). The su...