A major challenge in the field of shotgun metagenomics is the accurate identification
of organisms present within a microbial community, based on classification of short
sequence reads. Though existing microbial community profiling methods have attempted
to rapidly classify the millions of reads output from modern sequencers, the
combination of incomplete databases, similarity among otherwise divergent genomes,
errors and biases in sequencing technologies, and the large volumes of sequencing
data required for metagenome sequencing has led to unacceptably high false discovery
rates (FDR). Here, we present the application of a novel, gene-independent and
signature-based metagenomic taxonomic profiling method with significantly and
consistently smaller FDR than any other available method. Our algorithm circumvents
false positives using a series of non-redundant signature databases and examines
Genomic Origins
Through Taxonomic
CHAllenge (GOTTCHA). GOTTCHA was tested and validated on 20
synthetic and mock datasets ranging in community composition and complexity, was
applied successfully to data generated from spiked environmental and clinical
samples, and robustly demonstrates superior performance compared with other available
tools.
Continued advancements in sequencing technologies have fueled the development of new sequencing applications and promise to flood current databases with raw data. A number of factors prevent the seamless and easy use of these data, including the breadth of project goals, the wide array of tools that individually perform fractions of any given analysis, the large number of associated software/hardware dependencies, and the detailed expertise required to perform these analyses. To address these issues, we have developed an intuitive web-based environment with a wide assortment of integrated and cutting-edge bioinformatics tools in pre-configured workflows. These workflows, coupled with the ease of use of the environment, provide even novice next-generation sequencing users with the ability to perform many complex analyses with only a few mouse clicks and, within the context of the same environment, to visualize and further interrogate their results. This bioinformatics platform is an initial attempt at Empowering the Development of Genomics Expertise (EDGE) in a wide range of applications for microbial research.
The genus Burkholderia encompasses both pathogenic (including Burkholderia mallei and Burkholderia pseudomallei, U.S. Centers for Disease Control and Prevention Category B listed), and nonpathogenic Gram-negative bacilli. Here we present full genome sequences for a panel of 59 Burkholderia strains, selected to aid in detection assay development.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.