This study describes and validates a new method for metagenomic biomarker discovery by way of class comparison, tests of biological consistency and effect size estimation. This addresses the challenge of finding organisms, genes, or pathways that consistently explain the differences between two or more microbial communities, which is a central problem to the study of metagenomics. We extensively validate our method on several microbiomes and a convenient online interface for the method is provided at http://huttenhower.sph.harvard.edu/lefse/.
Studies of the human microbiome have revealed that even healthy individuals differ remarkably in the microbes that occupy habitats such as the gut, skin, and vagina. Much of this diversity remains unexplained, although diet, environment, host genetics, and early microbial exposure have all been implicated. Accordingly, to characterize the ecology of human-associated microbial communities, the Human Microbiome Project has analyzed the largest cohort and set of distinct, clinically relevant body habitats to date. We found the diversity and abundance of each habitat’s signature microbes to vary widely even among healthy subjects, with strong niche specialization both within and among individuals. The project encountered an estimated 81–99% of the genera, enzyme families, and community configurations occupied by the healthy Western microbiome. Metagenomic carriage of metabolic pathways was stable among individuals despite variation in community structure, and ethnic/racial background proved to be one of the strongest associations of both pathways and microbes with clinical metadata. These results thus delineate the range of structural and functional configurations normal in the microbial communities of a healthy population, enabling future characterization of the epidemiology, ecology, and translational applications of the human microbiome.
The human oral cavity contains a number of different habitats, including the teeth, gingival sulcus, tongue, cheeks, hard and soft palates, and tonsils, which are colonized by bacteria. The oral microbiome is comprised of over 600 prevalent taxa at the species level, with distinct subsets predominating at different habitats. The oral microbiome has been extensively characterized by cultivation and culture-independent molecular methods such as 16S rRNA cloning. Unfortunately, the vast majority of unnamed oral taxa are referenced by clone numbers or 16S rRNA GenBank accession numbers, often without taxonomic anchors. The first aim of this research was to collect 16S rRNA gene sequences into a curated phylogeny-based database, the Human Oral Microbiome Database (HOMD), and make it web accessible (www.homd.org). The HOMD includes 619 taxa in 13 phyla, as follows: Actinobacteria, Bacteroidetes, Chlamydiae, Chloroflexi, Euryarchaeota, Firmicutes, Fusobacteria, Proteobacteria, Spirochaetes, SR1, Synergistetes, Tenericutes, and TM7. The second aim was to analyze 36,043 16S rRNA gene clones isolated from studies of the oral microbiota to determine the relative abundance of taxa and identify novel candidate taxa. The analysis identified 1,179 taxa, of which 24% were named, 8% were cultivated but unnamed, and 68% were uncultivated phylotypes. Upon validation, 434 novel, nonsingleton taxa will be added to the HOMD. The number of taxa needed to account for 90%, 95%, or 99% of the clones examined is 259, 413, and 875, respectively. The HOMD is the first curated description of a human-associated microbiome and provides tools for use in understanding the role of the microbiome in health and disease.
A variety of microbial communities and their genes (microbiome) exist throughout the human body, playing fundamental roles in human health and disease. The NIH funded Human Microbiome Project (HMP) Consortium has established a population-scale framework which catalyzed significant development of metagenomic protocols resulting in a broad range of quality-controlled resources and data including standardized methods for creating, processing and interpreting distinct types of high-throughput metagenomic data available to the scientific community. Here we present resources from a population of 242 healthy adults sampled at 15 to 18 body sites up to three times, which to date, have generated 5,177 microbial taxonomic profiles from 16S rRNA genes and over 3.5 Tb of metagenomic sequence. In parallel, approximately 800 human-associated reference genomes have been sequenced. Collectively, these data represent the largest resource to date describing the abundance and variety of the human microbiome, while providing a platform for current and future studies.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.