Although gut microbiome dysbiosis has been illustrated in celiac disease (CD), there are disagreements about what constitutes these microbial signatures and the timeline by which they precede diagnosis is largely unknown. The study of high-genetic-risk patients or those already with CD limits our knowledge of dysbiosis that may occur early in life in a generalized population. To explore early gut microbial imbalances correlated with future celiac disease (fCD), we analyzed the stool of 1478 infants aged one year, 26 of whom later acquired CD, with a mean age of diagnosis of 10.96 ± 5.6 years. With a novel iterative control-matching algorithm using the prospective general population cohort, All Babies In Southeast Sweden, we found nine core microbes with prevalence differences and seven differentially abundant bacteria between fCD infants and controls. The differences were validated using 100 separate, iterative permutations of matched controls, which suggests the bacterial signatures are significant in fCD even when accounting for the inherent variability in a general population. This work is the first to our knowledge to demonstrate that gut microbial differences in prevalence and abundance exist in infants aged one year up to 19 years before a diagnosis of CD in a general population.
Despite the advent of third-generation sequencing technologies, modern bacterial ecology studies still use Illumina to sequence small (~400 bp) hypervariable regions of the 16S rRNA SSU for phylogenetic classification. By sequencing a larger region of the rRNA gene operons, the limitations and biases of sequencing small portions can be removed, allowing for more accurate classification with deeper taxonomic resolution. With Nanopore sequencing now providing raw simplex reads with quality scores above Q20 using the kit 12 chemistry, the ease, cost, and portability of Nanopore play a leading role in performing differential bacterial abundance analysis. Sequencing the near-entire rrn operon of bacteria and archaea enables the use of the universally conserved operon holding evolutionary polymorphisms for taxonomic resolution. Here, a reproducible and validated pipeline was developed, RRN-operon Enabled Species-level Classification Using EMU (RESCUE), to facilitate the sequencing of bacterial rrn operons and to support import into phyloseq. Benchmarking RESCUE showed that fully processed reads are now parallel or exceed the quality of Sanger, with median quality scores of approximately Q20+, using the R10.4 and Guppy SUP basecalling. The pipeline was validated through two complex mock samples, the use of multiple sample types, with actual Illumina data, and across four databases. RESCUE sequencing is shown to drastically improve classification to the species level for most taxa and resolves erroneous taxa caused by using short reads such as Illumina.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.