Here we analyse genetic variation, population structure and diversity among 3,010 diverse Asian cultivated rice (Oryza sativa L.) genomes from the 3,000 Rice Genomes Project. Our results are consistent with the five major groups previously recognized, but also suggest several unreported subpopulations that correlate with geographic location. We identified 29 million single nucleotide polymorphisms, 2.4 million small indels and over 90,000 structural variations that contribute to within-and between-population variation. Using pan-genome analyses, we identified more than 10,000 novel full-length protein-coding genes and a high number of presence-absence variations. The complex patterns of introgression observed in domestication genes are consistent with multiple independent rice domestication events. The public availability of data from the 3,000 Rice Genomes Project provides a resource for rice genomics research and breeding.Asian cultivated rice is grown worldwide and comprises the staple food for half of the global population. It is envisaged that by the year 2035 1 feeding this growing population will necessitate that an additional 112 million metric tons of rice be produced on a smaller area of land, using less water and under more fluctuating climatic conditions, which will require that future rice cultivars be higher yielding and resilient to multiple abiotic and biotic stresses. The foundation of the continued improvement of rice cultivars is the rich genetic diversity within domesticated populations and wild relatives [2][3][4] . For over 2,000 years, two major types of O. sativa-O. sativa Xian group (here referred to as Xian/Indica (XI) and also known as , Hsien or Indica) and O. sativa Geng Group (here referred to as Geng/Japonica (GJ) and also known as , Keng or Japonica)-have historically been recognized [5][6][7] . Varied degrees of post-reproductive barriers exist between XI and GJ rice accessions 8 ; this differentiation between XI and GJ rice types and the presence of different varietal groups are well-documented at isozyme and DNA levels 6,9 . Two other distinct groups have also been recognized using molecular markers 10 ; one of these encompasses the Aus, Boro and Rayada ecotypes from Bangladesh and India (which we term the circum-Aus group (cA)) and the other comprises the famous Basmati and Sadri aromatic varieties (which we term the circum-Basmati group (cB)).Approximately 780,000 rice accessions are available in gene banks worldwide 11 . To enable the more efficient use of these accessions in future rice improvement, the Chinese Academy of Agricultural Sciences, BGI-Shenzhen and International Rice Research Institute sequenced over 3,000 rice genomes (3K-RG) as part of the 3,000 Rice Genomes Project 12. Here we present analyses of genetic variation in the 3K-RG that focus on important aspects of O. sativa diversity, single nucleotide polymorphisms (SNPs) and structural variation (deletions, duplications, inversions and translocations). We also construct a species pangenome consisting of 'core...
The soil nematodes Caenorhabditis briggsae and Caenorhabditis elegans diverged from a common ancestor roughly 100 million years ago and yet are almost indistinguishable by eye. They have the same chromosome number and genome sizes, and they occupy the same ecological niche. To explore the basis for this striking conservation of structure and function, we have sequenced the C. briggsae genome to a high-quality draft stage and compared it to the finished C. elegans sequence. We predict approximately 19,500 protein-coding genes in the C. briggsae genome, roughly the same as in C. elegans. Of these, 12,200 have clear C. elegans orthologs, a further 6,500 have one or more clearly detectable C. elegans homologs, and approximately 800 C. briggsae genes have no detectable matches in C. elegans. Almost all of the noncoding RNAs (ncRNAs) known are shared between the two species. The two genomes exhibit extensive colinearity, and the rate of divergence appears to be higher in the chromosomal arms than in the centers. Operons, a distinctive feature of C. elegans, are highly conserved in C. briggsae, with the arrangement of genes being preserved in 96% of cases. The difference in size between the C. briggsae (estimated at approximately 104 Mbp) and C. elegans (100.3 Mbp) genomes is almost entirely due to repetitive sequence, which accounts for 22.4% of the C. briggsae genome in contrast to 16.5% of the C. elegans genome. Few, if any, repeat families are shared, suggesting that most were acquired after the two species diverged or are undergoing rapid evolution. Coclustering the C. elegans and C. briggsae proteins reveals 2,169 protein families of two or more members. Most of these are shared between the two species, but some appear to be expanding or contracting, and there seem to be as many as several hundred novel C. briggsae gene families. The C. briggsae draft sequence will greatly improve the annotation of the C. elegans genome. Based on similarity to C. briggsae, we found strong evidence for 1,300 new C. elegans genes. In addition, comparisons of the two genomes will help to understand the evolutionary forces that mold nematode genomes.
Both genetic variations and diet-disrupted gut microbiota can predispose animals to metabolic syndromes (MS). This study assessed the relative contributions of host genetics and diet in shaping the gut microbiota and modulating MS-relevant phenotypes in mice. Together with its wild-type (Wt) counterpart, the Apoa-I knockout mouse, which has impaired glucose tolerance (IGT) and increased body fat, was fed a high-fat diet (HFD) or normal chow (NC) diet for 25 weeks. DNA fingerprinting and bar-coded pyrosequencing of 16S rRNA genes were used to profile gut microbiota structures and to identify the key population changes relevant to MS development by Partial Least Square Discriminate Analysis. Diet changes explained 57% of the total structural variation in gut microbiota, whereas genetic mutation accounted for no more than 12%. All three groups with IGT had significantly different gut microbiota relative to healthy Wt/NC-fed animals. In all, 65 species-level phylotypes were identified as key members with differential responses to changes in diet, genotype and MS phenotype. Most notably, gut barrier-protecting Bifidobacterium spp. were nearly absent in all animals on HFD, regardless of genotype. Sulphate-reducing, endotoxin-producing bacteria of the family, Desulfovibrionaceae, were enhanced in all animals with IGT, most significantly in the Wt/HFD group, which had the highest calorie intake and the most serious MS phenotypes. Thus, diet has a dominating role in shaping gut microbiota and changes of some key populations may transform the gut microbiota of Wt animals into a pathogen-like entity relevant to development of MS, despite a complete host genome.
Oral microbiota plays a vital role in maintaining the homeostasis of oral cavity. Dental caries are among the most common oral diseases in children and pathogenic bacteria contribute to the development of the disease. However, the overall structure of bacterial communities in the oral cavity from children with dental caries has not been explored deeply heretofore. We used high-throughput barcoded pyrosequencing and PCR-denaturing gradient gel electrophoresis (DGGE) to examine bacterial diversity of oral microbiota in saliva and supragingival plaques from 60 children aged 3 to 6 years old with and without dental caries from China. The multiplex barcoded pyrosequencing was performed in a single run, with multiple samples tagged uniquely by multiplex identifiers. As PCR-DGGE analysis is a conventional molecular ecological approach, this analysis was also performed on the same samples and the results of both approaches were compared. A total of 186,787 high-quality sequences were obtained for evaluating bacterial diversity and 41,905 unique sequences represented all phylotypes. We found that the oral microbiota in children was far more diverse than previous studies reported, and more than 200 genera belonging to ten phyla were found in the oral cavity. The phylotypes in saliva and supragingival plaques were significantly different and could be divided into two distinct clusters (p < 0.05). The bacterial diversity in oral microbiome analyzed by PCR-DGGE and barcoded pyrosequencing was employed to cross validate the data sets. The genera of Streptococcus, Veillonella, Actinomyces, Granulicatella, Leptotrichia, and Thiomonas in plaques were significantly associated with dental caries (p < 0.05). The results showed that there was no one specific pathogen but rather pathogenic populations in plaque that significantly correlated with dental caries. The enormous diversity of oral microbiota allowed for a better understanding of oral microecosystem, and these pathogenic populations in plaque provide new insights into the etiology of dental caries and suggest new targets for interventions of the disease.
A pan-genome is the union of the gene sets of all the individuals of a clade or a species and it provides a new dimension of genome complexity with the presence/absence variations (PAVs) of genes among these genomes. With the progress of sequencing technologies, pan-genome study is becoming affordable for eukaryotes with large-sized genomes. The Asian cultivated rice, Oryza sativa L., is one of the major food sources for the world and a model organism in plant biology. Recently, the 3000 Rice Genome Project (3K RGP) sequenced more than 3000 rice genomes with a mean sequencing depth of 14.3×, which provided a tremendous resource for rice research. In this paper, we present a genome browser, Rice Pan-genome Browser (RPAN), as a tool to search and visualize the rice pan-genome derived from 3K RGP. RPAN contains a database of the basic information of 3010 rice accessions, including genomic sequences, gene annotations, PAV information and gene expression data of the rice pan-genome. At least 12 000 novel genes absent in the reference genome were included. RPAN also provides multiple search and visualization functions. RPAN can be a rich resource for rice biology and rice breeding. It is available at http://cgm.sjtu.edu.cn/3kricedb/ or http://www.rmbreeding.cn/pan3k.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.