Rice, one of the world's most important food plants, has important syntenic relationships with the other cereal species and is a model plant for the grasses. Here we present a map-based, finished quality sequence that covers 95% of the 389 Mb genome, including virtually all of the euchromatin and two complete centromeres. A total of 37,544 nontransposable-element-related protein-coding genes were identified, of which 71% had a putative homologue in Arabidopsis. In a reciprocal analysis, 90% of the Arabidopsis proteins had a putative homologue in the predicted rice proteome. Twenty-nine per cent of the 37,544 predicted genes appear in clustered gene families. The number and classes of transposable elements found in the rice genome are consistent with the expansion of syntenic regions in the maize and sorghum genomes. We find evidence for widespread and recurrent gene transfer from the organelles to the nuclear chromosomes. The map-based sequence has proven useful for the identification of genes underlying agronomic traits. The additional single-nucleotide polymorphisms and simple sequence repeats identified in our study should accelerate improvements in rice production.
Background: Rice is an important staple food and, with the smallest cereal genome, serves as a reference species for studies on the evolution of cereals and other grasses. Therefore, decoding its entire genome will be a prerequisite for applied and basic research on this species and all other cereals.
The high-quality rice genome sequence is serving as a reference for comparative genome analysis in crop plants, especially cereals. However, early comparisons with bread wheat showed complex patterns of conserved synteny (gene content) and colinearity (gene order). Here, we show the presence of ancient duplicated segments in the progenitor of wheat, which were first identified in the rice genome. We also show that single-copy (SC) rice genes, those representing unique matches with wheat expressed sequence tag (EST) unigene contigs in the whole rice genome, show more than twice the proportion of genes mapping to syntenic wheat chromosome as compared to the multicopy (MC) or duplicated rice genes. While 58.7% of the 1,244 mapped SC rice genes were located in single syntenic wheat chromosome groups, the remaining 41.3% were distributed randomly to the other six non-syntenic wheat groups. This could only be explained by a background dispersal of genes in the genome through transposition or other unknown mechanism. The breakdown of rice-wheat synteny due to such transpositions was much greater near the wheat centromeres. Furthermore, the SC rice genes revealed a conserved primordial gene order that gives clues to the origin of rice and wheat chromosomes from a common ancestor through polyploidy, aneuploidy, centromeric fusions, and translocations. Apart from the bin-mapped wheat EST contigs, we also compared 56,298 predicted rice genes with 39,813 wheat EST contigs assembled from 409,765 EST sequences and identified 7,241 SC rice gene homologs of wheat. Based on the conserved colinearity of 1,063 mapped SC rice genes across the bins of individual wheat chromosomes, we predicted the wheat bin location of 6,178 unmapped SC rice gene homologs and validated the location of 213 of these in the telomeric bins of 21 wheat chromosomes with 35.4% initial success. This opens up the possibility of directed mapping of a large number of conserved SC rice gene homologs in wheat. Overall, only 46.4% of these SC genes code for proteins with known functional domains; the remaining 53.6% have unknown function, and hence, represent an important, but yet, under explored category of genes.
The DNA sequence of 106 BAC/PAC clones in the minimum tiling path (MTP) of the long arm of rice chromosome 11, between map positions 57.3 and 116.2 cM, has been assembled to phase 2 or PLN level. This region has been sequenced to 10x redundancy by the Indian Initiative for Rice Genome Sequencing (IIRGS) and is now publicly available in GenBank. The region, excluding overlaps, has been predicted to contain 2,932 genes using different software. A gene-by-gene BLASTN search of the NCBI wheat EST database of over 420,000 cDNA sequences revealed that 1,143 of the predicted rice genes (38.9%) have significant homology to wheat ESTs (bit score >/= 100). Further BLASTN search of these 1,143 rice genes with the GrainGenes database of sequence contigs containing bin-mapped wheat ESTs allowed 113 of the genes to be placed in bins located on wheat chromosomes of different homoeologous groups. The largest number of genes, about one-third, mapped to the homoeologous group 4 chromosomes of wheat, suggesting a common evolutionary origin. The remaining genes were located on wheat chromosomes of different groups with significantly higher numbers for groups 3 and 5. Location of bin-mapped wheat contigs to chromosomes of all the seven homoeologous groups can be ascribed to movement of genes (transpositions) or chromosome segments (translocations) within rice or the hexaploid wheat genomes. Alternatively, it could be due to ancient duplications in the common ancestral genome of wheat and rice followed by selective elimination of genes in the wheat and rice genomes. While there exists definite conservation of gene sequences and the ancestral chromosomal identity between rice and wheat, there is no obvious conservation of the gene order at this level of resolution. Lack of extensive colinearity between rice and wheat genomes suggests that there have been many insertions, deletions, duplications and translocations that make the synteny comparisons much more complicated than earlier thought. However, enhanced resolution of comparative sequence analysis may reveal smaller conserved regions of colinearity, which will facilitate selection of markers for saturation mapping and sequencing of the gene-rich regions of the wheat genome.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.