Eucalypts are the world's most widely planted hardwood trees. Their outstanding diversity, adaptability and growth have made them a global renewable resource of fibre and energy. We sequenced and assembled >94% of the 640-megabase genome of Eucalyptus grandis. Of 36,376 predicted protein-coding genes, 34% occur in tandem duplications, the largest proportion thus far in plant genomes. Eucalyptus also shows the highest diversity of genes for specialized metabolites such as terpenes that act as chemical defence and provide unique pharmaceutical oils. Genome sequencing of the E. grandis sister species E. globulus and a set of inbred E. grandis tree genomes reveals dynamic genome evolution and hotspots of inbreeding depression. The E. grandis genome is the first reference for the eudicot order Myrtales and is placed here sister to the eurosids. This resource expands our understanding of the unique biology of large woody perennials and provides a powerful tool to accelerate comparative biology, breeding and biotechnology.
A major opportunity for a sustainable energy and biomaterials economy in many parts of the world lies in a better understanding of the molecular basis of superior growth and adaptation in woody plants. Part of this opportunity involves species of Eucalyptus L'Hér, a genus of woody perennials native to Australia 1. The remarkable adaptability of eucalypts coupled with their fast growth and superior wood properties has driven their rapid adoption for plantation forestry in more than 100 countries across six continents (>20 million ha) 2, making eucalypts the most widely planted hardwood forest trees in the world. The subtropical E. grandis and the temperate E. globulus stand out as targets of breeding programmes worldwide. Planted eucalypts provide key renewable resources for the production of pulp, paper, biomaterials and bioenergy, while mitigating human pressures on native forests 3.
Eucalypts also have a large diversity and high concentration of essential oils (mixtures of mono- and sesquiterpenes), many of which have ecological functions as well as medicinal and industrial uses. Predominantly outcrossers 1 with hermaphroditic animal-pollinated flowers, eucalypts are highly heterozygous and display pre- and postzygotic barriers to selfing to reduce inbreeding depression for fitness and survival 4.
To mitigate the challenge of assembling a highly heterozygous genome, we sequenced the genome of 'BRASUZ1', a 17-year-old E. grandis genotype derived from one generation of selfing. The availability of annotated forest tree genomes from two separately evolving rosid lineages, Eucalyptus (order Myrtales) and Populus (order Malpighiales 5), in combination with genomes from domesticated woody plants (for example, Vitis, Prunus, Citrus), provides a comparative foundation for addressing
Background: De novo assembly of transcript sequences produced by short-read DNA sequencing technologies offers a rapid approach to obtain expressed gene catalogs for non-model organisms. A draft genome sequence will be produced in 2010 for a Eucalyptus tree species (E. grandis) representing the most important hardwood fibre crop in the world. Genome annotation of this valuable woody plant and genetic dissection of its superior growth and productivity will be greatly facilitated by the availability of a comprehensive collection of expressed gene sequences from multiple tissues and organs. Results: We present an extensive expressed gene catalog for a commercially grown E. grandis × E. urophylla hybrid clone constructed using only Illumina mRNA-Seq technology and de novo assembly. A total of 18,894 transcript-derived contigs, a large proportion of which represent full-length protein-coding genes, were assembled and annotated. Analysis of assembly quality, length and diversity shows that this dataset represents the most comprehensive expressed gene catalog for any Eucalyptus tree. mRNA-Seq analysis furthermore allowed digital expression profiling of all of the assembled transcripts across diverse xylogenic and non-xylogenic tissues, which is invaluable for ascribing putative gene functions. Conclusions: De novo assembly of Illumina mRNA-Seq reads is an efficient approach for transcriptome sequencing and profiling in Eucalyptus and other non-model organisms. The transcriptome resource (Eucspresso, http://eucspresso.bi.up.ac.za/) generated by this study will be of value for genomic analysis of woody biomass production in Eucalyptus and for comparative genomic analysis of growth and development in woody and herbaceous plants.
Higher plants contain a family of cellulose synthase catalytic subunit (CesA) genes that encode components of an enzyme complex embedded in the cell membrane. Recent studies in several higher plant species have demonstrated that two groups of CesA genes exist, associated with either primary or secondary cell wall deposition. We cloned six full-length CesA cDNAs from Eucalyptus grandis W. Hill ex Maiden (EgCesA1 through 6) and determined their expression patterns in a variety of organs from an adult tree. The six EgCesA genes encode predicted proteins of 978 to 1097 amino acid residues, each of which contains all of the key regions and motifs characteristic of functional CESA proteins. The predicted proteins share limited amino acid identity with each other, ranging from 61 to 70%. In contrast, similar CESA proteins from higher plant species exhibit 81 to 90% identity with the six EgCESAs. Gene expression analysis using quantitative reverse-transcription polymerase chain reaction indicated that transcripts of EgCesA1 to 3 were abundant in tissues enriched with cells laying down secondary cell walls (e.g., xylem), but were weakly expressed in tissues undergoing primary growth (e.g., unfolding leaves). Expression of EgCesA4 and EgCesA5 was upregulated in tissues rich in rapidly dividing cells undergoing primary wall synthesis, whereas EgCesA6 was weakly expressed in all tissues analyzed. These results suggest that Eucalyptus, like other higher plants, expresses two contrasting groups of apparently co-regulated CesAs involved in either primary or secondary cell wall biosynthesis.
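The identity ranges quoted above (61 to 70% among the EgCESAs, 81 to 90% against similar proteins from other species) are pairwise percent identities between aligned protein sequences. The abstract does not say how identity was computed, so the following is only a minimal sketch under one common convention: identical residues divided by the number of aligned columns in which neither sequence has a gap. The function name `percent_identity` and the toy sequences are hypothetical and are not taken from the study.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity between two pre-aligned sequences.

    Assumption: columns with a gap in either sequence are excluded from
    the denominator. Other conventions (e.g. dividing by the full
    alignment length) give different values.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")

    compared = 0  # columns with a residue in both sequences
    matches = 0   # columns where those residues are identical
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue
        compared += 1
        if a == b:
            matches += 1

    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


# Toy, made-up aligned fragments (not real CESA sequences).
toy_a = "MKT-LLVA"
toy_b = "MKTALIVA"
print(f"{percent_identity(toy_a, toy_b):.1f}% identity")
```

Because the choice of denominator changes the result, identity values are only comparable between studies when the same convention and alignment method are used.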
Summary: As a step toward functional annotation of genes required for floral initiation and development within the Eucalyptus genome, we used short read sequencing to analyze transcriptomes of floral buds from early and late developmental stages, and compared these with transcriptomes of diverse vegetative tissues, including leaves, roots, and stems. A subset of 4807 genes (13% of protein-coding genes) were differentially expressed between floral buds of either stage and vegetative tissues. A similar proportion of genes were differentially expressed among all tissues. A total of 479 genes were differentially expressed between early and late stages of floral development. Gene function enrichment identified 158 gene ontology classes that were overrepresented in floral tissues, including 'pollen development' and 'aromatic compound biosynthetic process'. At least 40 floral-dominant genes lacked functional annotations and thus may be novel floral transcripts. We analyzed several genes and gene families in depth, including 49 putative biomarkers of floral development, the MADS-box transcription factors, 'S-domain'-receptor-like kinases, and selected gene family members with phosphatidylethanolamine-binding protein domains. Expanded MADS-box gene subfamilies in Eucalyptus grandis included SUPPRESSOR OF OVEREXPRESSION OF CO 1 (SOC1), SEPALLATA (SEP) and SHORT VEGETATIVE PHASE (SVP) Arabidopsis thaliana homologs. These data provide a rich resource for functional and evolutionary analysis of genes controlling eucalypt floral development, and new tools for breeding and biotechnology.
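The proportions in this summary can be checked with simple arithmetic. The sketch below assumes that the denominator is the 36,376 predicted protein-coding genes reported in the E. grandis genome abstract; the summary itself does not state which annotation count was used, so that pairing is an assumption, and the helper name `percent_of_genes` is hypothetical.

```python
# Assumed denominator: the gene count from the E. grandis genome abstract.
# The floral transcriptome summary does not state the total it used.
TOTAL_PROTEIN_CODING_GENES = 36_376


def percent_of_genes(count: int, total: int = TOTAL_PROTEIN_CODING_GENES) -> float:
    """Return `count` as a percentage of `total` genes."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * count / total


# Counts taken from the summary above.
floral_vs_vegetative = percent_of_genes(4807)  # floral buds vs vegetative tissues
early_vs_late = percent_of_genes(479)          # early vs late floral stages

print(f"floral vs vegetative: {floral_vs_vegetative:.1f}% of genes")
print(f"early vs late floral: {early_vs_late:.1f}% of genes")
```

Under that assumption the first value rounds to the 13% quoted in the summary, while the early-versus-late contrast involves roughly a tenth as many genes.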