Eucalyptus is an important short rotation pulpy woody plant, grown widely in the tropics. Recently, many genomic programmes are underway leading to the accumulation of voluminous genomic and expressed sequence tag sequences in public databases. These sequences can be utilized for analysis of simple sequence repeats (SSRs) and single nucleotide polymorphism (SNPs) available in the transcribed genes. In this study, in silico analysis of 15,285 sequences representing partial and full-length mRNA from Eucalyptus species for their use in developing SSRs or microsatellites were carried out. A total of 875 EST-SSRs were identified from 772 SSR containing ESTs. Motif size of 6 for dinucleotide and 5 for trinucleotide, tetranucleotide, and pentanucleotides were considered in locating the microsatellites. The average frequency of identified SSRs was 12.9%. The dinucleotide repeats were the most abundant among the dinucleotide, trinucleotide and tetranucleotide motifs and accounted for 50.9% of the Eucalyptus genome. Primer designing analysis showed that 571 sequences with SSRs had sufficient flanking regions for polymerase chain reaction (PCR) primer synthesis. Evaluation of the usefulness of the SSRs showed that EST-derived SSRs can generate polymorphic markers as all the primers showed allelic diversity among the 16 provenances of E. tereticornis.
Eucalyptus camaldulensis and E. tereticornis are closely related species commonly cultivated for pulp wood in many tropical countries including India. Understanding the genetic structure and linkage disequilibrium (LD) existing in these species is essential for the improvement of industrially important traits. Our goal was to evaluate the use of simple sequence repeat (SSR) loci for species discrimination, population structure and LD analysis in these species. Investigations were carried out with the most common alleles in 93 accessions belonging to these two species using 62 SSR markers through cross amplification. The polymorphic information content (PIC) ranged from 0.44 to 0.93 and 0.36 to 0.93 in E. camaldulensis and E. tereticornis respectively. A clear delineation between the two species was evident based on the analysis of population structure and species-specific alleles. Significant genotypic LD was found in E. camaldulensis, wherein out of 135 significant pairs, 17 pairs showed r2≥0.1. Similarly, in E. tereticornis, out of 136 significant pairs, 18 pairs showed r2≥0.1. The extent of LD decayed rapidly showing the significance of association analyses in eucalypts with higher resolution markers. The availability of whole genome sequence for E. grandis and the synteny and co-linearity in the genome of eucalypts, will allow genome-wide genotyping using microsatellites or single nucleotide polymorphims.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.