Complete chromosome/genome sequences available from humans, Drosophila melanogaster, Caenorhabditis elegans, Arabidopsis thaliana, and Saccharomyces cerevisiae were analyzed for the occurrence of mono-, di-, tri-, and tetranucleotide repeats. In all of the genomes studied, dinucleotide repeat stretches tended to be longer than other repeats. Additionally, tetranucleotide repeats in humans and trinucleotide repeats in Drosophila also seemed to be longer. Although the trends for different repeats are similar between different chromosomes within a genome, the density of repeats may vary between different chromosomes of the same species. The abundance or rarity of various di- and trinucleotide repeats in different genomes cannot be explained by nucleotide composition of a sequence or potential of repeated motifs to form alternative DNA structures. This suggests that in addition to nucleotide composition of repeat motifs, characteristic DNA replication/repair/recombination machinery might play an important role in the genesis of repeats. Moreover, analysis of complete genome coding DNA sequences of Drosophila, C. elegans, and yeast indicated that expansions of codon repeats corresponding to small hydrophilic amino acids are tolerated more, while strong selection pressures probably eliminate codon repeats encoding hydrophobic and basic amino acids. The locations and sequences of all of the repeat loci detected in genome sequences and coding DNA sequences are available at http://www.ncl-india.org/ssr and could be useful for further studies.
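As an illustration of the kind of scan described above, the following is a minimal sketch of detecting perfect mono- to tetranucleotide repeats in a DNA string. The function name `find_ssrs` and the minimum copy-number thresholds are assumptions made for this example; the abstract does not state which software or cutoffs were used.

```python
import re

def find_ssrs(seq, min_repeats=None):
    """Return sorted (start, motif, copies) tuples for perfect tandem repeats
    with motif lengths 1 to 4. Thresholds are illustrative assumptions."""
    if min_repeats is None:
        # Hypothetical minimum copy numbers per motif length (not from the source).
        min_repeats = {1: 8, 2: 5, 3: 4, 4: 3}
    seq = seq.upper()
    hits = []
    for k, n in min_repeats.items():
        # Capture a k-base motif, then require at least n-1 further copies of it.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are just a shorter unit repeated (e.g. "CACA" is "CA" x 2).
            if any(motif == motif[:d] * (k // d) for d in range(1, k) if k % d == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // k))
    return sorted(hits)

# Toy sequence made up for this example: a (CA)6 tract and an (AAG)4 tract.
example = "GT" + "CA" * 6 + "GGT" + "AAG" * 4 + "C"
for start, motif, copies in find_ssrs(example):
    print(start, motif, copies)
```

Repeat lengths per motif class can then be compared by collecting `copies * len(motif)` for each hit; counting only perfect repeats is a simplification, since imperfect repeats would need a mismatch-tolerant search.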
All protein sequences from the SWISS-PROT database were analyzed for the occurrence of single amino acid repeats, tandem oligo-peptide repeats, and periodically conserved amino acids. Single amino acid repeats of glutamine, serine, glutamic acid, glycine, and alanine seem to be tolerated to a considerable extent in many proteins. Tandem oligo-peptide repeats of different types with varying levels of conservation were detected in several proteins and found to be conspicuous, particularly in structural and cell surface proteins. It appears that repeated sequence patterns may be a mechanism that provides regular arrays of spatial and functional groups, useful for structural packing or for one-to-one interactions with target molecules. To facilitate further exploration, a database of Tandem Repeats in Protein Sequences (TRIPS) has been developed and is available at http://www.ncl-india.org/trips.
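A single amino acid repeat, in the sense used above, is a run of one residue repeated consecutively. The sketch below shows one simple way to list such runs in a protein string. The function name `single_residue_runs` and the minimum run length of 5 are assumptions for illustration, not parameters reported in the abstract.

```python
from itertools import groupby

def single_residue_runs(protein, min_len=5):
    """Return (start, residue, length) for each run of one amino acid
    that is at least min_len long. min_len is an illustrative assumption."""
    runs = []
    pos = 0
    # groupby yields consecutive identical residues as one group.
    for residue, group in groupby(protein.upper()):
        length = sum(1 for _ in group)
        if length >= min_len:
            runs.append((pos, residue, length))
        pos += length
    return runs

# Toy sequence made up for this example: a glutamine run and a serine run.
print(single_residue_runs("MKQQQQQQLSSSSSAG"))
```

Tandem oligo-peptide repeats with imperfect conservation would need an alignment-based or scoring-based search, which this sketch does not attempt.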
Comparative promoter analysis is a promising strategy for elucidation of common regulatory modules conserved in evolutionarily related sequences or in genes showing common expression profiles. To facilitate such analysis, we have developed a software tool that detects conserved transcription factor binding sites, cis-elements, palindromes and k-tuples simultaneously in a set of sequences, and thus helps to identify putative motifs for designing further experiments.
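Two of the features named above, k-tuples shared across a set of sequences and palindromes, can be illustrated with a short sketch. This is not the authors' tool. The function names and the exact-match criterion are assumptions, and "palindrome" is taken here to mean a sequence equal to its own reverse complement.

```python
def reverse_complement(seq):
    """Reverse complement of an uppercase DNA string (A, C, G, T only)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

def shared_ktuples(seqs, k):
    """Return the set of k-tuples that occur exactly in every input sequence."""
    sets = [{s[i:i + k] for i in range(len(s) - k + 1)} for s in seqs]
    return set.intersection(*sets) if sets else set()

def palindromic(kmers):
    """Keep only k-tuples that equal their own reverse complement."""
    return {m for m in kmers if m == reverse_complement(m)}

# Toy promoter fragments made up for this example; each contains GAATTC.
promoters = ["TTGAATTCGG", "ACGAATTCAT", "GGGAATTCCC"]
common = shared_ktuples(promoters, 6)
print(common, palindromic(common))
```

Requiring exact matches in every sequence is the strictest possible criterion. A practical tool would likely allow mismatches or score matches against known binding-site matrices.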
Grain protein concentration (GPC) of hexaploid wheat is one of the important factors determining end-product quality and also plays a pivotal role in human nutrition. In an attempt to identify PCR-based DNA markers linked to GPC, 106 recombinant inbred lines (RILs) were developed by the single seed descent method from a cross between two wheat cultivars, PH132 and WL711, which differ significantly in GPC. The RILs were phenotyped for GPC at two diverse agroclimatic locations, namely Pune and Ludhiana, to study the influence of genotype and environment interactions on this trait. The parents were screened with 85 inter simple sequence repeat (ISSR) primers and 350 random primers. The selective genotyping and whole population analysis revealed nine DNA markers associated with the trait. Three markers (UBC8441100, UBC8801000, and OPA4800) were observed to be associated with the trait in both locations, whereas two markers (OPH41400 and UBC873750) were found to be specific to Pune, and four markers (OPM5870, OPO10870, OPV141200, and UBC8251000) were specific to Ludhiana. Together five markers at the Pune location representing five QTLs and seven markers at Ludhiana representing four QTLs accounted for 13.4 and 13.5% of total phenotypic variation, respectively. This study clearly demonstrates that GPC is strongly influenced by the environment and shows the applicability of ISSR and RAPD markers for finding chromosomal regions associated with quantitative traits in wheat such as GPC.
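The share of phenotypic variation attributed to a marker is commonly expressed as an R² value. The sketch below computes it for a single dominant marker scored as band present (1) or absent (0), using the between-class sum of squares over the total sum of squares. The abstract does not say which statistical procedure was used, so this method, the function name `variance_explained`, and the toy values are all assumptions for illustration.

```python
def variance_explained(genotypes, phenotypes):
    """R^2 of phenotype on marker class: between-class sum of squares
    divided by total sum of squares (one-way ANOVA decomposition)."""
    n = len(phenotypes)
    grand_mean = sum(phenotypes) / n
    ss_total = sum((y - grand_mean) ** 2 for y in phenotypes)
    ss_between = 0.0
    for marker_class in set(genotypes):
        ys = [y for g, y in zip(genotypes, phenotypes) if g == marker_class]
        class_mean = sum(ys) / len(ys)
        ss_between += len(ys) * (class_mean - grand_mean) ** 2
    return ss_between / ss_total

# Made-up toy data (not from the study): band scores and GPC values for four lines.
band = [1, 1, 0, 0]
gpc = [12.0, 14.0, 10.0, 12.0]
print(variance_explained(band, gpc))
```

Running the same calculation separately on the phenotypes from each location is one simple way to see whether a marker's association holds in both environments or in only one.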