Secondary structure models are an important step for aligning sequences, understanding probabilities of nucleotide substitutions, and evaluating the reliability of phylogenetic reconstructions. A set of conserved sequence motifs is derived from comparative sequence analysis of 184 invertebrate and vertebrate taxa (including many taxa from the same genera, families, and orders) with reference to a secondary structure model for domain III of animal mitochondrial small subunit (12S) ribosomal RNA. A template is presented to assist with secondary structure drawing. Our model is similar to previous models but is more specific to mitochondrial DNA, fitting both invertebrate and vertebrate groups, including taxa with markedly different nucleotide compositions. The second half of the domain III sequence can be difficult to align precisely, even when secondary structure information is considered. This is especially true for comparisons of anciently diverged taxa, but well-conserved motifs assist in determining biologically meaningful alignments. Patterns of conservation and variability in both paired and unpaired regions make differential phylogenetic weighting in terms of "stems" and "loops" unsatisfactory. We emphasize looking carefully at the sequence data before and during analyses, and advocate the use of conserved motifs and other secondary structure information for assessing sequencing fidelity.
The evolutionary origin of the pinnipeds (seals, sea lions, and walruses) is still uncertain. Most authors support a hypothesis of a monophyletic origin of the pinnipeds from a caniform carnivore. A minority view suggests a diphyletic origin with true seals being related to the mustelids (otters and ferrets). The phylogenetic relationships of the walrus to other pinniped and carnivore families are also still particularly problematic. Here we examined the relative support for mono- and diphyletic hypotheses using DNA sequence data from the mitochondrial small subunit (12S) rRNA and cytochrome b genes. We first analyzed a small group of taxa representing the three pinniped families (Phocidae, Otariidae, and Odobenidae) and caniform carnivore families thought to be related to them. We inferred phylogenetic reconstructions from DNA sequence data using standard parsimony and neighbor-joining algorithms for phylogenetic inference as well as a new method called spectral analysis (Hendy and Penny) in which phylogenetic information is displayed independently of any selected tree. We identified and compensated for potential sources of error known to lead to selection of incorrect phylogenetic trees. These include sampling error, unequal evolutionary rates on lineages, unequal nucleotide composition among lineages, unequal rates of change at different sites, and inappropriate tree selection criteria. To correct for these errors, we performed additional transformations of the observed substitution patterns in the sequence data, applied more stringent structural constraints to the analyses, and included several additional taxa to help resolve long, unbranched lineages in the tree. We find that there is strong support for a monophyletic origin of the pinnipeds from within the caniform carnivores, close to the bear/raccoon/panda radiation. Evidence for a diphyletic origin was very weak and can be partially attributed to unequal nucleotide compositions among the taxa analyzed. Subsequently, there is slightly more evidence for grouping the walrus with the eared seals versus the true seals. A more conservative interpretation, however, is that the walrus is an early, but not the first, independent divergence from the common pinniped ancestor.
The performances of five global multiple-sequence alignment programs (CLUSTAL W, Divide and Conquer, Malign, PileUp, and TreeAlign) were evaluated using part of the animal mitochondrial small subunit (12S) rRNA molecule. Conserved sequence motifs derived from an alignment based on secondary structural information were used to score how well each program aligned a data set of five vertebrate and five invertebrate taxa over a range of parameter values. All of the programs could align the motifs with reasonable accuracy for at least one set of parameter conditions, although if the whole sequence was considered, similarity to the structural alignment was only 25%-34%. Use of small gap costs generally gave more accurate results, although Malign and TreeAlign generated longer alignments when gap costs were low. The programs differed in the consistency of the alignments when gap cost was varied; CLUSTAL W, Divide and Conquer, and TreeAlign were the most accurate and robust, while PileUp performed poorly as gap cost values increased, and the accuracy of Malign fluctuated. Default settings for the programs did not give the best results, and attempting to select similar parameter values in different programs did not always result in more similar alignments. Poor alignment of even well-conserved motifs can occur if these are near sites with insertions or deletions. Since there is no a priori way to determine gap costs and because such costs can vary over the gene, alignment of rRNA sequences, particularly the less well conserved regions, should be treated carefully and aided by secondary structure and conserved motifs. Some motifs are single bases and so are often invisible to alignment programs. Our tests involved the most conserved regions of the 12S rRNA gene, and alignment of less well conserved regions will be more problematical. None of the alignments we examined produced a fully resolved phylogeny for the data set, indicating that this portion of 12S rRNA is insufficient for resolution of distant evolutionary relationships.
The evolutionary history of 25 New Zealand scincid lizards in the endemic genera Oligosoma and Cyclodina was examined using 12s rRNA sequence data. Phylogenetic resolution was poor, despite there being up to 9% sequence divergence between taxa. Lack of resolution was not attributable to biases in the data, such as site saturation or differences in sites free to vary, so we infer that New Zealand skinks underwent two relatively rapid phases of divergence. The rate of substitution for the skink sequences appears to be similar to some bird and mammal groups for which times of divergence have been estimated. Using these calibrations diversification of Olzgosorna skinks probably began at least 23 million years ago (Mya). The pattern of relationships and the timing of this diversification are interpreted as resulting from rapid allopatric speciation during the Oligocene (25-35 Mya) when New Zealand was fragmented into many low lying islands.
Sequences of macromolecules have "signals" or patterns that arise from a number of sources, particularly from shared common history or phylogeny. We discuss methods for inferring evolutionary trees from these patterns or signals under five properties desired for an ideal method. These five desiderata are that the methods be efficient (fast), consistent, powerful, robust, and falsifiable. Our conclusion is that corrections for multiple changes in sequences are the most important factor for any method to be consistent. Most optimality criteria, including compatibility and parsimony, become consistent when the sequences have appropriate corrections for multiple changes. Conversely, virtually no methods are consistent without adjustments for multiple changes. Hadamard conjugations are used to illustrate relationships between different methods and then illustrated by combining it with the closest tree optimality criterion. The data used to illustrate these recent developments include DNA sequences used to study B93018 Received 2 February 1993; accepted 13 July 1993 the origin of chloroplasts and also New Zealand skinks (Leiolopisma spp).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.