Molecular evolution analyses, such as detection of adaptive/purifying selection or ancestral protein reconstruction, typically require three inputs for a target gene (or gene family) in a particular group of organisms: sequence alignment, model of evolution, and phylogenetic tree. While modern advances in high-throughput sequencing techniques have led to rapid accumulation of genomic-scale data in public repositories and databases, mining such vast amount of information often remains a challenging enterprise. Here, we describe a comprehensive, versatile workflow aimed at the preparation of genome-extracted datasets readily available for molecular evolution research. The workflow involves: (1) fishing (searching and capturing) specific gene sequences of interest from taxonomically diverse genomic data available in databases at variable levels of annotation, (2) processing and depuration of retrieved sequences, (3) production of a multiple sequence alignment, (4) selection of best-fit model of evolution, and (5) solid reconstruction of a phylogenetic tree.
Aquaporins (AQPs) are a highly diverse family of transmembrane proteins involved in osmotic regulation that played an important role in the conquest of land by tetrapods. However, little is known about their possible implication in the acquisition of an amphibious lifestyle in actinopterygian fishes. Herein, we investigated the molecular evolution of AQPs in 22 amphibious actinopterygian fishes by assembling a comprehensive dataset that was used to (1) catalogue AQP paralog members and classes; (2) determine the gene family birth and death process; (3) test for positive selection in a phylogenetic framework; and (4) reconstruct structural protein models. We found evidence of adaptive evolution in 21 AQPs belonging to 5 different classes. Almost half of the tree branches and protein sites that were under positive selection were found in the AQP11 class. The detected sequence changes indicate modifications in molecular function and/or structure, which could be related to adaptation to an amphibious lifestyle. AQP11 orthologues appear to be the most promising candidates to have facilitated the processes of the water-to-land transition in amphibious fishes. Additionally, the signature of positive selection found in the AQP11b stem branch of the Gobiidae clade suggests a possible case of exaptation in this clade.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.