2017
DOI: 10.7717/peerj-cs.118
|View full text |Cite
|
Sign up to set email alerts
|

VESPA: Very large-scale Evolutionary and Selective Pressure Analyses

Abstract: Background. Large-scale molecular evolutionary analyses of protein coding sequences requires a number of preparatory inter-related steps from finding gene families, to generating alignments and phylogenetic trees and assessing selective pressure variation. Each phase of these analyses can represent significant challenges, particularly when working with entire proteomes (all protein coding sequences in a genome) from a large number of species. Methods. We present VESPA, software capable of automating a selectiv… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
13
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 22 publications
(13 citation statements)
references
References 25 publications
0
13
0
Order By: Relevance
“…To further investigate candidate genes detected to be under positive selection, protein structure information from the relevant gene available in the literature was used. Nucleotide sequences of orthologous genes in FASTA format were translated into amino acids using “vespa.py translate” from the VEPSA tool 95 . The AWD protein sequences were then aligned to human annotated versions of the orthologous genes within Geneious v11.1.1 96 to identify amino acid changes that occur within a given protein domain (see Fig.…”
Section: Methodsmentioning
confidence: 99%
“…To further investigate candidate genes detected to be under positive selection, protein structure information from the relevant gene available in the literature was used. Nucleotide sequences of orthologous genes in FASTA format were translated into amino acids using “vespa.py translate” from the VEPSA tool 95 . The AWD protein sequences were then aligned to human annotated versions of the orthologous genes within Geneious v11.1.1 96 to identify amino acid changes that occur within a given protein domain (see Fig.…”
Section: Methodsmentioning
confidence: 99%
“…Ortholog assignments from Mauve were further refined using Pfam and COG category analysis in eggNOG-mapper ( 77 , 78 ). These ortholog assignments were checked against homology groups created through reciprocal nucleotide BLAST search using Vespa ( 79 ). Homology groups were translated and realigned using MAFFT ( 80 ) in Geneious Prime, and the percentages of identical residues for the nucleotide and amino acid alignments were reported.…”
Section: Methodsmentioning
confidence: 99%
“…A pair of alignments with a MetAl score of <0.15 were judged to be in mutual agreement, whereas a pair with a MetAl score of >0.15 were judged to be mutually discordant. 3 If any pair of alignments were judged to be mutually discordant by MetAl, the columnbased normalized mean distance of amino acid similarity across all sites in all three alignments was calculated using norMD, and the alignment with the highest similarity score was selected for orthogroup tree reconstruction (Thompson et al, 2001;Muller et al, 2010;Webb et al, 2017). Otherwise if all three alignment methods were in mutual agreement, the best-fit alignment was randomly selected.…”
Section: Orthogroup Alignment and Tree Reconstructionmentioning
confidence: 99%