Homologous amino acid sequences of phospholipases A2 (PLA2) of snakes belonging to the families Elapidae, Viperidae, and Colubridae were considered in order to study the conservative and variable regions location. The PLA2 sequences were divided into two groups (taxons) according to the phylogenetic tree reconstructed from the pair similarity matrix. Results of the intergroup comparison were plotted to facilitate the identification of significant conservative and variable regions. It was shown that the results of the comparison between two phylogenetic groups of snake PLA2 did not much depend on the number of each group representatives and did not markedly change if one of the groups was represented by the single sequence. The knowledge of the number and location of conservative and variable regions and their dependence on the phylogenetic relations between compared taxa may be used to predict a synthetic peptide structure to obtain specific antibodies against PLA2 of one of these taxons. Such prediction is possible if there is a specific region conservative for one taxon but variable for two of them.
A method of identification of significant conservative and variable regions in homologous protein sequences is presented. A set of aligned homologous sequences is divided into two groups consisting of m and n most related sequences. Each pair of sequences from different group is compared using unitary similarity matrix. The superposition of pairwise comparisons scanned by a window of 10 amino acid residues gives intergroup local variability profile (VP). Area S of the figure between the VP and its mean value line is compared with averaged area S(r) of 1000 VPs of artificial homologous protein families. The difference (S-S(r)) given in standard deviation units sigma r is believed to be the amino acid substitution overall irregularity along the homologous protein sequences OI = (S-S(r))/sigma r. If OI greater than 2, the real VP extrema containing the surplus of area S-(S(r) + 2 sigma r) are cut off. The cut off stretches are likely to be significant conservative and variable regions. The significant conservative and variable regions of six homologous sequence families (phospholipases A2, cytochromes b, alpha-subunits of Na, K-ATPase, L- and M-subunits of photosynthetic bacteria photoreaction centre and human rhodopsins) were identified. It was shown that for artificial homologous protein sequences derived by k-fold lengthening of natural proteins the OI value rises as square root of k. To compare the degree of substitution irregularity in homologous protein sequence families of different length L the value of standard substitution overall irregularity for L = 250 is proposed.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.