We have recently published a new probabilistic algorithm which performs genomic comparisons on a huge scale. In the present paper it was applied to immunodeficiency viral sequences extracted from international gene databanks. During global sequence analysis of human (HIV1 and HIV2) and simian viruses by means of dot-matrix representation, series of homology were obtained which permitted the definition of families of viruses overlapping the species divisions. Sequences of interest were characterized to the lexical base sentence through successive zoomings. Strain-to-strain comparison confirmed subfamily classifications and led, for example, to the identification of divergent LTR sequences. By way of example, we described the application of the algorithm to the ANT70C and MVP5180 HIV1-O viruses, for which the observed differences were shown to correspond to a deletion in the U3 region, situated between the LEF and NF-kappaB sites. It was of interest to consider these data in a tentative phylogenetic interpretation.
Considering that recombinations produce quasispecies in lentivirus spreading, we identified and localized highly conserved sequences that may play an important role in viral ontology. Comparison of entire genomes, including 237 human, simian and non-primate mammal lentiviruses and 103 negative control viruses, led to identify 28 Conserved Lentiviral Sequences (CLSs). They were located mainly in the structural genes forming hot spots particularly in the gag and pol genes and to a lesser extent in LTRs and regulatory genes. The CLS pattern was the same throughout the different HIV-1 subtypes, except for some HIV-1-O strains. Only CLS 3 and 4 were detected in both negative control HTLV-1 oncornaviruses and D-particle-forming simian viruses, which are not immunodeficiency inducers and display a genetic stability. CLSs divided the virus genomes into domains allowing us to distinguish sequence families leading to the notion of 'species self' besides that of 'lentiviral self'. Most of acutely localized CLSs in HIV-1s (82%) corresponded to wide recombination segments being currently reported.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.