Introduction
The propensity of nucleotide bases to form pairs, causes folding and the formation of secondary structure in the RNA. Therefore, purine (R): pyrimidine (Y) base-pairing is vital to maintain uniform lateral dimension in RNA secondary structure. Transversions or base substitutions between R and Y bases, are more detrimental to the stability of RNA secondary structure, than transitions derived from substitutions between A and G or C and T. The study of transversion and transition base substitutions is important to understand evolutionary mechanisms of RNA secondary structure in the 5′ and 3′ untranslated (UTR) regions of SARS-CoV-2. In this work, we carried out comparative analysis of transition and transversion base substitutions in the stem and loop regions of RNA secondary structure of SARS-CoV-2.
Methods
We have considered the experimentally determined and well documented stem and loop regions of 5′ and 3′ UTR regions of SARS-CoV-2 for base substitution analysis. The secondary structure comprising of stem and loop regions were visualized using the RNAfold web server. The GISAID repository was used to extract base sequence alignment of the UTR regions. Python scripts were developed for comparative analysis of transversion and transition frequencies in the stem and the loop regions.
Results
The results of base substitution analysis revealed a higher transition (ti) to transversion (tv) ratio (ti/tv) in the stem region of UTR of RNA secondary structure of SARS-CoV-2 reported during the early stage of the pandemic. The higher ti/tv ratio in the stem region suggested the influence of secondary structure in selecting the pattern of base substitutions. This differential pattern of ti/tv values between stem and loop regions was not observed among the Delta and Omicron variants that dominated the later stage of the pandemic. It is noteworthy that the ti/tv values in the stem and loop regions were similar among the later dominant Delta and Omicron variant strains which is to be investigated to understand the rapid evolution and global adaptation of SARS-CoV-2.
Conclusion
Our findings implicate the lower frequency of transversions than the transitions in the stem regions of UTRs of SARS-CoV-2. The RNA secondary structures are associated with replication, translation, and packaging, further investigations are needed to understand these base substitutions across different variants of SARS-CoV-2.