This is a PDF file of an article that has undergone enhancements after acceptance, such as the addition of a cover page and metadata, and formatting for readability, but it is not yet the definitive version of record. This version will undergo additional copyediting, typesetting and review before it is published in its final form, but we are providing this version to give early visibility of the article. Please note that, during the production process, errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
Human papillomavirus (HPV) 16 and 18 are the most predominant types in cervical cancer. Only a small fraction of HPV infections progress to cancer, indicating that additional factors and genomic events contribute to the carcinogenesis, such as minor nucleotide variation caused by APOBEC3 and chromosomal integration. We analysed intra-host minor nucleotide variants (MNVs) and integration in HPV16 and HPV18 positive cervical samples with different morphology. Samples were sequenced using an HPV whole genome sequencing protocol TaME-seq. A total of 80 HPV16 and 51 HPV18 positive samples passed the sequencing depth criteria of 300× reads, showing the following distribution: non-progressive disease (HPV16 n = 21, HPV18 n = 12); cervical intraepithelial neoplasia (CIN) grade 2 (HPV16 n = 27, HPV18 n = 9); CIN3/adenocarcinoma in situ (AIS) (HPV16 n = 27, HPV18 n = 30); cervical cancer (HPV16 n = 5). Similar numbers of MNVs in HPV16 and HPV18 samples were observed for most viral genes, with the exception of HPV18 E4 with higher numbers across clinical categories. APOBEC3 signatures were observed in HPV16 lesions, while similar mutation patterns were not detected for HPV18. The proportion of samples with integration was 13% for HPV16 and 59% for HPV18 positive samples, with a noticeable portion located within or close to cancer-related genes.
Background Previously developed TaME-seq method for deep sequencing of HPV, allowed simultaneous identification of the human papillomavirus (HPV) DNA consensus sequence, low-frequency variable sites, and chromosomal integration events. The method has been successfully validated and applied to the study of five carcinogenic high-risk (HR) HPV types (HPV16, 18, 31, 33, and 45). Here, we present TaME-seq2 with an updated laboratory workflow and bioinformatics pipeline. The HR-HPV type repertoire was expanded with HPV51, 52, and 59. As a proof-of-concept, TaME-seq2 was applied on SARS-CoV-2 positive samples showing the method’s flexibility to a broader range of viruses, both DNA and RNA. Results Compared to TaME-seq version 1, the bioinformatics pipeline of TaME-seq2 is approximately 40× faster. In total, 23 HPV-positive samples and seven SARS-CoV-2 clinical samples passed the threshold of 300× mean depth and were submitted to further analysis. The mean number of variable sites per 1 kb was ~ 1.5× higher in SARS-CoV-2 than in HPV-positive samples. Reproducibility and repeatability of the method were tested on a subset of samples. A viral integration breakpoint followed by a partial genomic deletion was found in within-run replicates of HPV59-positive sample. Identified viral consensus sequence in two separate runs was > 99.9% identical between replicates, differing by a couple of nucleotides identified in only one of the replicates. Conversely, the number of identical minor nucleotide variants (MNVs) differed greatly between replicates, probably caused by PCR-introduced bias. The total number of detected MNVs, calculated gene variability and mutational signature analysis, were unaffected by the sequencing run. Conclusion TaME-seq2 proved well suited for consensus sequence identification, and the detection of low-frequency viral genome variation and viral-chromosomal integrations. The repertoire of TaME-seq2 now encompasses seven HR-HPV types. Our goal is to further include all HR-HPV types in the TaME-seq2 repertoire. Moreover, with a minor modification of previously developed primers, the same method was successfully applied for the analysis of SARS-CoV-2 positive samples, implying the ease of adapting TaME-seq2 to other viruses.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.