DNA fragmentation is a fundamental step during library preparation in hybridization capturebased, short-read sequencing. Ultra-sonication has been used thus far to prepare DNA of an appropriate size, but this method is associated with a considerable loss of DNA sample. More recently, studies have employed library preparation methods that rely on enzymatic fragmentation with DNA endonucleases to minimize DNA loss, particularly in nano-quantity samples. Yet, despite their wide use, the effect of enzymatic fragmentation on the resultant sequences has not been carefully assessed. Here, we used pairwise comparisons of somatic variants of the same tumor DNA samples prepared using ultrasonic and enzymatic fragmentation methods. Our analysis revealed a substantially larger number of recurrent artifactual SNVs/indels in endonuclease-treated libraries as compared with those created through ultrasonication. These artifacts were marked by palindromic structure in the genomic context, positional bias in sequenced reads, and multi-nucleotide substitutions. Taking advantage of these distinctive features, we developed a filtering algorithm to distinguish genuine somatic mutations from artifactual noise with high specificity and sensitivity. Noise cancelling recovered the composition of the mutational signatures in the tumor samples. Thus, we provide an informatics algorithm as a solution to the sequencing errors produced as a consequence of endonuclease-mediated fragmentation, highlighted for the first time in this study.
BackgroundIt is known that functional RNAs often switch their functions by forming different secondary structures. Popular tools for RNA secondary structures prediction, however, predict the single ‘best’ structures, and do not produce alternative structures. There are bioinformatics tools to predict suboptimal structures, but it is difficult to detect which alternative secondary structures are essential.ResultsWe proposed a new computational method to detect essential alternative secondary structures from RNA sequences by decomposing the base-pairing probability matrix. The decomposition is calculated by a newly implemented software tool, RintW, which efficiently computes the base-pairing probability distributions over the Hamming distance from arbitrary reference secondary structures. The proposed approach has been demonstrated on ROSE element RNA thermometer sequence and Lysine RNA ribo-switch, showing that the proposed approach captures conformational changes in secondary structures.ConclusionsWe have shown that alternative secondary structures are captured by decomposing base-paring probabilities over Hamming distance. Source code is available from http://www.ncRNA.org/RintW.
Genes involved in the homologous recombination repair pathway—as exemplified by BRCA1, BRCA2, PALB2, ATM, and CHEK2—are frequently associated with hereditary breast and ovarian cancer syndrome. Germline mutations in the loci of these genes with loss of heterozygosity or additional somatic truncation at the WT allele lead to the development of breast cancers with characteristic clinicopathological features and prominent genomic features of homologous recombination deficiency, otherwise referred to as “BRCAness.” Although clinical genetic testing for these and other genes has increased the chances of identifying pathogenic variants, there has also been an increase in the prevalence of variants of uncertain significance, which poses a challenge to patient care because of the difficulties associated with making further clinical decisions. To overcome this challenge, we sought to develop a methodology to reclassify the pathogenicity of these unknown variants using statistical modeling of BRCAness. The model was developed with Lasso logistic regression by comparing 116 genomic attributes derived from 37 BRCA1/2 biallelic mutant and 32 homologous recombination‐quiescent breast cancer exomes. The model showed 95.8% and 86.7% accuracies in the training cohort and The Cancer Genome Atlas validation cohort, respectively. Through application of the model for variant reclassification of homologous recombination‐associated hereditary breast and ovarian cancer causal genes and further assessment with clinicopathological features, we finally identified one likely pathogenic and five likely benign variants. As such, the BRCAness model developed from the tumor exome was robust and provided a reasonable basis for variant reclassification.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.