2021
DOI: 10.1101/2021.09.01.457579
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

BWA-MEME: BWA-MEM emulated with a machine learning approach

Abstract: The growing use of next-generation sequencing and enlarged sequencing throughput require efficient short-read alignment, where seeding is one of the major performance bottlenecks. The key challenge in the seeding phase is searching for exact matches of substrings of short reads in the reference DNA sequence. Existing algorithms, however, present limitations in performance due to their frequent memory accesses. This paper presents BWA-MEME, the first full-fledged short read alignment software that leverages lea… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 10 publications
(2 citation statements)
references
References 24 publications
0
2
0
Order By: Relevance
“…20 Following trimming, paired-end reads were mapped to the hg38 human reference genome using BWA-MEM2 (parameters: -K 1000000000 -M, BWA-MEM2 reference). 21 Germline variants were called using DeepVariant, a deep learning-based variant caller. 22 We used VEP 23 and Slivar 24 to tag variants associated with different genes and to remove common variants reported in gnomAD.…”
Section: Exome Sequencingmentioning
confidence: 99%
“…20 Following trimming, paired-end reads were mapped to the hg38 human reference genome using BWA-MEM2 (parameters: -K 1000000000 -M, BWA-MEM2 reference). 21 Germline variants were called using DeepVariant, a deep learning-based variant caller. 22 We used VEP 23 and Slivar 24 to tag variants associated with different genes and to remove common variants reported in gnomAD.…”
Section: Exome Sequencingmentioning
confidence: 99%
“…Firstly, The variant calling process was conducted by SAMtools 'mpileup' command, and the single nucleotide polymorphisms (SNPs) were identified by BCFtools 'call' command [31]. Secondly, to validate the SNPs, we also employed Genome Analysis Toolkit (GATK, v4.0) to detect SNPs [30], we mapped the clean reads to the reference using BWA-MEM with default parameters [53]; the multiple tools ('Mark-Duplicates' , 'HaplotypeCaller' and 'VariantFiltration' , etc.,) implemented in GATK [30] were used to obtain high-quality SNPs, with strict filter settings "QD < 2.0 || MQ < 40.0 || FS > 60.0 || SOR > 3.0 || MQRank-Sum < -12.5 || 218 ReadPosRankSum < -8.0". For chloroplast, based on their SNP-calling results and gene annotation files, RNA editing sites were identified by using the REDO tool [54].…”
Section: Identification Of Rna Editing Sitesmentioning
confidence: 99%