2012
DOI: 10.1101/gr.132480.111
|View full text |Cite
|
Sign up to set email alerts
|

SOAPindel: Efficient identification of indels from short paired reads

Abstract: We present a new approach to indel calling that explicitly exploits that indel differences between a reference and a se-quenced sample make the mapping of reads less efficient. We assign all unmapped reads with a mapped partner to their expected genomic positions and then perform extensive de novo assembly on the regions with many unmapped reads to resolve homozygous, heterozygous, and complex indels by exhaustive traversal of the de Bruijn graph. The method is implemented in the software SOAPindel and provide… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

5
88
0
1

Year Published

2013
2013
2022
2022

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 116 publications
(94 citation statements)
references
References 15 publications
5
88
0
1
Order By: Relevance
“…Some methods apply targeted assembly to validate the breakpoint calls (e.g., Manta [Chen et al 2016], SVMerge [Wong et al 2010], TIGRA [Chen et al 2014]). Windowed breakpoint assembly has been used (e.g., SOAPindel [Li et al 2013], DISCOVAR [Weisenfeld et al 2014]), but detection is limited to events smaller than the window size. Whole-genome de novo assembly has also been used for variant calling (e.g., Cortex [Iqbal et al 2012]), but its use has been limited, in part due to the computational expense compared to alignment-based approaches.…”
mentioning
confidence: 99%
“…Some methods apply targeted assembly to validate the breakpoint calls (e.g., Manta [Chen et al 2016], SVMerge [Wong et al 2010], TIGRA [Chen et al 2014]). Windowed breakpoint assembly has been used (e.g., SOAPindel [Li et al 2013], DISCOVAR [Weisenfeld et al 2014]), but detection is limited to events smaller than the window size. Whole-genome de novo assembly has also been used for variant calling (e.g., Cortex [Iqbal et al 2012]), but its use has been limited, in part due to the computational expense compared to alignment-based approaches.…”
mentioning
confidence: 99%
“…represented, while long are underrepresented [86,87], thus creating an accuracy dependence on a homo-polymer length. Most typical error for the pyrosequencing-type technologies is also indel.…”
Section: Platform-specific Biasesmentioning
confidence: 99%
“…Identifying indels from NGS is known to be very challenging [87], because 'indel by itself interferes with accurate mapping' . To map indels accurately, Pair-End (PE) information is utilised [88].…”
Section: Platform-specific Biasesmentioning
confidence: 99%
“…This method has been applied in the 1000 Genomes Project and provided high quality breakpoint information. In the literature, we have also found other packages adopting the local assembly method, shown in Table 1, such as BreaKmer [36], HYDRA [37], SOAPindel [38] and NovelSeq [39].…”
Section: Assemblymentioning
confidence: 99%