2019
DOI: 10.1101/514497
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Systematic analysis of dark and camouflaged genes: disease-relevant genes hiding in plain sight

Abstract: Background The human genome contains ‘dark’ gene regions that cannot be adequately assembled or aligned using standard short-read sequencing technologies, preventing researchers from identifying mutations within these gene regions that may be relevant to human disease. Here, we identify regions that are ‘dark by depth’ (few mappable reads) and others that are ‘camouflaged’ (ambiguous alignment), and we assess how well long-read technologies resolve these regions. We further present an algorithm to resolve most… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
2
0

Year Published

2019
2019
2020
2020

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 113 publications
0
2
0
Order By: Relevance
“…Targets disproportionately affected by this limitation include families of gene and pseudogene homologs (Ebbert et al, 2019). Other genomic regions are intractable to analysis due to difficulties caused by the sequencing chemistry itself.…”
Section: Introductionmentioning
confidence: 99%
“…Targets disproportionately affected by this limitation include families of gene and pseudogene homologs (Ebbert et al, 2019). Other genomic regions are intractable to analysis due to difficulties caused by the sequencing chemistry itself.…”
Section: Introductionmentioning
confidence: 99%
“…“Population-scale” sequencers, capable of performing cheap whole genome sequencing (WGS) have been used extensively by large research programs, but they are now also being deployed in diagnostic practice [1]. Despite this shift towards WGS, which simplifies laboratory workflows by eliminating the need for target enrichment, a number of clinically relevant genomic regions are known to be intractable to analysis by short-read-sequencing [2].…”
mentioning
confidence: 99%