2021
DOI: 10.1101/2021.01.12.426440
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

DISMIR: a deep learning-based cancer-detection method by integrating DNA sequence and methylation information of individual cell-free DNA reads

Abstract: Detecting cancer signals in cell-free DNA (cfDNA) high-throughput sequencing data is emerging as a novel non-invasive cancer detection method. Due to the high cost of sequencing, it is crucial to make robust and precise prediction with low-depth cfDNA sequencing data. Here we propose a novel approach named DISMIR, which can provide ultrasensitive and robust cancer detection by integrating DNA sequence and methylation information in plasma cfDNA whole genome bisulfite sequencing (WGBS) data. DISMIR introduces a… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(5 citation statements)
references
References 40 publications
0
5
0
Order By: Relevance
“…The α-value of an individual sequencing read captures the pervasive nature of methylation, therefore allowing us to "purify" tumor DNA reads from background reads for enhancing the signal-to-noise ratio. A recent study further used the α-value concept for identifying liver cancer methylation markers 28 . Here we developed a general framework for using read-level α-values to robustly identify markers from impure tissue samples or even cfDNA plasma samples.…”
Section: Read-based Discovery Of Methylation Markersmentioning
confidence: 99%
“…The α-value of an individual sequencing read captures the pervasive nature of methylation, therefore allowing us to "purify" tumor DNA reads from background reads for enhancing the signal-to-noise ratio. A recent study further used the α-value concept for identifying liver cancer methylation markers 28 . Here we developed a general framework for using read-level α-values to robustly identify markers from impure tissue samples or even cfDNA plasma samples.…”
Section: Read-based Discovery Of Methylation Markersmentioning
confidence: 99%
“…After all, the method for finding markers was optimized for CpG count data and then translated almost exactly to read average data. Read averages may, however, require a completely different approach for finding markers, such as the switching reads defined by Li et al (12). An adequate set of differential regions not only improves model performance but also allows for targeted sequencing of these regions only, for example using RRBS, and can thus reduce the sequencing cost (20).…”
Section: Discussionmentioning
confidence: 99%
“…A read average is calculated by dividing the number of methylated CpG sites by the total number of CpG sites on a read, where only reads with three or more CpG sites are used. This heuristic is adopted from previous methods (11,12). The read average is then rounded to the closest value i. E.g.…”
Section: Read-based Approachmentioning
confidence: 99%
See 2 more Smart Citations