2018
DOI: 10.1186/s12864-018-4766-y
|View full text |Cite
|
Sign up to set email alerts
|

BoostMe accurately predicts DNA methylation values in whole-genome bisulfite sequencing of multiple human tissues

Abstract: BackgroundBisulfite sequencing is widely employed to study the role of DNA methylation in disease; however, the data suffer from biases due to coverage depth variability. Imputation of methylation values at low-coverage sites may mitigate these biases while also identifying important genomic features associated with predictive power.ResultsHere we describe BoostMe, a method for imputing low-quality DNA methylation estimates within whole-genome bisulfite sequencing (WGBS) data. BoostMe uses a gradient boosting … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

1
24
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
10

Relationship

0
10

Authors

Journals

citations
Cited by 36 publications
(25 citation statements)
references
References 69 publications
1
24
0
Order By: Relevance
“…We showed that at the recommended sequencing depth of 30x, the technical variability for WGBS is two to three fold higher than that for the methylation array. This lower precision observed is likely to be due to insufficient reads/counts to generate reliable quantification 48 . The precision of WGBS improves with increased sequencing depth as observed from our downsampling analyses, and a minimum coverage of 100x is necessary to achieve a level of precision that is broadly similar to that observed in methylation array.…”
Section: Discussionmentioning
confidence: 99%
“…We showed that at the recommended sequencing depth of 30x, the technical variability for WGBS is two to three fold higher than that for the methylation array. This lower precision observed is likely to be due to insufficient reads/counts to generate reliable quantification 48 . The precision of WGBS improves with increased sequencing depth as observed from our downsampling analyses, and a minimum coverage of 100x is necessary to achieve a level of precision that is broadly similar to that observed in methylation array.…”
Section: Discussionmentioning
confidence: 99%
“…XGBoost was successfully applied in hundreds of recent studies to predict, e.g. host-pathogen protein–protein interactions (16), microRNA disease association (17) and DNA methylation (18). Several studies including our own previous paper showed that XGBoost gives the best performance if compared with a number of known machine learning methods (see e.g.…”
Section: Description Of the Databasementioning
confidence: 99%
“…The XGBoost model has achieved excellent performance in many fields of medical research. [23][24][25][26] Currently, no researchers have used the XGBoost model to predict the time series data of human brucellosis.…”
mentioning
confidence: 99%