2020
DOI: 10.1371/journal.pone.0229763
|View full text |Cite
|
Sign up to set email alerts
|

Evaluation of pre-processing on the meta-analysis of DNA methylation data from the Illumina HumanMethylation450 BeadChip platform

Abstract: Introduction Meta-analysis is a powerful means for leveraging the hundreds of experiments being run worldwide into more statistically powerful analyses. This is also true for the analysis of omic data, including genome-wide DNA methylation. In particular, thousands of DNA methylation profiles generated using the Illumina 450k are stored in the publicly accessible Gene Expression Omnibus (GEO) repository. Often, however, the intensity values produced by the BeadChip (raw data) are not deposited, therefore only … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

2
18
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 17 publications
(20 citation statements)
references
References 27 publications
2
18
0
Order By: Relevance
“…We have obtained that harmonization can increase the classification accuracy by up to 20%. This is fully consistent with the original paper, which proposed the harmonization method regRCPqn [44]. When the preprocessing of training and test data is the same, harmonization has almost no effect on the final classification accuracy.…”
Section: Discussionsupporting
confidence: 88%
See 1 more Smart Citation
“…We have obtained that harmonization can increase the classification accuracy by up to 20%. This is fully consistent with the original paper, which proposed the harmonization method regRCPqn [44]. When the preprocessing of training and test data is the same, harmonization has almost no effect on the final classification accuracy.…”
Section: Discussionsupporting
confidence: 88%
“…Ref. [44] developed an approach to systematically assess the impact of different preprocessing methods on meta-analysis. Its main advantage is the possibility of harmonization of the newly introduced datasets that does not require corrections to the previously analyzed datasets, employed for training the machine learning model.…”
Section: Introduction 1backgroundmentioning
confidence: 99%
“…Further, methylation data are stable and reproducible and offer a large amount of publicly available data, thanks to the cost-effectiveness of methylation arrays (Illumina Human Infinium Beadchips 27k, 450k and now 850k). This publicly available abundance of data, in turn, enables meta-analyses to advance discovery, thanks to numerous (ad hoc) preprocessing approaches [10].…”
Section: Introductionmentioning
confidence: 99%
“…We tested this using reliability metrics derived from analysis of technical replicates and found no evidence that hvCpGs are driven by technically unreliable probes. However, we note that better adjustment for technical artifacts within datasets 111 and the addition of further datasets would likely lead to the identification of more hvCpGs.…”
Section: Discussionmentioning
confidence: 96%