2018
DOI: 10.3389/fgene.2018.00083
|View full text |Cite
|
Sign up to set email alerts
|

Adjusting for Batch Effects in DNA Methylation Microarray Data, a Lesson Learned

Abstract: It is well-known, but frequently overlooked, that low- and high-throughput molecular data may contain batch effects, i.e., systematic technical variation. Confounding of experimental batches with the variable(s) of interest is especially concerning, as a batch effect may then be interpreted as a biologically significant finding. An integral step toward reducing false discovery in molecular data analysis includes inspection for batch effects and accounting for this signal if present. In a 30-sample pilot Illumi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

5
65
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
8
1
1

Relationship

1
9

Authors

Journals

citations
Cited by 76 publications
(70 citation statements)
references
References 40 publications
5
65
0
Order By: Relevance
“…Additionally, we evaluated CpG sites with significantly lower methylation levels in familial melanoma than in healthy control samples and found 35 hypomethylated CpGs in both members of a family (Supplementary Table 2). Fifteen CpG sites showed hypomethylation in all 5 families, suggestive of a batch effect as has been described for 450k methylation arrays [20]. Of the 35 hypomethylated CpG sites, only 2 were located in established cancer-related genes: BRCA1, an established breast and ovarian cancer susceptibility gene, and ROS1, encoding a receptor tyrosine kinase with a possible oncogenic role in melanoma [21].…”
Section: Resultssupporting
confidence: 51%
“…Additionally, we evaluated CpG sites with significantly lower methylation levels in familial melanoma than in healthy control samples and found 35 hypomethylated CpGs in both members of a family (Supplementary Table 2). Fifteen CpG sites showed hypomethylation in all 5 families, suggestive of a batch effect as has been described for 450k methylation arrays [20]. Of the 35 hypomethylated CpG sites, only 2 were located in established cancer-related genes: BRCA1, an established breast and ovarian cancer susceptibility gene, and ROS1, encoding a receptor tyrosine kinase with a possible oncogenic role in melanoma [21].…”
Section: Resultssupporting
confidence: 51%
“…Raw intensity was read into Illumina Genome Studio software 2011.1, and background normalization was applied. Data processing was performed as described in Price et al [ 73 ], including sample quality checks, probe filtering, data normalization, and batch correction. This processing pipeline resulted in a final N = 442,355 cytosine-guanine dinucleotide (CpG) sites from the 450k array for analysis.…”
Section: Methodsmentioning
confidence: 99%
“…It is now considered to be the gold standard method for performing reproductible and accurate measurements as well as for confirming the results from high-throughput platforms [ 108 ]. Finally, another possible source of discrepancy is the batch effect, a common source of technical variation that may then be interpreted as a biologically significant finding [ 109 ]. Pairs of non-lesional and lesional skin samples might not always be processed simultaneously, thus creating a batch effect inflating the number of differently expressed genes.…”
Section: Transcriptome Profiling Analysis For a Better Understandimentioning
confidence: 99%