2022
DOI: 10.1186/s12859-022-04775-y
|View full text |Cite
|
Sign up to set email alerts
|

Batch effect detection and correction in RNA-seq data using machine-learning-based automated assessment of quality

Abstract: Background The constant evolving and development of next-generation sequencing techniques lead to high throughput data composed of datasets that include a large number of biological samples. Although a large number of samples are usually experimentally processed by batches, scientific publications are often elusive about this information, which can greatly impact the quality of the samples and confound further statistical analyzes. Because dedicated bioinformatics methods developed to detect un… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
17
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 26 publications
(17 citation statements)
references
References 31 publications
0
17
0
Order By: Relevance
“…An additional point to observe in quality assessment are batch effects. Batch effects arise from differences between samples that are not rooted in the experimental design and can have various sources, spanning from different handlers or experiment locations to different batches of reagents and even biological artifacts such as growth location [44] . Various methods have been developed to detect and/or remove batch effects in genomics data, particularly RNA-seq data.…”
Section: Differential Expression Analysis (Pipeline)mentioning
confidence: 99%
“…An additional point to observe in quality assessment are batch effects. Batch effects arise from differences between samples that are not rooted in the experimental design and can have various sources, spanning from different handlers or experiment locations to different batches of reagents and even biological artifacts such as growth location [44] . Various methods have been developed to detect and/or remove batch effects in genomics data, particularly RNA-seq data.…”
Section: Differential Expression Analysis (Pipeline)mentioning
confidence: 99%
“…This highlights the reactivity of phosphorylation activity and the requirement for robust sample preparation protocols to create reliable datasets. However, these issues also greatly impact other omics methodologies such as single-cell RNA-Seq, RNA-Seq and CHIP-Seq [127][128][129].…”
Section: Data Variabilitymentioning
confidence: 99%
“…Because biological systems are complex, drug discovery experiments often possess conditional features whose importance is realized only after subsequent measurements. Capturing information such as a cell line, associated genetic engineering, and time of measurements can enable downstream AI-based detection of systematic measurement biases, such as batch effects (Sprang et al, 2022), and meta-analyses across experiments. Metadata can also include interpretations of data-what were the hits in an experiment and what criteria were employed in their selection?…”
Section: Requirements and Challenges For Using Aimentioning
confidence: 99%