2019
DOI: 10.1093/bib/bbz124
|View full text |Cite
|
Sign up to set email alerts
|

Current RNA-seq methodology reporting limits reproducibility

Abstract: Ribonucleic acid sequencing (RNA-seq) identifies and quantifies RNA molecules from a biological sample. Transformation from raw sequencing data to meaningful gene or isoform counts requires an in silico bioinformatics pipeline. Such pipelines are modular in nature, built using selected software and biological references. Software is usually chosen and parameterized according to the sequencing protocol and biological question. However, while biological and technical noise is alleviated through replicates, biase… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
57
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
2
2

Relationship

1
7

Authors

Journals

citations
Cited by 74 publications
(57 citation statements)
references
References 25 publications
0
57
0
Order By: Relevance
“…Lack of reproducibility for large dataset analysis is a chronic issue (Łabaj & Kreil, 2016;Simoneau et al, 2019). While some programs are commonly used for differential expression analysis (edgeR, DESeq2, limma), we felt justified to include a T test method, as it increased transparency and showed that a bulk of the DEGs identified by the two commonly used programs were supported by simple T tests.…”
Section: Rna-seq Time-series Experimentsmentioning
confidence: 99%
“…Lack of reproducibility for large dataset analysis is a chronic issue (Łabaj & Kreil, 2016;Simoneau et al, 2019). While some programs are commonly used for differential expression analysis (edgeR, DESeq2, limma), we felt justified to include a T test method, as it increased transparency and showed that a bulk of the DEGs identified by the two commonly used programs were supported by simple T tests.…”
Section: Rna-seq Time-series Experimentsmentioning
confidence: 99%
“…While the format of this study does not let us provide direct recommendations regarding what methodological choices to prefer, we can highlight choices that we would not recommend. While Tophat2 is still one of the primary aligner software in use in the literature 17 , we cannot recommend its usage due to its bias towards genes with processed pseudogenes. Furthermore, HISAT2 is a reimplementation of TopHat2, correcting some issues that have become apparent through the years.…”
Section: Discussionmentioning
confidence: 99%
“…We performed RNA-seq using only methods that rely on genome-based alignment, and software that are restricted to a single methodological step. We kept default parameters for most of the options, as to mimic what is being done in the literature 17 while Ensembl 98 uses GRCh38.p13 genome. Because the primary assembly of both these reference genome versions is the same, and because we restricted our studied genes to the primary assembly, only GRCh38.p13 was used.…”
Section: Rna-seq Methodsologymentioning
confidence: 99%
“…Likewise, Simoneau and Scott [8] described information on genome assembly and annotation as "essential" for describing the computational analysis of RNA-seq data, and contended that, "no study using RNA-seq should be published without these methodological details." Simoneau and co-authors have recently performed a detailed analysis of hundreds of published RNA-seq studies, finding that the majority did not include annotation source and release information, thus hindering reproducible analysis [9].…”
Section: Introductionmentioning
confidence: 99%