2021
DOI: 10.3389/fdata.2021.725095
|View full text |Cite
|
Sign up to set email alerts
|

NPARS—A Novel Approach to Address Accuracy and Reproducibility in Genomic Data Science

Abstract: Background: Accuracy and reproducibility are vital in science and presents a significant challenge in the emerging discipline of data science, especially when the data are scientifically complex and massive in size. Further complicating matters, in the field of genomic-based science high-throughput sequencing technologies generate considerable amounts of data that needs to be stored, manipulated, and analyzed using a plethora of software tools. Researchers are rarely able to reproduce published genomic studies… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 44 publications
0
2
0
Order By: Relevance
“…When we applied Cactus to published datasets containing samples from C. elegans worms and human cells (Kolundzic et al, 2018), our pipeline confirmed the main findings of the study, provided additional observations, and generated new molecular hypotheses (Supplementary Materials), arguing that Cactus performs well in real-life scenarios. We believe that Cactus will be particularly useful for laboratories with limited time or bioinformatics expertise, and that it can help to reduce the reproducibility crisis seen in many omics studies (Ma et al, 2021) and science in general (Baker, 2016).…”
Section: Discussionmentioning
confidence: 99%
“…When we applied Cactus to published datasets containing samples from C. elegans worms and human cells (Kolundzic et al, 2018), our pipeline confirmed the main findings of the study, provided additional observations, and generated new molecular hypotheses (Supplementary Materials), arguing that Cactus performs well in real-life scenarios. We believe that Cactus will be particularly useful for laboratories with limited time or bioinformatics expertise, and that it can help to reduce the reproducibility crisis seen in many omics studies (Ma et al, 2021) and science in general (Baker, 2016).…”
Section: Discussionmentioning
confidence: 99%
“…Genomics datasets were processed as previously reported by the NGS Post-pipeline Accuracy and Reproducibility System (NPARS), a reproducible software infrastructure developed by our group (Ma et al, 2021). Three separate pathway analysis tools were utilized and all run using default parameters.…”
Section: Genomics Molecular Profilingmentioning
confidence: 99%