2019
DOI: 10.31234/osf.io/8xzqy
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A Reproducible Data Analysis Workflow with R Markdown, Git, Make, and Docker

Abstract: In this tutorial, we describe a workflow to ensure long-term reproducibility of R-based data analyses. The workflow leverages established tools and practices from software engineering. It combines the benefits of various open-source software tools including R Markdown, Git, Make, and Docker, whose interplay ensures seamless integration of version management, dynamic report generation conforming to various journal styles and full cross-platform and long-term computational reproducibility. The workflow ensures m… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
24
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
2
2
1
1

Relationship

1
5

Authors

Journals

citations
Cited by 17 publications
(24 citation statements)
references
References 20 publications
0
24
0
Order By: Relevance
“…See liftr.me for further details. See also Peikert and Brandmaier (2019) for a suggested comprehensive workflow, including version-controlled data management, dependency management using Makefiles, containerized computing environments using Docker, and dynamic document generation using RMarkdown.…”
Section: Containerization Beyond Computing Environmentsmentioning
confidence: 99%
“…See liftr.me for further details. See also Peikert and Brandmaier (2019) for a suggested comprehensive workflow, including version-controlled data management, dependency management using Makefiles, containerized computing environments using Docker, and dynamic document generation using RMarkdown.…”
Section: Containerization Beyond Computing Environmentsmentioning
confidence: 99%
“…The stetup with Docker vignette explains how to instantiate this build. For more information on this topic, Peikert and Brandmaier [28] describe a Docker-based workflow for computational reproducibility that is conceptually and practically compatible with WORCS.…”
Section: Dependency Managementmentioning
confidence: 99%
“…It is thus prudent to consider separately licensing specific project resources, particularly when a project contains original source code (i.e., you write your own functions). For a more extensive discussion of appropriate licensing, see Peikert and Brandmaier [28], Stodden et al [38], and the website choosealicense.com.…”
Section: Compatibility With Other Standards For Open Sciencementioning
confidence: 99%
See 2 more Smart Citations