For data analysis to be reproducible, the data and code should be assembled in a way such that results (e.g. tables and figures) can be re-created. While the scientific community is by and large in agreement that reproducibility is a minimal standard by which data analyses should be evaluated, and a myriad of software tools for reproducible computing exist, it is still not trivial to reproduce someone’s (sometimes your own!) results without fiddling with unavailable analysis data, external dependencies, missing packages, out of date software, etc. In this workshop we will demonstrate a workflow for reproducible data science with R, R Markdown, Git, and GitHub. Experience with R is expected but familiarity with the other tools is not required.
Slides for the workshop can be found here.