Welcome to the PSU Bioinformatics Data Reproducibility Bootcamp!
Location and General Instructions
The boot camp will be held at the Chemical and Biomedical Engineering Building, Room 001.
If you want to join us a little earlier, we'll have breakfast between 9:00-9:30 AM on the third floor of the Huck Institutes for the Life Sciences, at the Life Sciences Bridge. Lunch will also be provided at the same venue.
Please don't forget to register on the sign-up sheet when you arrive.
If you have any trouble navigating through campus, please e-mail Venitha at email@example.com.
Funding and Support
Computational resources are provided by The Institute for Computational and Data Sciences (ICDS).
The boot camp was first conceived and supported in Jun 2016 by the Administrative Supplement to NIGMS Predoctoral Training Grants (PA-15-136)
In this session we will discuss the reproducibility crisis in science and we will go over materials that will help us make our research reproducible.
We will have a look at StackEdit (an online markdown-based tool for lab notebook), we will learn how to use GitHub and Git for version control, as well as Galaxlay for bioinformatics analysis.
This session will introduce some basic Linux commands, how to use the computing cluster at PSU, how to set up a conda environment and how to code on Jupyter notebooks.
In this session, we will go through common bioinformatics resources and show examples of some useful tools (e.g. GNU Parallel). We will also utilize an ATAC-seq analysis pipeline as a base to introduce Snakemake, a popular workflow management tool. Finally, we will simply introduce how to use Pytest in Python.