Welcome
This page contains important information about the Sixth BSD-qBio Boot Camp, which will be held online from Sept 13-18, 2020.
More than 100 incoming graduate students from the different programs in the Division of Biological Sciences at the University of Chicago will participate.
On this page, you will find instructions on how to prepare your laptop so that it’s ready for the boot camp.
You should also take a look at the data and the code we will explore in the workshops and tutorials.
You can see the schedule of the boot camp, and find the contact information for the directors.
Contacts
For any issue, question, or comment, please contact the course directors via Discord:
Instructors
- Stefano Allesina (webpage)
- Peter Carbonetto (webpage)
- Lin Chen (webpage)
- Mengjie Chen (webpage)
- Aly Khan (webpage)
- Samantha Riesenfeld (webpage)
- Matthew Stephens (webpage)
TAs
- Jill Rosenberg
- Graham Smith
- Zepeng (Phoenix) Mu
- Katie Aracena
- Brendan MacNabb
- Dylan Sosa
- Evan Kiefl
- Kate Farris
- Neil Sheth
Schedule
The qBio boot camp will be, as the name implies, quite intense. We are going to have Tutorials (short primers on a given topic) and Workshops (discipline-specific, hands-on activities).
Here’s the general schedule.
You can also browse the schedule by group.
Computing tutorials
To accommodate the diverse background of our students, we have created two tracks for the computing tutorials.
- Basic Computing I and II: dedicated to new users who are not familiar with R or programming in general. These sessions guide students step by step, introducing R syntax and showing how to write well-organized code for data analysis and scientific research.
- Advanced Computing I and II: dedicated to experienced R users, these will focus on manipulating large data sets, plotting, and the use of regular expressions.
You will need to decide which track’s sessions to attend on Day 1 of the boot camp. After Day 1, you will move through the material in teams with mixed skill levels. Choose which track to join for Day 1 by consulting the lecture materials and making sure that the content is at the right level for you:
- Basic Computing I (web, pdf)
- Basic Computing II (web, pdf)
- Advanced Computing: Read the challenges here: (Data Jujutsu)
Special preparation for Advanced Computing: You should work through the Advanced Computing preparatory material before the session begins. Link to preparatory material
Preparing your laptop
We are going to start working right away. Therefore, it is very important you prepare your laptop for the boot camp before the first session on Monday. This will take you about one hour, so schedule accordingly.
You will work on your laptop all day long. If you don’t have a laptop, please contact the course directors immediately.
Installation of R and R packages
- Install R: go to this page, download the file corresponding to your platform, and install it. (Here’s a video explaining how to install R and RStudio in Windows; here for Mac OSX)
- Install RStudio: once R is installed, go to this page, download the installer for your operating system (section Installers for Supported Platforms), and install the software.
- Once R and RStudio are installed, open RStudio and install the following packages:
- devtools
- tidyverse
- knitr
- workflowr
- ggthemes
- cowplot
- Rtsne
- BiocManager
- ggseqlogo
- pheatmap
You can find instructions on how to install R packages in RStudio here.
- Other packages: After the package installs above, two more sets of packages need to be installed using special installers within R. First, open RStudio and in the Console type library(devtools), hit Return (or Enter), and then type install_github("jdstorey/qvalue"). This will install the package qvalue, which is needed for one of the tutorials. Second, type library(BiocManager), hit Return (or Enter), and then type BiocManager::install(c("airway", "Rsamtools", "Rsubread", "DESeq2", "vsn", "org.Hs.eg.db", "GenomicFeatures", "clusterProfiler")). This should install 8 packages that will be used for the RNAseq workshop.
- UNIX Emulator: If you are using Windows, you need to install a UNIX emulator. We suggest downloading the version control software Git, because it ships with a small emulator (Git Bash). Simply go to this page and follow the instructions.
- Git
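The package installation steps above can also be run as a short script from the RStudio Console. This is only a sketch, not the official setup procedure: the package names come from the lists above, while the helper names `missing_packages` and `install_bootcamp_packages` are made up for illustration. The install calls are wrapped in a function so that nothing is downloaded until you call it.

```r
# CRAN packages listed in the instructions above
cran_packages <- c("devtools", "tidyverse", "knitr", "workflowr", "ggthemes",
                   "cowplot", "Rtsne", "BiocManager", "ggseqlogo", "pheatmap")

# Bioconductor packages used in the RNAseq workshop
bioc_packages <- c("airway", "Rsamtools", "Rsubread", "DESeq2", "vsn",
                   "org.Hs.eg.db", "GenomicFeatures", "clusterProfiler")

# Hypothetical helper: returns the entries of `pkgs` that are not installed yet
missing_packages <- function(pkgs) {
  setdiff(pkgs, rownames(installed.packages()))
}

# Hypothetical wrapper: installs whatever is still missing, in the same
# order as the written instructions (CRAN, then qvalue, then Bioconductor)
install_bootcamp_packages <- function() {
  to_install <- missing_packages(cran_packages)
  if (length(to_install) > 0) {
    install.packages(to_install)
  }
  if (!requireNamespace("qvalue", quietly = TRUE)) {
    devtools::install_github("jdstorey/qvalue")
  }
  to_install <- missing_packages(bioc_packages)
  if (length(to_install) > 0) {
    BiocManager::install(to_install)
  }
  invisible(NULL)
}
```

Calling `install_bootcamp_packages()` starts the installation. Afterwards, `missing_packages(c(cran_packages, bioc_packages))` should return an empty character vector if everything installed correctly.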
Downloading the data
It is very important to download the data before the workshop, as the files are quite large. (Warning! The repo is >200MB!)
All you need to do is to download the repository containing all the boot camp lectures and data.
We will download the repository using GitKraken (you can alternatively use command-line git if you are already familiar with it).
1. Open GitKraken (see download link above)
2. Log in with your GitHub account (see instructions and link above)
3. “Clone a Repo” (in File menu)
4. “Clone with URL”
5. “Where to Clone:” Browse to the folder in which you want to keep your repository (your home directory is fine)
6. “URL”: Paste https://github.com/jnovembre/BSD-QBio6.git
7. “Clone the Repo!” (this step will take a few minutes)
Now if you go to the folder you chose in step 5, you’ll see the repository!
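If you prefer the command-line git route mentioned above, the same clone can be triggered from the R Console. The sketch below assumes that git is installed and on your PATH. The destination folder (a BSD-QBio6 folder in your home directory) and the function name `clone_bootcamp_repo` are illustrative choices, not part of the course instructions.

```r
# Repository URL from the instructions above
repo_url <- "https://github.com/jnovembre/BSD-QBio6.git"

# Assumed destination: a folder named BSD-QBio6 in your home directory
clone_dir <- file.path(path.expand("~"), "BSD-QBio6")

# Hypothetical helper: clones the repository unless the folder already exists
clone_bootcamp_repo <- function(url = repo_url, dest = clone_dir) {
  if (dir.exists(dest)) {
    message("Folder already exists, skipping clone: ", dest)
    return(invisible(dest))
  }
  # system2() runs the external git program and returns its exit status
  status <- system2("git", c("clone", url, shQuote(dest)))
  if (status != 0) {
    stop("git clone failed with exit status ", status)
  }
  invisible(dest)
}
```

Running `clone_bootcamp_repo()` downloads the repository. Calling it again does nothing, because the folder is already there.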
Programming Challenges
During the boot camp, the 12 groups of students will compete in 5 programming challenges. Here are the links to the webpages where the groups should post their solutions (one answer per group, please):
- Submit your answer to Programming Challenge 1 (Basic Programming I)
- Submit your answer to Programming Challenge 2 (Basic Programming II)
- Submit your answer for the tutorial on Reproducibility
- Submit your answer for the tutorial on Data Visualization
- Submit your answer for the tutorial on Stats for large data
Notes
This material is based upon work supported by the National Science Foundation under Grant Number 1734818.
Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.