Spring 2022 Workshop Series: Reproducibility and Data Science Skills

When

All Day, Feb. 1 to April 7, 2022

The CCT Data Science team is offering a workshop series during spring semester 2022 to help UA researchers improve their reproducibility and data science skills. This opportunity is available to all researchers in ALVSCE.

 

Objectives

  • Become comfortable with sharing and collaborating on research products on GitHub using git
  • Learn how approaches for how to better organize research projects and their corresponding code, including intermediate R skills
  • Be able to document projects in several ways, including Rmarkdowns and READMEs

This workshop series will not cover statistical methods or modeling, mathematics, introductory R, or any other programming languages besides R. These topics can be taught separately by request.

 

You should take this workshop if:

  • You are a graduate student who has collected their data and is ready to analyze and document it, or a researcher or postdoc who is looking to further skills in the areas outlined above
  • You are working on research projects that could benefit from improved collaboration, documentation, and organization
  • You are comfortable with intro R skills
  • You do not have any prior experience with version control, or want a stronger foundation

Topics for each session are listed below, and you can look at more detailed lesson materials. This workshop series has been taught previously and successfully to a group of ESA SEEDS researchers.

Eight workshop sessions will meet twice weekly through the month of February and will be hands-ons walkthroughs of content. After, participants will be able to apply new skills to their research projects. We will have a followup session for updates and problem solving, and then a final session the first week of April for short, no-slide demonstrations of progress.

Schedule
Date & time topic With Link to material
Tuesday February 1, 1-3pm Shell scripting
Thursday February 3, 1-3pm Version control with git
Tuesday February 8, 1-3pm Sharing with GitHub
Thursday February 10, 1-3pm Project management and coding best practices
Tuesday February 15, 1-3pm Data manipulation
Thursday February 17, 1-3pm Reproducibility of R code I
Tuesday February 22, 1-3pm Reproducibility of R code II and plotting I
Thursday February 24, 1-3pm Plotting II and documentation
Thursday March 24, 1-3pm Project updates and help session
Thursday April 7, 1-3pm Project demonstrations (5 minutes, no slides)

If you are interested, please fill out this application form! It should take no more than five minutes and requires no prior preparation. We will start notifying applicants on December 15, 2021, and then rolling admissions until the application closes on January 17, 2022. In order to provide one-on-one help and create a collegial atmosphere, we will be limiting the total number of participants to 15 for this workshop series.

If you have any questions, please reach out to the instructors Jessica Guo and Kristina Riemer!