Erika Austhof edited this page Sep 28, 2022 · 19 revisions

Welcome to the LearningDataScience wiki!

This wiki is for the Roots for Resilience Research Assistantship. Learn more about Roots for Resilience from the UArizona Data Science Institute.

Tasklist

  • Determine a project to use for the R for Reproducibility workshop, maybe the Rain analysis?
  • [ x ] Write a function to merge PRISM data, maybe a loop?
  • See if python-climate-indices would be a good option for getting drought data for thesis project
  • [ x ] Cyverse Intro Training
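For the PRISM merge task above, one possible approach is sketched below in base R. It assumes the PRISM extracts are saved as CSV files with identical columns in a single folder; the function name `merge_csv_folder`, the folder name, and the column names `date` and `ppt_mm` are hypothetical and only for illustration. Instead of an explicit loop, it uses `lapply()` to read each file and `rbind()` to stack them.

```r
# Hypothetical helper: read every CSV in a folder and stack the rows
# into one data frame. Assumes all files share the same columns.
merge_csv_folder <- function(folder, pattern = "\\.csv$") {
  files <- list.files(folder, pattern = pattern, full.names = TRUE)
  if (length(files) == 0) {
    stop("No files matching the pattern found in: ", folder)
  }
  tables <- lapply(files, function(path) {
    dat <- read.csv(path, stringsAsFactors = FALSE)
    dat$source_file <- basename(path)  # record which file each row came from
    dat
  })
  do.call(rbind, tables)
}

# Small demo with made-up values written to a temporary folder
demo_dir <- file.path(tempdir(), "prism_demo")
dir.create(demo_dir, showWarnings = FALSE)
write.csv(data.frame(date = "2022-01-01", ppt_mm = 1.2),
          file.path(demo_dir, "a.csv"), row.names = FALSE)
write.csv(data.frame(date = "2022-01-02", ppt_mm = 0.0),
          file.path(demo_dir, "b.csv"), row.names = FALSE)

merged <- merge_csv_folder(demo_dir)
```

Keeping the source file name as a column makes it easier to trace a row back to its original download if something looks off after merging.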

Updates

  • 9/28/2022: This week we discussed project management in the FOSS workshops. I really liked the idea of a binding document for a project that details expectations, who is doing what, and the goals and objectives of the project. I was really intimidated by the research object; it seems like a massive project that would take a long time to complete. I'm hoping the FOSS workshops will give us a better understanding of how to do this type of work in the future.
    • Issues: This week I'm having issues with getting all the health data I need for my project, working through data use agreements, IRB applications, and working with partners to get data at a level that makes sense. We are making slow progress!
    • Takeaways: I merged all of the data for my project with CO and cleaned up my R code so that my file paths are relative, rather than hard-coded to my own folder. I am planning to turn this R project into a GitHub repository so my collaborators can comment on my code as well!
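As a minimal sketch of the relative-path cleanup mentioned in the takeaways above: the folder and file names here are hypothetical, and the example assumes the working directory is the project root (which is the case when the project is opened through its RStudio project file).

```r
# Hard-coded path (hypothetical): only works on one person's computer
# dat <- read.csv("C:/Users/me/Documents/project/data/health.csv")

# Relative path: built from the project root, so it works for any
# collaborator who has the same project folder structure
data_path <- file.path("data", "health.csv")

# Read the file only if it is present, so the sketch runs anywhere
if (file.exists(data_path)) {
  dat <- read.csv(data_path, stringsAsFactors = FALSE)
}
```

`file.path()` joins the pieces with a forward slash, which R accepts on Windows, macOS, and Linux.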

Previous Updates

  • 9/21/2022: This week we started the FOSS workshops and discussed open science. I also attended the Reproducibility in R workshops where we learned about branches and forking in GitHub and overall project management using git and GitHub.
    • Issues: This week I am having issues with pulling in multiple datasets offline and merging them in R. Since I'm working with large climate datasets, I've figured out how to do it manually, but it takes a while, and I'd like to learn how to do it more efficiently.
    • Takeaways: I am seeing how I could use R for my thesis work more clearly. I like the idea of setting up a private repository for my collaborators across states so that we can work together on code and data.
  • 9/14/2022: This week I set up my notebook for the R4R cohort, and this is the first edition of the notebook for the 9/14 deadline. Feel free to navigate the pages on the right, and see updates down below for my learning progress so far.
  • 9/8/2022: This week we started the pre-workshop for FOSS, and learned about the differences between open science and FAIR principles. Check out the associated new pages for: Docker, FAIR principles, and FOSS workshops for learning activities.
    • Issues: I am still unsure how I will use Docker in my own work, but am excited to learn more as FOSS continues.
    • Takeaways: I was excited to learn about data standards in public health, and I am getting more comfortable with navigating GitHub!
  • 9/1/2022: This week I reviewed the lectures from previous sessions and got my computer set up for RStudio and GitHub.
    • Issues: R programming is still new to me, but I am working through a self-paced course to help with coding skills.