Skip to content

Latest commit

 

History

History
17 lines (13 loc) · 867 Bytes

README.md

File metadata and controls

17 lines (13 loc) · 867 Bytes

QSS20 activities

This public repo has content for the Fall 2023 iteration of QSS20: Modern Statistical Computing at Dartmouth College. The main components are slides and associated Jupyter notebook-based activities to practice Python or other concepts. The sections and skills covered are as follows.

Data wrangling and visualization

Introduction to pandas for data wrangling

  • Activity: 00_pandas_datacleaning_blank.ipynb
  • Data: DC crime reports in 2020
  • Concepts covered:
    • Aggregation using groupby and agg
    • Lambda functions within aggregation
    • Recoding variables using np.where
    • Recoding variables using np.select
    • Recoding variables using map and dictionary

(more to come throughout the quarter)