This public repo contains content for the Fall 2023 iteration of QSS20: Modern Statistical Computing
at Dartmouth College. The main components are slides and associated Jupyter notebook-based activities for practicing Python and related concepts. The sections and skills covered are as follows.
- Activity: 00_pandas_datacleaning_blank.ipynb
- Data: DC crime reports in 2020
- Concepts covered:
  - Aggregation using `groupby` and `agg`
  - Lambda functions within aggregation
  - Recoding variables using `np.where`
  - Recoding variables using `np.select`
  - Recoding variables using `map` and dictionary
(more to come throughout the quarter)
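
As a rough illustration of the concepts listed for the first activity, here is a minimal sketch on a tiny made-up table. The column names (`offense`, `ward`, `hour`) and all values are hypothetical stand-ins, not the schema of the DC crime reports data, and the notebook itself may approach these steps differently.

```python
import numpy as np
import pandas as pd

# Toy stand-in data: column names and values are hypothetical,
# NOT the actual schema of the DC crime reports file.
df = pd.DataFrame({
    "offense": ["theft", "theft", "robbery", "assault", "robbery"],
    "ward": [1, 2, 1, 2, 2],
    "hour": [9, 23, 14, 2, 20],
})

# Aggregation using groupby and agg, including a lambda function
# as one of the aggregations (share of reports that are thefts).
by_ward = df.groupby("ward").agg(
    n_reports=("offense", "count"),
    n_offense_types=("offense", "nunique"),
    share_theft=("offense", lambda s: (s == "theft").mean()),
)

# Recoding with np.where: a two-way split on a single condition.
df["is_night"] = np.where((df["hour"] >= 20) | (df["hour"] < 6), "night", "day")

# Recoding with np.select: several conditions checked in order;
# the first one that is True wins, otherwise the default is used.
conditions = [df["hour"] < 6, df["hour"] < 12, df["hour"] < 18]
labels = ["overnight", "morning", "afternoon"]
df["time_of_day"] = np.select(conditions, labels, default="evening")

# Recoding with map and a dictionary: a direct value-to-value lookup.
offense_category = {"theft": "property", "robbery": "violent", "assault": "violent"}
df["category"] = df["offense"].map(offense_category)

print(by_ward)
print(df)
```

Roughly speaking, `np.where` suits a single yes/no condition, `np.select` suits several ordered conditions, and `map` with a dictionary suits a one-to-one lookup on existing values.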