This repo includes projects completed as elements of my portfolio. See findings and summary walkthroughs for them them displayed on my website here.
The code for the following projects is found in their respective folders here.
- cfpb - This project includes a natural language processing analysis of topics discussed by consumers as part of their complaints to the Consumer Financial Protection Bureau.
- CFPB-Exploratory: Notebook that explores CFPB data and includes some data visualizations.
- CFPB-LDA: Notebook that walks through a Latent Dirichlet Allocation topic modeling analysis of CFPB complaints.
- CFPB-Company Topics: Examination of effect on company topics and product distributions given an enforcement action taken by the CFPB.
- tax_votes - This project combines data from the US IRS, and examines how tax filings, aggregated at the county level, predict the relationship with propensity to vote for Hillary Clinton in the 2016 presidential election. The most impactful variables in a random forest analysis of this relationship was the number of people with student loan debt in a county.
- he_research - This project is a clustering analysis of institutions of higher education, using data available from The National Center for Science and Engineering Statistics.