This project is aimed at developing code to scrape, analyze, and visualize tweets for corporate accounts(companies).
I am currently applying to various companies. For each job I much do various research to see if I think it might be a good fit for me. In the process of this research typically I will browse their social media platforms. Thus, I have automated this process to scrape, compile, and analyze Twitter tweets from each company I am seriously considering.
Python Libraries
- Twint (python library) to scrape Twitter tweets (currently only able to install using the dockerfile)
- Pandas to organize the tweets
- Matplotlib / Seaborn to visualize the tweets
- Scikit-learn to organize parts of the data
- Wordcloud: to create visual wordclouds of the mentions and hashtags
- FPDF: to compile it all into an easy to consume PDF Analysis Report
- Scrape tweets using twint
- Organize data using various pandas and scikit-learn techniques
- Plot data points of interest to see some interesting trends in tweets
- Compare tweets with the top 3 mentions within all of the companies tweets
- Export charts to a PDF created within python
- More...