This project classifies movie ROI (return on investment) rates and predicts movie revenue, using visualization and machine learning algorithms on the online IMDb and TMDb movie datasets covering 1980 to 2017.
Link to the full project: https://drive.google.com/drive/folders/1a_Ot_xloD-2mZQhzNvMSPQK6MGsnqbvb?usp=sharing
Required datasets:
- name.tsv
- title.tsv
- IMDb movies.csv
- movies_metadata.csv
- cast.csv
- IMDb ratings.csv
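
Before running anything, it may help to confirm that all of the files listed above are in place. The following is a minimal sketch, not part of the project itself. It assumes the datasets sit in the same directory as the notebooks (the README does not say where they belong), and `missing_datasets` is a hypothetical helper name.

```python
from pathlib import Path

# File names copied from the "Required datasets" list above.
REQUIRED_DATASETS = [
    "name.tsv",
    "title.tsv",
    "IMDb movies.csv",
    "movies_metadata.csv",
    "cast.csv",
    "IMDb ratings.csv",
]


def missing_datasets(data_dir="."):
    """Return the required dataset files that are not found in data_dir.

    Assumption: the datasets live directly inside data_dir. Adjust the
    path if the notebooks expect them elsewhere.
    """
    base = Path(data_dir)
    return [name for name in REQUIRED_DATASETS if not (base / name).is_file()]


if __name__ == "__main__":
    missing = missing_datasets(".")
    if missing:
        print("Missing datasets:", ", ".join(missing))
    else:
        print("All required datasets are present.")
```

If the notebooks read the data from another folder, pass that folder to `missing_datasets` instead of `"."`.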
Please run the Jupyter Notebook files in exactly this order:
- extract.ipynb
- data.ipynb
- statistic.ipynb
- modeling.ipynb
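
The notebooks can also be run in the order above from a script rather than by hand. This is only a sketch under stated assumptions: it assumes `jupyter` and `nbconvert` are installed and on the PATH, that the script is started from the directory containing the notebooks, and that each notebook runs without manual input. `nbconvert_command` and `run_all` are hypothetical helper names, not part of the project.

```python
import subprocess

# Notebook names in the order given above.
NOTEBOOKS = [
    "extract.ipynb",
    "data.ipynb",
    "statistic.ipynb",
    "modeling.ipynb",
]


def nbconvert_command(notebook):
    """Build the command that executes one notebook and saves it in place."""
    return [
        "jupyter", "nbconvert",
        "--to", "notebook",
        "--execute",
        "--inplace",
        notebook,
    ]


def run_all(notebooks=NOTEBOOKS):
    """Execute each notebook in order, stopping at the first failure."""
    for notebook in notebooks:
        print("Running", notebook)
        # check=True raises CalledProcessError if the notebook fails,
        # so later notebooks never run on incomplete outputs.
        subprocess.run(nbconvert_command(notebook), check=True)
```

Calling `run_all()` executes the four notebooks one after another. Stopping on the first error is deliberate here, on the assumption that each notebook depends on the output of the one before it.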
Please see the PDF file for the report.