Multi-label classification of movie poster genres using a fine-tuned Vision Transformer (ViT) model. The model was fine-tuned from Google's pre-trained vit-base-patch16-224 checkpoint. Details about the movie posters and their genres can be found here.
Pre-trained ViT Direct Testing: pretrained.ipynb
Fine-tuned ViT Modelling and Testing: final.ipynb
Raw Dataset: MovieGenre.csv
Clean Dataset: clean_MovieGenre.csv
Data Preprocessing: preprocessing.ipynb
Image Scraper/Downloader: image_scraper.ipynb
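
Because a poster can belong to several genres at once, a multi-label classifier typically scores each genre independently instead of picking a single best class. The sketch below illustrates one common way to decode such outputs: apply a sigmoid to each logit and keep every genre whose probability clears a threshold. It is not taken from the notebooks. The genre list, the 0.5 threshold, and the function names are assumptions made for illustration.

```python
import math

# Hypothetical genre vocabulary; the real label set comes from the dataset.
GENRES = ["Action", "Comedy", "Drama", "Horror", "Romance"]


def sigmoid(x: float) -> float:
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def decode_genres(logits, labels=GENRES, threshold=0.5):
    """Return every label whose independent probability meets the threshold.

    Each logit is squashed on its own (no softmax), so any number of
    genres, including none, can be predicted for one poster.
    """
    if len(logits) != len(labels):
        raise ValueError("expected one logit per label")
    probs = [sigmoid(z) for z in logits]
    return [label for label, p in zip(labels, probs) if p >= threshold]


# Made-up logits standing in for a model's output on one poster.
example_logits = [2.1, -1.3, 0.4, -3.0, -0.2]
predicted = decode_genres(example_logits)
print(predicted)
```

Lowering the threshold trades precision for recall, so the assumed value of 0.5 would normally be tuned on a validation split.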