Skip to content

Multi-label classification of movie posters genre using a finetuned Vision Transformer (ViT) model.

Notifications You must be signed in to change notification settings

KyleZrey/genre-classification-ViT

Repository files navigation

genre-classification-ViT

Multi-label classification of movie posters genre using a finetuned Vision Transformer (ViT) model. The pre-trained ViT model by Google named vit-base-patch16-224 was utilized. Details about the movie posters and its genres can be found here.

Pre-trained ViT Direct Testing: pretrained.ipynb

Finetuned ViT Modelling and Testing: final.ipynb

Raw Dataset: MovieGenre.csv

Clean Dataset: clean_MovieGenre.csv

Data Preprocessing: preprocessing.ipynb

Image Scraper/Downloader: image_scraper.ipynb

About

Multi-label classification of movie posters genre using a finetuned Vision Transformer (ViT) model.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published