Experiments with GPT-2

In this repo, I want to experiment with GPT-2 (124M parameter) model and understand how to train and fine tune it well. Instead of using the Hugging Face implementation, I followed Andrej Karpathy's nanoGPT implementation and made changes wherever necessary.

As a first experiment, I fine tuned the model for sentiment classification. Results and code are in the sentiment classification folder

Setup

This repository was setup using uv. To setup on your computer, install uv using

# On macOS and Linux.
curl -LsSf https://astral.sh/uv/install.sh | sh

If you are on windows, you can follow the instrunctions on the UV page.

Once you have UV installed, clone the repository by running

git clone https://github.com/varun-suresh/experiments-with-gpt2

From the project folder, run

uv sync

In the .env file, add the root directory of this repository to the PYTHONPATH env variable. Then from the root folder of this repo, run

export UV_ENV_FILE=.env

You should now have everything setup!

Name		Name	Last commit message	Last commit date
Latest commit History 169 Commits
language_models		language_models
lib		lib
rag		rag
sentiment_classification		sentiment_classification
vision_models		vision_models
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Experiments with GPT-2

Setup

About

Releases

Packages

Languages

License

varun-suresh/experiments-with-gpt2

Folders and files

Latest commit

History

Repository files navigation

Experiments with GPT-2

Setup

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages