QLoRA-Fine-tuning-for-Film-Character-Styled-Responses-from-LLM

Code for fine-tuning Llama2 LLM with custom text dataset to produce film character styled responses

Overview

This code utilised QLoRA parameter efficient fine-tuning techniques to create a tailored Llama2 LLM capable of returning responses in the style of Gandalf from The Lord of the Rings

Project Structure

get_gandalf_data.py - webscrapes Gandalf text dialogue data from online resources
gandalf_dataset.py - creates query/response dataset from gandalf.csv which was generated from webscraped dialogue data
hyper_params.py - defines hyperparameters for training loop
train_gandalf.py - fine-tunes base Llama2 model with custom gandalf dataset using QLoRA peft techniques
evaluate.py - loads fine-tuned Llama2 model and produces Gandalf style response to input text prompt

Author

Louis Chapo-Saunders

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

QLoRA-Fine-tuning-for-Film-Character-Styled-Responses-from-LLM

Overview

Project Structure

Author

About

Releases

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
gandalf		gandalf
README.md		README.md
evaluate.py		evaluate.py
gandalf.csv		gandalf.csv
gandalf_dataset.py		gandalf_dataset.py
get_gandalf_data.py		get_gandalf_data.py
hyper_params.py		hyper_params.py
train_gandalf.py		train_gandalf.py

louisc-s/QLoRA-Fine-tuning-for-Film-Character-Styled-Responses-from-LLM

Folders and files

Latest commit

History

Repository files navigation

QLoRA-Fine-tuning-for-Film-Character-Styled-Responses-from-LLM

Overview

Project Structure

Author

About

Topics

Resources

Stars

Watchers

Forks

Releases

Languages