This repository contains the source code used for fine-tuning the LLM phi-2 on programming tasks. The methodologies used are SFT (supervised fine-tuning) and DPO (direct preference optimization). To run either script, first install the dependencies listed in requirements.txt.

The repository also contains two notebooks for evaluating the fine-tuned or base model on the HumanEval and HumanEval-X benchmarks. For the latter, to limit the computational cost, only Java, C++, and JavaScript were selected as the languages for generating samples; furthermore, for each language, 80 of the 164 programming tasks were sampled from the corresponding JSON file of the HumanEval-X dataset.

Lastly, I implemented a simple memory for the LLM, which leverages ChromaDB to store and retrieve the embeddings inserted into the model's context. Since not all retrieved information is relevant to the information need expressed by the user's query, a threshold hyperparameter was added to filter out the least relevant items.
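The per-language subsampling described above can be sketched as follows. This is an illustrative helper, not the repository's actual code: it assumes each HumanEval-X language split is a file with one JSON object per line, and the function name and fixed seed are my own choices.

```python
import json
import random

def sample_tasks(path, k=80, seed=0):
    """Sample k programming tasks from a HumanEval-X per-language file
    (assumed format: one JSON task object per line)."""
    with open(path, encoding="utf-8") as f:
        tasks = [json.loads(line) for line in f if line.strip()]
    # A fixed seed keeps the sampled subset reproducible across runs.
    rng = random.Random(seed)
    return rng.sample(tasks, k)
```

Running this once per language (Java, C++, JavaScript) yields the 80-task subsets used for generation.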
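The threshold filtering can be sketched as below. This is a minimal illustration, not the repository's implementation: it assumes a ChromaDB-style query result (lists of lists, one inner list per query text) where `distances` holds embedding distances with lower meaning more similar, and the function names are hypothetical.

```python
def filter_relevant(query_result, threshold=0.5):
    """Keep only retrieved documents whose distance to the query is
    below `threshold` (assumed ChromaDB-style result layout)."""
    docs = query_result["documents"][0]
    dists = query_result["distances"][0]
    # Lower distance = closer embedding = more relevant to the query.
    return [doc for doc, dist in zip(docs, dists) if dist < threshold]

def build_context(query_result, threshold=0.5, sep="\n\n"):
    """Concatenate the surviving snippets into extra context for the prompt."""
    return sep.join(filter_relevant(query_result, threshold))
```

With a strict threshold the memory contributes nothing rather than injecting off-topic text, which is the point of the hyperparameter.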
eyess-glitch/phi-2-fine-tuning