Skip to content

How do I use other pre-trained models for this project? #417

Answered by rasbt
zefrommeister asked this question in Q&A
Discussion options

You must be logged in to vote

Hi there,
I have some bonus material on converting the GPT-2 model to a Llama model here: https://github.com/rasbt/LLMs-from-scratch/tree/main/ch05/07_gpt_to_llama (it's a step-by-step guide for educational purposes).

I also made a notebook with the Llama 3.2 1B standalone code that you could use to replace the GPT-2 model code in Chapter 4 and later: https://github.com/rasbt/LLMs-from-scratch/blob/main/ch05/07_gpt_to_llama/standalone-llama32.ipynb

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@zefrommeister
Comment options

Answer selected by rasbt
# for free to join this conversation on GitHub. Already have an account? # to comment
Category
Q&A
Labels
None yet
2 participants