Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

LLama 2 and Code LLama support? #73

Open
Ananderz opened this issue Sep 4, 2023 · 2 comments
Open

LLama 2 and Code LLama support? #73

Ananderz opened this issue Sep 4, 2023 · 2 comments

Comments

@Ananderz
Copy link

Ananderz commented Sep 4, 2023

I have been trying to get llama 2 models to function correctly. They start off ok but then all of them goes into a loop with repetitions or gibberish.

I haven't tried setting model_type:llama to something else, could it be that we need to add llama2 here instead?

model_type: llama

Possible to get any of the code llms to support this ?

@Ananderz
Copy link
Author

Ananderz commented Sep 4, 2023

I tried with llama-2 and llama2 and read the ctransformers documentation and realized its just llama.

The answer gets into a loop when using llama2 models:

The telecom industry is not not not not not not not not not not not not not not not

Like that, I read somewhere that it could be related to something RoPE but don't know how to set that!

@Ananderz
Copy link
Author

Fixed it by implementing prompt template!

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant