Skip to content

This issue was moved to a discussion.

You can continue the conversation there. Go to discussion →

Has anyone tried Dolly-like models? #558

New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Closed
kha84 opened this issue Mar 27, 2023 · 3 comments
Closed

Has anyone tried Dolly-like models? #558

kha84 opened this issue Mar 27, 2023 · 3 comments

Comments

@kha84
Copy link

kha84 commented Mar 27, 2023

I just watched the latest video of my favorite youtuber - https://www.youtube.com/watch?v=AWAo4iyNWGc&t=14s and was wondering, if someone has already quantized & converted one of these to be compatible with llama.cpp?
The beauty of Dolly-like models is that they're based on open source gpt-j-6B from EleutherAI, so noone will be hunting us for using them without an ask.

@kha84 kha84 changed the title Have anyone tried Dolly-like models? Has anyone tried Dolly-like models? Mar 27, 2023
@pikalover6
Copy link

ggml already supports gpt-j, you should just be able to convert and quantize them.

@gorborukov
Copy link

gorborukov commented Mar 27, 2023

I did not succeed with the convert-h5-to-ggml.py script and the model Dolly_GPT-J-6b. Script thinks for a while and crashes with the status "Killed".
Screenshot from 2023-03-28 01-02-58

@kha84
Copy link
Author

kha84 commented Mar 27, 2023

I did not succeed with the convert-h5-to-ggml.py script and the model Dolly_GPT-J-6b. Script thinks for a while and crashes with the status "Killed". Screenshot from 2023-03-28 01-02-58

Prob out-of-memory killer played his part? Check journalctl / syslog and try to monitor system resources while running it

@ggml-org ggml-org locked and limited conversation to collaborators Mar 28, 2023
@gjmulder gjmulder converted this issue into discussion #569 Mar 28, 2023

This issue was moved to a discussion.

You can continue the conversation there. Go to discussion →

Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants