Llama cuda support? #138

Open
jojo2357 opened this issue May 4, 2024 · 1 comment

Comments

jojo2357 commented May 4, 2024

I have compiled llama.cpp with the LLAMA_CUDA option, but running an edge model does not use the GPU at all. Is there something I should look for in my config?

Also, would it be possible to download models other than the LIBERTY - EDGE models? I assume that I could get more inference earnings if I had a more popular model, too.

(Running on Ubuntu Linux with the proprietary NVIDIA drivers)

mhtvsSFrpHdE commented Jul 14, 2024

Hi, I've just discovered how to hack the llama startup arguments:

#144

Can you try it and share which arguments you use to enable GPU acceleration?
I also don't know how to compile it on Windows yet... I would be thankful for any information, steps, or commands.
