Replies: 1 comment
It turns out the main culprit was the 'enforce_isolation' option on kernel 6.12. Turning it off eliminates most of the lag, although I still get some stuttering during VAE processing and at the very beginning of prompt processing.
On my aging 3400G, the whole desktop GUI (Linux, either X11+XFCE or Wayland+KDE) tends to freeze completely during the more intensive GPU computations of llama.cpp/stable-diffusion.cpp (on Vulkan). From low to high impact:
Also, these 'choking' events sometimes trigger driver bugs, causing full system lock-ups.
So, I'm looking for ways to throttle GPU usage during inference. What I've tried so far:
`ctx->device->device.waitIdle()` + sleep before `ggml_vk_build_graph` calls: kind of works as a proof of concept, but of course it is no real solution.

Thoughts?
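For reference, the wait-then-sleep idea can be sketched roughly as a duty-cycle throttle: time each chunk of GPU work, then stay idle long enough that the GPU is busy only a chosen fraction of the time, leaving gaps for the desktop compositor. This is a standalone illustration, not code from llama.cpp. `idle_time_for` and `run_throttled` are hypothetical names, and the `submit_and_wait` callable is an assumed stand-in for "build the graph, submit it, then call `waitIdle()`".

```cpp
#include <cassert>
#include <chrono>
#include <thread>

// Hypothetical helper, not part of llama.cpp or ggml.
// Given how long the GPU was busy and a target duty cycle in (0, 1],
// return how long to stay idle so that busy / (busy + idle) == duty.
inline std::chrono::microseconds idle_time_for(std::chrono::microseconds busy, double duty) {
    assert(duty > 0.0 && duty <= 1.0);
    const double idle_us = static_cast<double>(busy.count()) * (1.0 - duty) / duty;
    return std::chrono::microseconds(static_cast<long long>(idle_us));
}

// Runs one chunk of work, measures it, then sleeps to hit the duty cycle.
// `submit_and_wait` is an assumed placeholder for the real GPU submission
// followed by a blocking wait; it must not return before the work is done,
// otherwise the measured busy time is meaningless.
template <typename F>
std::chrono::microseconds run_throttled(F && submit_and_wait, double duty) {
    const auto t0 = std::chrono::steady_clock::now();
    submit_and_wait();
    const auto busy = std::chrono::duration_cast<std::chrono::microseconds>(
        std::chrono::steady_clock::now() - t0);
    const auto idle = idle_time_for(busy, duty);
    std::this_thread::sleep_for(idle);
    return idle;  // returned so callers can log how much time was given up
}
```

With a duty cycle of 0.5 the idle time equals the busy time, so throughput roughly halves. It also does nothing about a single long-running dispatch, which is probably why this only works as a proof of concept.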