-
Notifications
You must be signed in to change notification settings - Fork 10.9k
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
Windows VS2022 Build - Returning nonsense #2
Comments
Remove the |
Ensured to pull latest, and with the removed extra '\n' token the output is identical. If I try with a different prompt:
The output matches the same post prompt output using the example prompt!
|
What happens if you use the F16 model instead?
|
F16 model produces very much more expected results. So likely an issue in the
|
I am getting similar results but I am building it with ubuntu (wsl2), with ggml-model-f16.bin results looks good, with ggml-model-q4_0.bin I get symbols too |
Ok, that clears it - the quantization code is currently tested and optimized only on ARM NEON. If you are interested, you can keep track of the progress here: |
Gotcha makes sense, sorry for the hassle! Thanks for swift follow ups. |
Fixed bos/eos token (which is both 11 according to config.json of Fal…
add cmake commands
support axpy q4_0 for loop
* fstring ggml-org#1 * fstring ggml-org#2
* dictionary ggml-org#1 * dictionary ggml-org#2
Unsure if windows builds are expected to even function! 😄
I had to insert
ggml_time_init();
intomain()
of each astimer_freq
was being left at 0 and causing a divide by zero.Compiled with
cl main.cpp ggml.c utils.cpp /std:c++20 /DEBUG /EHsc
, same for quantize.cpp.Run with the following
main.exe -m ./LLaMA/7B/ggml-model-q4_0.bin -t 32 -n 512 -p "Building a website can be done in 10 simple steps:\n"
Produced the following output:
Not run to completion, but running with the same seed produces identical results. Will give it a poke around but unsure where to begin.
The text was updated successfully, but these errors were encountered: