Fix bug in main.cpp (penalize_nl=false doesn't work). Suppress warning on mingw. #1528
Conversation
* Fix bug in main.cpp where penalize_nl=false has no effect. It modifies the underlying logits array, but at this point we are already working on the candidates copy.
* Suppress redefinition warning for NOMINMAX on mingw. In my installation, this macro is already defined by /usr/lib/gcc/x86_64-w64-mingw32/11/include/c++/x86_64-w64-mingw32/bits/os_defines.h:45.
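A minimal sketch of the kind of guard that avoids the redefinition warning; the surrounding `_WIN32` block and the `<windows.h>` include are assumptions about where the define lives in main.cpp, not a quote of the actual change:

```cpp
// Only define NOMINMAX if the toolchain (e.g. mingw's os_defines.h) has not
// already defined it, so the "macro redefined" warning is not triggered.
#if defined(_WIN32)
#ifndef NOMINMAX
#define NOMINMAX
#endif
#include <windows.h>
#endif
```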
So many little mingw oddities cropping up. I'm curious though, any idea what's including …?
examples/main/main.cpp (Outdated)

                candidates_p.data[idx].logit = nl_logit;
                break;
            }
        }
Hmm, the order of candidates_p does not seem to be changed. Why do you think it has?
It's not that the order of candidates_p has changed, although IMO it's not great to rely on the fact that llama_sample_repetition_penalty and llama_sample_frequency_and_presence_penalties preserve the order (without at least documenting that). The main issue is that on line 418 we've copied the logits into the local candidates vector (of llama_token_data_array, not raw floats). So modifying the original logit array from the context does nothing, right?
@tom7 Apologies for the delay and for not paying more attention to this - I think you are right, and we've had this bug for quite some time now.
I do not see a bug here. The order of the logits is not changed after penalization.
Thanks for the quick look!
Apparently it's via …
It looks like this PR didn't gain much traction, but I think I just discovered the same bug that this PR fixes. As I understand it, the current situation is that the nl logit is saved and then restored into the context's raw logits array after the penalties are applied. However, as far as I can see that restore has no effect, because sampling continues from the candidates copy. This PR changes it to find the nl token in candidates_p and restore the logit there instead.
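Roughly, the changed block looks like this (a reconstruction built around the hunk quoted above; the exact loop bounds and the llama_token_nl(ctx) signature are inferred from the rest of this thread and may differ from the final code):

```cpp
// Restore the newline logit in the candidates copy, not in the context's
// raw logits array, so that penalize_nl=false actually takes effect.
if (!penalize_nl) {
    for (size_t idx = 0; idx < candidates_p.size; idx++) {
        if (candidates_p.data[idx].id == llama_token_nl(ctx)) {
            candidates_p.data[idx].logit = nl_logit;
            break;
        }
    }
}
```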
@martindevans Thanks for bringing attention to this again.
@tom7 Sorry for the delay of ~3 months 😄
* master: (773 commits)
  server : add `/detokenize` endpoint (ggml-org#2802)
  convert.py : advanced option (ggml-org#2753)
  llama : use Unicode Escape Sequence to replace encoded characters (ggml-org#2814)
  flake.nix : add rocm support and cleanup (ggml-org#2808)
  llama : move #includes out of _GNU_SOURCE conditional (ggml-org#2817)
  main : fix bug (penalize_nl=false doesn't work) + suppress warning on mingw (ggml-org#1528)
  llama : use std::abs in llama_sample_tail_free (ggml-org#2800)
  k-quants : remove unnecessary tensor shape restrictions (ggml-org#2811)
  Better perplexity for 2- and 3-bit quantization for LLaMA-v2-70B (ggml-org#2807)
  Fix HellaSwag (ggml-org#2805)
  flake : build llama.cpp on Intel with nix (ggml-org#2795)
  Handle null rope scaling value (ggml-org#2793)
  Fix spm whitespaces (ggml-org#2806)
  examples : skip unnecessary external lib in server README.md how-to (ggml-org#2804)
  llama : fix struct decl (ggml-org#2790)
  Faster perplexity computation (ggml-org#2786)
  llama : add llama_beam_search() (ggml-org#2267)
  convert.py : Get rope scale from HuggingFace models (ggml-org#2772)
  llama-bench : add model sizes (ggml-org#2771)
  convert.py : export rope freq_base when converting CodeLlama from an HF model (ggml-org#2773)
  ...
main : fix bug (penalize_nl=false doesn't work) + suppress warning on mingw (ggml-org#1528)

* Fix bug in main.cpp where penalize_nl=false has no effect. It modifies the underlying logits array, but at this point we are already working on the candidates copy.
* Suppress redefinition warning for NOMINMAX on mingw. In my installation, this macro is already defined by /usr/lib/gcc/x86_64-w64-mingw32/11/include/c++/x86_64-w64-mingw32/bits/os_defines.h:45.
* main : fix indentation
* main : pass ctx to llama_token_nl()

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Pretty sure this is just a bug, but it's always possible I'm missing something!