Don't force immediate interactive without `-i` #354

tjohnman · 2023-03-21T14:16:15Z

Sometimes we might want to use a reverse prompt but we want to let the model generate tokens right after the initial prompt. So we don't force user input mode if the -i flag wasn't specified and instead let it run until we encounter the reverse prompt.

This gives use some more flexibility, since it doesn't force the user to enter a newline if they want to let the model generate text right after the initial prompt and only be asked for input if the reverse prompt is encountered.

blackhole89 · 2023-03-21T16:41:45Z

I think it's good to not force interactive mode immediately (in fact that was how it worked when I first made the patch, but the logic seems to have changed at some point), but in this combination the flags seem to be rendered a bit misleading.

What I conceived of in the beginning:

No -i or -r PROMPT: Classical, non-interactive generate-only mode.
-i: Interactive mode, in which the user may add additional input during generation by pressing Ctrl+C to seize control.
--interactive-first: As -i, plus prompt for user input immediately after the initial prompt is emitted.
-r PROMPT: As -i, plus look for reverse prompt PROMPT, so control passes to the user whenever it is encountered.

The scenario you are proposing:

No -i or -r PROMPT: Classical, non-interactive generate-only mode.
Not available: interactive mode, where further user input is prompted after Ctrl+C only.
-i: Interactive mode, plus prompt for user input immediately after the initial prompt is emitted.
-r PROMPT: Interactive mode, plus look for reverse prompt PROMPT.

I think (2) is a valid use case, and moreover it is confusing that the "interactive mode" flag -i actually takes you into the "interactive + input in the beginning" mode while in order to get into any sort of interactive mode where you do not want to start out submitting input, you have to specify a reverse prompt with -r and not submit -i (and the natural UX for attaining 2 is -r RANDOM_STRING_THAT_NEVER_WILL_COME_UP).

Rather than loading initially_interacting from params.is_interactive, I would therefore suggest (re)introducing a dedicated parameter corresponding to it.

tjohnman · 2023-03-21T16:48:49Z

@blackhole89 I agree 100% with you that the first scenario is the most intuitive and useful (I'll do the changes). I did not remove --interactive-first (but I do remember seeing it in a previous build; no idea what happened to it).

blackhole89 · 2023-03-21T16:54:38Z

@tjohnman Thanks! Wasn't meaning to imply you had anything to do with the removal - development has been moving quickly and chaotically, it probably just fell on the wayside in some refactoring along the way.

Sometimes we might want to use a reverse prompt but we want to let the model generate tokens right after the initial prompt. So we don't force user input mode if the -i flag wasn't specified and instead let it run until we encounter the reverse prompt. This gives use some more flexibility, since it doesn't force the user to enter a newline if they want to let the model generate text right after the initial prompt and only be asked for input if the reverse prompt is encountered. The `--interactive-first` flag is reintroduced to force the old behavior. `-r` behaves like `-i` plus introduces a reverse prompt (it can be specified more than once).

Green-Sky · 2023-03-22T20:53:48Z

this kind of broke instruction mode. this change needs to be only for --interactive not for --instruct

Green-Sky · 2023-03-22T20:59:03Z

main.cpp

@@ -1032,7 +1036,7 @@ int main(int argc, char ** argv) {
 #endif
               " - Press Return to return control to LLaMa.\n"
               " - If you want to submit another line, end your input in '\\'.\n\n");
-        is_interacting = true;
+        is_interacting = params.interactive_start;


params.interactive_start || params.instruct

Co-authored-by: Johnman <tjohnman@github>

gjmulder added the enhancement New feature or request label Mar 21, 2023

Johnman added 2 commits March 21, 2023 18:21

Update help output.

98570dd

tjohnman force-pushed the no-forced-interactive-start branch from 77d9a8a to 98570dd Compare March 21, 2023 17:25

tjohnman mentioned this pull request Mar 21, 2023

Replace EOS with newline to prevent context/memory being flushed by EOS in interactive mode #333

Merged

blackhole89 approved these changes Mar 22, 2023

View reviewed changes

ggerganov merged commit 305ba6f into ggml-org:master Mar 22, 2023

tjohnman deleted the no-forced-interactive-start branch March 22, 2023 17:45

Green-Sky reviewed Mar 22, 2023

View reviewed changes

tjohnman pushed a commit to tjohnman/llama.cpp that referenced this pull request Mar 22, 2023

Fix instruct mode broken by PR ggml-org#354

ce33900

Green-Sky pushed a commit that referenced this pull request Mar 23, 2023

Fix instruct mode broken by PR #354 (#409)

f7dc43b

Co-authored-by: Johnman <tjohnman@github>

Bearsaerker mentioned this pull request Mar 12, 2025

Eval bug: Gemma 3 extremly slow prompt processing when using quantized kv cache. #12352

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Don't force immediate interactive without `-i` #354

Don't force immediate interactive without `-i` #354

tjohnman commented Mar 21, 2023 •

edited

Loading

blackhole89 commented Mar 21, 2023 •

edited

Loading

tjohnman commented Mar 21, 2023 •

edited

Loading

blackhole89 commented Mar 21, 2023

Green-Sky commented Mar 22, 2023

Green-Sky Mar 22, 2023 •

edited

Loading

Don't force immediate interactive without -i #354

Don't force immediate interactive without -i #354

Conversation

tjohnman commented Mar 21, 2023 • edited Loading

blackhole89 commented Mar 21, 2023 • edited Loading

tjohnman commented Mar 21, 2023 • edited Loading

blackhole89 commented Mar 21, 2023

Green-Sky commented Mar 22, 2023

Green-Sky Mar 22, 2023 • edited Loading

Choose a reason for hiding this comment

Don't force immediate interactive without `-i` #354

Don't force immediate interactive without `-i` #354

tjohnman commented Mar 21, 2023 •

edited

Loading

blackhole89 commented Mar 21, 2023 •

edited

Loading

tjohnman commented Mar 21, 2023 •

edited

Loading

Green-Sky Mar 22, 2023 •

edited

Loading