examples : evaluate tokens in batches after swapping context #1014

grencez · 2023-04-16T09:57:11Z

This new loop around llama_eval is somewhat redundant with the batching already done in the main loop, but without a refactor both are still needed so that the print statements happen at the right times.
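For readers who have not seen the diff, the batching described in this comment can be sketched roughly as follows. This is a minimal, self-contained illustration and not the code from the PR: `eval_in_batches`, `eval_fn`, and the `token` alias are hypothetical names, and the callback stands in for the real `llama_eval` call, whose signature does not appear in this thread.

```cpp
#include <algorithm>
#include <functional>
#include <vector>

// Hypothetical token type; stands in for whatever the real API uses.
using token = int;

// Hypothetical stand-in for the evaluation call: receives a pointer to the
// first token of the batch, the batch length, and the number of tokens
// already evaluated. Returns 0 on success, non-zero on failure.
using eval_fn = std::function<int(const token *, int, int)>;

// Evaluate `embd` in chunks of at most `n_batch` tokens, advancing `n_past`
// after each successful chunk. Returns 0 on success, 1 on failure.
int eval_in_batches(const std::vector<token> & embd,
                    int n_batch,
                    int & n_past,
                    const eval_fn & eval) {
    if (n_batch <= 0) {
        return 1; // a non-positive batch size would never make progress
    }
    const int n_total = static_cast<int>(embd.size());
    for (int i = 0; i < n_total; i += n_batch) {
        // The last chunk may be shorter than n_batch.
        const int n_eval = std::min(n_total - i, n_batch);
        if (eval(embd.data() + i, n_eval, n_past) != 0) {
            return 1; // stop at the first failed chunk
        }
        n_past += n_eval;
    }
    return 0;
}
```

With 10 pending tokens and a batch size of 4, the callback would be invoked with chunks of 4, 4, and 2 tokens, which is the situation that arises when a context swap leaves more tokens to re-evaluate than fit in one batch.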

grencez · 2023-04-17T20:06:51Z

Tests passed yesterday. I just synced recent changes and added a comment.

examples/main/main.cpp

grencez force-pushed the batching branch 5 times, most recently from 26748b2 to 3bc0a89, April 16, 2023 10:22

grencez changed the title ~~Evaluate tokens in batches after swapping context~~ examples: Evaluate tokens in batches after swapping context Apr 16, 2023

grencez changed the title ~~examples: Evaluate tokens in batches after swapping context~~ examples : evaluate tokens in batches after swapping context Apr 16, 2023

grencez marked this pull request as ready for review April 16, 2023 10:30

examples : evaluate tokens in batches after swapping context

d1f0210

grencez force-pushed the batching branch from 3bc0a89 to d1f0210, April 17, 2023 20:02

ggerganov approved these changes Apr 21, 2023

Update examples/main/main.cpp

80d1c16

ggerganov merged commit 9411288 into ggml-org:master Apr 21, 2023

grencez deleted the batching branch April 21, 2023 21:09

Bearsaerker mentioned this pull request Mar 12, 2025

Eval bug: Gemma 3 extremly slow prompt processing when using quantized kv cache. #12352

Open
