llama : correctly report GGUFv3 format #3818

Merged: 1 commit merged into ggml-org:master on Oct 27, 2023

Conversation

cebtenzzre (Collaborator) commented on Oct 27, 2023

Follow-up to #3552.

Before:

llm_load_print_meta: format           = unknown

After:

llm_load_print_meta: format           = GGUFv3 (latest)
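For context, here is a minimal sketch of the kind of mapping involved, not the actual llama.cpp patch; the helper name `gguf_version_name` and the hard-coded version are illustrative assumptions. Before this change, version-3 files fell through to the "unknown" default:

```cpp
// Hypothetical sketch: map the GGUF version number from the file header
// to the string printed by llm_load_print_meta.
#include <cstdint>
#include <cstdio>

static const char * gguf_version_name(uint32_t version) {
    switch (version) {
        case 1:  return "GGUFv1";
        case 2:  return "GGUFv2";
        case 3:  return "GGUFv3 (latest)"; // the case this PR adds
        default: return "unknown";         // what v3 files hit before
    }
}

int main() {
    printf("llm_load_print_meta: format           = %s\n", gguf_version_name(3));
    return 0;
}
```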

Will GGUFv2 be deprecated like GGUFv1 was?

edit: I guess it doesn't matter, since for little-endian files GGUFv3 is just a version bump over GGUFv2, AFAIK.
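The version-bump remark follows from the GGUF header layout: a 4-byte magic "GGUF" followed by a little-endian uint32 version. Below is a minimal standalone sketch (not llama.cpp code) that reads just that header; it assumes a little-endian host, where the raw bytes can be read directly into the integer:

```cpp
// Minimal sketch: read the GGUF magic and version from a model file.
// Assumes a little-endian host, so the file's little-endian uint32
// version field can be read without byte swapping.
#include <cstdint>
#include <cstdio>
#include <cstring>

int main(int argc, char ** argv) {
    if (argc < 2) {
        fprintf(stderr, "usage: %s model.gguf\n", argv[0]);
        return 1;
    }
    FILE * f = fopen(argv[1], "rb");
    if (!f) {
        perror("fopen");
        return 1;
    }

    char magic[4];
    uint32_t version = 0;
    if (fread(magic, 1, 4, f) != 4 || fread(&version, 4, 1, f) != 1) {
        fprintf(stderr, "failed to read GGUF header\n");
        fclose(f);
        return 1;
    }
    fclose(f);

    if (memcmp(magic, "GGUF", 4) != 0) {
        fprintf(stderr, "not a GGUF file\n");
        return 1;
    }
    printf("GGUF version: %u\n", version);
    return 0;
}
```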

@cebtenzzre cebtenzzre requested a review from ggerganov October 27, 2023 17:12
@cebtenzzre cebtenzzre merged commit 6d459cb into ggml-org:master Oct 27, 2023
mattgauf added a commit to mattgauf/llama.cpp that referenced this pull request Oct 27, 2023
* master: (350 commits)
  speculative : ensure draft and target model vocab matches (ggml-org#3812)
  llama : correctly report GGUFv3 format (ggml-org#3818)
  simple : fix batch handling (ggml-org#3803)
  cuda : improve text-generation and batched decoding performance (ggml-org#3776)
  server : do not release slot on image input (ggml-org#3798)
  batched-bench : print params at start
  log : disable pid in log filenames
  server : add parameter -tb N, --threads-batch N (ggml-org#3584) (ggml-org#3768)
  server : do not block system prompt update (ggml-org#3767)
  sync : ggml (conv ops + cuda MSVC fixes) (ggml-org#3765)
  cmake : add missed dependencies (ggml-org#3763)
  cuda : add batched cuBLAS GEMM for faster attention (ggml-org#3749)
  Add more tokenizer tests (ggml-org#3742)
  metal : handle ggml_scale for n%4 != 0 (close ggml-org#3754)
  Revert "make : add optional CUDA_NATIVE_ARCH (ggml-org#2482)"
  issues : separate bug and enhancement template + no default title (ggml-org#3748)
  Update special token handling in conversion scripts for gpt2 derived tokenizers (ggml-org#3746)
  llama : remove token functions with `context` args in favor of `model` (ggml-org#3720)
  Fix baichuan convert script not detecing model (ggml-org#3739)
  make : add optional CUDA_NATIVE_ARCH (ggml-org#2482)
  ...
brittlewis12 added a commit to brittlewis12/llmfarm_core.swift that referenced this pull request Nov 17, 2023
olexiyb pushed a commit to Sanctum-AI/llama.cpp that referenced this pull request Nov 23, 2023
brittlewis12 added a commit to brittlewis12/llmfarm_core.swift that referenced this pull request Nov 30, 2023