support chatml-function-calling via llama-cpp #4
Conversation
I would prefer to merge smaller changes rather than having so many items in this PR; also, you won't need to merge and rebase so frequently. I am working on the markdown fix because the current interpreter.py has some legacy code that is broken and is not working well with some models right now. I hope to get this done today.
See the EPIC ticket :)
Sounds good, I'll split them up!
So it's OK to merge now?
Not yet, I haven't done any regression testing yet. I'll let you know soon.
I want to merge #6 as it actually fixes the behaviour of code colorization, and I've tested it with several models. The code is still not fully cleaned up, but it's much simpler and at least it works as expected. Is it OK with you if I merge it?
Yeah, go ahead. I'll deal with any merge conflicts when this is ready.
OK, this works now, merged with your changes. Not getting any good results with Mistral, but that's more of a model problem. Hopefully functionary will be a lot better.
Ready to merge so we can start experimenting with it? I think enabling vectordb is important for the auto mode, as well as extending the doc/data stuff to give more contextual hints about how to achieve things and which commands to use.
Uh, "This branch cannot be rebased due to conflicts".
Also, note that commit messages must be capitalized, and I would recommend squashing the commits.
OK, should be good now.
Yeah. How do you envision structuring the RAG data? What's the best "use case -> commands" documentation/reference that we can shove into the vectordb? If we just put the r2 docs in there, I don't think we'd get any good results for high-level queries like "solve this crackme" or "what's the password".
We should probably update to the latest Mistral models in the -M output (I filed a ticket for this). I've tested your code and it's not really working well; not sure if there's a way to debug what the model is doing internally to trace what's going on. About vectordb: what it does is send the user prompt to the database, and the database returns a list of sentences that can be prepended to the query as contextual information. This data can provide instructions to perform actions, or information about the answer the user is looking for, so it's transparent to the user and works across all models. I think integrating this in the auto mode will help a lot with the local results too.
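The retrieve-and-prepend flow described above can be sketched roughly as follows. This is a minimal illustration, not code from this PR: it uses a toy bag-of-words cosine similarity in place of real embeddings, and the stored sentences, `retrieve`, and `build_prompt` names are hypothetical examples.

```python
# Toy sketch of the vectordb flow: find the stored sentences most similar
# to the user prompt and prepend them as context. Real deployments would
# use embeddings and a vector database instead of bag-of-words counts.
import math
import re
from collections import Counter

def vectorize(text):
    # Lowercase bag-of-words term counts, punctuation stripped.
    return Counter(re.findall(r"[a-z]+", text.lower()))

def cosine(a, b):
    dot = sum(a[t] * b[t] for t in set(a) & set(b))
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(db, query, k=2):
    # Return the k stored sentences most similar to the query.
    qv = vectorize(query)
    return sorted(db, key=lambda s: cosine(vectorize(s), qv), reverse=True)[:k]

def build_prompt(db, user_prompt, k=2):
    # Prepend retrieved context sentences to the user prompt, transparently.
    return "\n".join(retrieve(db, user_prompt, k)) + "\n\n" + user_prompt

# Hypothetical "use case -> commands" sentences, not actual r2 docs.
db = [
    "Use the afl command to list all functions in the binary.",
    "Use iz to list strings; useful to find the password in a crackme.",
    "Use pdf @ main to disassemble the main function.",
]
prompt = build_prompt(db, "what's the password in this crackme?")
```

Because the context is injected before the prompt rather than via a tool-calling protocol, this works with any model, which is the transparency point made above.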
Yeah, I've gotten it to work only like once, by luck. I'll add some debugging. About the vectors: yeah, I know how RAG works, I mean what text are you thinking of putting in there, do you have examples?
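For reference, the chat format this PR wires up is llama-cpp-python's `chatml-function-calling`, which takes OpenAI-style tool definitions. The sketch below only builds the request payload; the `r2_cmd` tool, the model path, and the assistant persona are hypothetical examples, not code from this PR, and the actual model call is commented out because it needs a local GGUF file.

```python
# Sketch of a chatml-function-calling request with llama-cpp-python.
# Only the payload construction runs here; the Llama call is commented out.
# from llama_cpp import Llama

tools = [{
    "type": "function",
    "function": {
        # Hypothetical tool: run a radare2 command and return its output.
        "name": "r2_cmd",
        "description": "Run a radare2 command and return its output",
        "parameters": {
            "type": "object",
            "properties": {"command": {"type": "string"}},
            "required": ["command"],
        },
    },
}]

messages = [
    {"role": "system", "content": "You are a reverse-engineering assistant."},
    {"role": "user", "content": "List the functions in this binary."},
]

# llm = Llama(model_path="model.gguf",
#             chat_format="chatml-function-calling")
# resp = llm.create_chat_completion(messages=messages, tools=tools)
# The response's tool_calls would then be dispatched to radare2.
```

Models fine-tuned for tool use (like the functionary models mentioned above) tend to follow this schema much more reliably than generic instruct models.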