llama.cpp streaming support for tool calling & thoughts was just merged: please test & report any issues π
github.com/ggml-org/lla...
#llamacpp
@ochafik.bsky.social
OpenSCAD & llama.cpp contributor. Used to dream in Scala, now waiting for a reason to learn Rust. ex-Googler (views expressed = my own) he/him π³οΈβπ Also found at: @ochafik@fosstodon.org (Slowly moving out of https://x.com/ochafik)
llama.cpp streaming support for tool calling & thoughts was just merged: please test & report any issues π
github.com/ggml-org/lla...
#llamacpp
Runs anywhere (incl. Raspberry Pi 5).
On a Mac:
brew install llama.cpp
llama-server --jinja -fa -hf bartowski/Qwen2.5-7B-Instruct-GGUF:Q4_K_M
Still fresh / lots of bugs to discover: feedback welcome!
Shoot out to @ggerganov and @ngxson for the patient reviews and general amazing work!
EOTπ§΅.
Llama.cpp now supports tool calling (OpenAI-compatible)
github.com/ggerganov/ll...
On top of generic support for *all* models, it supports 8+ modelsβ native formats:
- Llama 3.x
- Functionary 3
- Hermes 2/3
- Qwen 2.5
- Mistral Nemo
- Firefunction 3
- DeepSeek R1
π§΅ #llamacpp
Shout out to @ggerganov and the amazing contributors to his llama.cpp adventure for creating such a welcoming and technically thrilling project. One of the most rewarding places to invest hobby time in :-)
github.com/ggerganov/ll...
π§΅ 4/4
Note that llama.cpp already has best-in-class JSON schema constraints support, which some already use for tool calling / which my PR builds on (it's been a fun year of PRs!):
github.com/ggerganov/ll...
π§΅ 3/4
Forked this off my PR that brings fully-grammar constrained tool call to *all* models (with native prompting style for a few of them):
github.com/ggerganov/ll...
π§΅ 2/4
Universal llama.cpp tool call is coming: I've just released Minja, a minimalistic Jinja template engine reimplementation in C++ for LLM chat templates:
github.com/google/minja
(*not an official Google product*)
#LLM #AI #EdgeAI #OSS
π§΅ 1/4
To notch it up one bit, you can also specify your own JSON schema (to, say, a list of at between 5 and 10 strings, each conforming to a specific regexp), we've got one of the best support out there
github.com/ggerganov/ll...
A Gyroid model generated by Manifold's SDF LevelSet function.
#Manifold v3.0 is out! This is a huge release - we have removed *all* required dependencies.
Our npm package is half the size and twice the speed. Our #SDF LevelSet is much faster and higher quality.
And so much more: github.com/elalish/mani...