Dagfinn Parnas

@elsewhat.bsky.social

Architecture and emerging technologies. Soft spot for local llms and multi-agent scenarios

83 Followers  |  129 Following  |  11 Posts  |  Joined: 30.06.2023

Latest posts by elsewhat.bsky.social on Bluesky

Nice, Silkworm next?

10.02.2025 21:57 | 👍 0    🔁 0    💬 1    📌 0

My favorite books read in 2024

18.12.2024 18:35 | 👍 0    🔁 0    💬 0    📌 0

Yes, the error is in the ollama setup of the model, as far as I can see. In some cases ollama issues can also have their root in llama.cpp.

06.12.2024 13:06 | 👍 0    🔁 0    💬 0    📌 0
Add stop word <|endoftext|> to qwq models · Issue #7967 · ollama/ollama What is the issue? The qwq models currently go into an infinite loop. The reasons for this appears that the model outputs <|endoftext|> at the end of its response, but ollama does not handle this a...

Created a bug report for ollama now
github.com/ollama/ollam...

06.12.2024 11:05 | 👍 2    🔁 0    💬 1    📌 0

Same thing I saw. Basically Ollama doesn't stop the llm when the model indicates it's done through the <|endoftext|> token.

Fixed for me with the custom model file linked above (it is based on FROM qwq:latest), which can be imported with:
ollama create qwk-fix-stop:latest -f qwq-fix-stop-modelfile.md

05.12.2024 21:32 | 👍 1    🔁 0    💬 2    📌 0
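
The stop-word fix described in the post above can be sketched roughly as follows. This is a minimal illustration, not the contents of the linked model file, which are not shown here. It assumes the fix amounts to adding a `PARAMETER stop` line for `<|endoftext|>` on top of `FROM qwq:latest`. The helper names `build_modelfile` and `truncate_at_stop` are hypothetical, and the second helper is only a client-side fallback for output that has already run past the token.

```python
# Sketch only: the real model file may contain more (template, other parameters).
STOP_TOKEN = "<|endoftext|>"


def build_modelfile(base_model: str = "qwq:latest", stop_tokens=(STOP_TOKEN,)) -> str:
    """Return Modelfile text that derives from base_model and adds stop sequences."""
    lines = [f"FROM {base_model}"]
    for token in stop_tokens:
        # One PARAMETER stop line per stop sequence.
        lines.append(f'PARAMETER stop "{token}"')
    return "\n".join(lines) + "\n"


def truncate_at_stop(text: str, stop_tokens=(STOP_TOKEN,)) -> str:
    """Client-side fallback: cut the text at the earliest stop token, if any."""
    cut = len(text)
    for token in stop_tokens:
        idx = text.find(token)
        if idx != -1:
            cut = min(cut, idx)
    return text[:cut]


if __name__ == "__main__":
    # Print the generated Modelfile text so it can be redirected to a file.
    print(build_modelfile(), end="")
```

Saving the printed text to a file and passing that file to `ollama create <name> -f <file>`, as in the post, would register a derived model that carries the extra stop sequence.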

Think the ollama modelfile for qwq is missing a stopword for <|endoftext|>

See if this helps

github.com/elsewhat/adv...

05.12.2024 21:17 | 👍 0    🔁 0    💬 1    📌 1
Advent of Code 2024

Testing out qwq local llm against adventofcode.com

The chain of thought reasoning of this 32b model is massively impressive.

See for example
github.com/elsewhat/adv...

Tested days 1, 2, and 3; all were solved correctly on the first attempt

05.12.2024 18:52 | 👍 1    🔁 0    💬 0    📌 0

Great planning for the fourth session of our Bouvet internal architecture school. It focuses on architecture in challenging deliveries through
1. Real life story telling
2. How to get back on track from delivery leads
3. Identify the signals
4. Incident mgmt and post mortems
5. Secure architecture

05.12.2024 13:52 | 👍 1    🔁 0    💬 0    📌 0
Wind and Truth (The Stormlight Archive, #5) The long-awaited explosive climax to the first arc of t…

Only two more days till the release of the final book in Brandon Sanderson's Stormlight Archive series
www.goodreads.com/book/show/20...

04.12.2024 19:48 | 👍 0    🔁 0    💬 0    📌 0
qwq QwQ is an experimental research model focused on advancing AI reasoning capabilities.

In awe of the QwQ 32B model's reasoning skills and chain of thought. The Q4 version runs fully in memory on my local 4090 GPU with great speed.

Plan to test some more on the advent of code tasks

ollama.com/library/qwq

04.12.2024 19:18 | 👍 0    🔁 0    💬 0    📌 0

Alt+F4 on x account.
Ahhh and now a fresh start

04.12.2024 19:14 | 👍 3    🔁 0    💬 0    📌 0

@elsewhat is following 20 prominent accounts