Shining Valiant 3 :) huggingface.co/ValiantLabs/...
07.07.2025 12:13 β π 0 π 0 π¬ 0 π 0@sequelbox.bsky.social
captain of the valiant pirates. https://huggingface.co/sequelbox
Shining Valiant 3 :) huggingface.co/ValiantLabs/...
07.07.2025 12:13 β π 0 π 0 π¬ 0 π 0cobalt 2 coming soon. we have models to build
18.05.2025 00:16 β π 0 π 0 π¬ 0 π 0esper 3 is here :) coding, architecture, DevOps, creative reasoning, general chat. Get it on HF: huggingface.co/ValiantLabs/...
07.05.2025 02:16 β π 0 π 0 π¬ 0 π 0sneak preview is now up on HF!
25.02.2025 00:51 β π 0 π 0 π¬ 0 π 0Tachibana 2 coming ASAP :) using deepseek r1 :) for everyone
22.02.2025 18:31 β π 0 π 0 π¬ 1 π 0Raiden-Deepseek-R1 is here! A deep-dive dataset into the creative, analytic, and reasoning capabilities of Deepseek's R1 model. Available on @hf.co for everyone: huggingface.co/datasets/seq...
11.02.2025 05:16 β π 0 π 0 π¬ 0 π 0next R1 reasoning dataset after Raiden is technical as well. Seems like the type of thing you'd see a new model release along with :) coming soon!
05.02.2025 19:17 β π 0 π 0 π¬ 0 π 0New sneak peek at my first dataset with Deepseek's excellent 685b R1 model - Raiden uses creative and analytical prompts to challenge R1's reasoning skills. For everyone to use: huggingface.co/datasets/seq...
04.02.2025 03:34 β π 0 π 0 π¬ 0 π 0cool, excited to try it out! glad to see apache license :)
31.01.2025 00:42 β π 1 π 0 π¬ 0 π 0upcoming Deepseek-R1 datasets: creative-reasoning and code-reasoning first. SV3 datasets will come too! love what I've seen so far
27.01.2025 22:36 β π 0 π 0 π¬ 0 π 0very impressed with the new deepseek release + distilled versions! Datasets coming soon :)
20.01.2025 18:52 β π 2 π 0 π¬ 0 π 0BIG release by DeepSeek AIπ₯π₯π₯
DeepSeek-R1 & DeepSeek-R1-Zero: two 660B reasoning models are here, alongside 6 distilled dense models (based on Llama & Qwen) for the community!
huggingface.co/deepseek-ai
huggingface.co/deepseek-ai/...
more QVQ datasets on the way soon! The next dataset release uses creative-generalist reasoning prompts.
13.01.2025 05:52 β π 0 π 0 π¬ 0 π 0Tachibana-QVQ is here! Code-reasoning and code-instruct data generated by Qwen's QVQ 72b model. Get it now on @hf.co at huggingface.co/datasets/seq...
07.01.2025 20:51 β π 6 π 1 π¬ 0 π 0Check out the new preview release of Tachibana-QVQ: code-reasoning data created by Qwen's new QVQ-72B-Preview model! Take a look at QVQ's coding ability: huggingface.co/datasets/seq...
30.12.2024 23:23 β π 0 π 0 π¬ 0 π 0* Blog: qwenlm.github.io/blog/qvq-72b...
* HF: huggingface.co/collections/...
* ModelScope: modelscope.cn/models/Qwen/...
* Kaggle: kaggle.com/models/qwen-...
* Demo: huggingface.co/spaces/Qwen/...
Excited to compare it to QwQ 32b!
24.12.2024 19:07 β π 0 π 0 π¬ 0 π 0Find my open-source datasets on @hf.co, including science-instruct, code-instruct, DevOps, general chat, multi-turn and more! For everyone to use. More to come soon!
23.12.2024 22:28 β π 2 π 0 π¬ 0 π 0