's Avatar

@tscholak.bsky.social

Lead Research Scientist @servicenowresearch.bsky.social. All opinions my own.

25 Followers  |  28 Following  |  7 Posts  |  Joined: 11.04.2025  |  1.4837

Latest posts by tscholak.bsky.social on Bluesky

Huge thanks and congrats to the SLAM team and @servicenowresearch.bsky.social ๐Ÿ™Œโค๏ธ
And a special shoutout to Sathwik, best co-lead anyone could ask for.

11.04.2025 20:16 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

๐Ÿง  Researchers: run it
๐Ÿงฐ Engineers: fine-tune it
๐Ÿงช Builders: break it
Tell us what you find.
Apriel-5B models are permissively licensed (MIT) and ready to chat.
#Apriel #LLM #AI #OpenWeights #FastLLM #SLAM #ServiceNow #ServiceNowResearch

11.04.2025 20:15 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Apriel is our proving ground:
๐Ÿงช Fast, cheap, high-quality model training
๐Ÿ“ฆ Compact models that generalize well
This is just the start.

11.04.2025 20:15 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
GitHub - ServiceNow/Fast-LLM: Accelerating your LLM training to full speed! Made with โค๏ธ by ServiceNow Research Accelerating your LLM training to full speed! Made with โค๏ธ by ServiceNow Research - ServiceNow/Fast-LLM

And we did it with just:
๐Ÿ–ฅ๏ธ 480 x H100s
โฑ๏ธ ~91,000 H100-hours
๐Ÿงฎ 4.8B params, bfloat16
๐Ÿ’ธ 2.3 x fewer GPU hours than OLMo-2-7B
Thanks to Fast-LLM, github.com/ServiceNow/F..., our custom training stack for speed and scale. No hacks. Just better infra.

11.04.2025 20:15 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

๐Ÿ“Š Benchmarks (lm-eval-harness):
๐Ÿ’ฅ Beats OLMo-2-7B-Instruct and Mistral-Nemo-12B-Instruct on avg
๐Ÿ’ฅ Competitive with LLama-3.1-8B-Instruct, beats it in math benchmarks and IF Eval

11.04.2025 20:15 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

We're releasing:
๐Ÿง  Apriel-5B-Base: pretrained, general-purpose decoder
๐Ÿง‘โ€๐Ÿซ Apriel-5B-Instruct: chat-style variant for aligned outputs
Trained on 4.5T+ tokens.
๐Ÿ‘‰ huggingface.co/ServiceNow-AI/Apriel-5B-Base
๐Ÿ‘‰ huggingface.co/ServiceNow-AI/Apriel-5B-Instruct

11.04.2025 20:14 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

๐Ÿšจ SLAM Labs presents Apriel-5B! And it lands right in the green zone ๐Ÿšจ
Speed โšก + Accuracy ๐Ÿ“ˆ + Efficiency ๐Ÿ’ธ
This model punches above its weight, beating bigger LLMs while training on a fraction of the compute.
Built with Fast-LLM, our in-house training stack.
๐Ÿงต๐Ÿ‘‡

11.04.2025 20:14 โ€” ๐Ÿ‘ 4    ๐Ÿ” 2    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 2

@tscholak is following 20 prominent accounts