Huge thanks and congrats to the SLAM team and @servicenowresearch.bsky.social ❤️
And a special shoutout to Sathwik, best co-lead anyone could ask for.
11.04.2025 20:16
Researchers: run it
Engineers: fine-tune it
Builders: break it
Tell us what you find.
Apriel-5B models are permissively licensed (MIT) and ready to chat.
#Apriel #LLM #AI #OpenWeights #FastLLM #SLAM #ServiceNow #ServiceNowResearch
11.04.2025 20:15
Apriel is our proving ground:
Fast, cheap, high-quality model training
Compact models that generalize well
This is just the start.
11.04.2025 20:15
GitHub - ServiceNow/Fast-LLM: Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research
And we did it with just:
480 x H100s
~91,000 H100-hours
4.8B params, bfloat16
2.3x fewer GPU hours than OLMo-2-7B
Thanks to Fast-LLM (github.com/ServiceNow/F...), our custom training stack for speed and scale. No hacks. Just better infra.
11.04.2025 20:15
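For readers who want to sanity-check the compute figures in the post above, here is a rough back-of-the-envelope sketch. It uses only the numbers quoted in the thread. The helper names are made up for illustration, the wall-clock estimate assumes all 480 GPUs were busy for the entire run (the thread does not say so), and the OLMo-2-7B figure is merely what the "2.3x" ratio implies, not a reported number.

```python
# Figures quoted in the thread (the GPU-hour count is approximate: "~91,000").
NUM_GPUS = 480
GPU_HOURS = 91_000
PARAMS = 4.8e9
RATIO_VS_OLMO = 2.3   # "2.3x fewer GPU hours than OLMo-2-7B"
BYTES_PER_BF16 = 2    # bfloat16 is a 16-bit format


def wall_clock_days(gpu_hours: float, num_gpus: int) -> float:
    """Idealized wall-clock days, ASSUMING every GPU is busy the whole time."""
    return gpu_hours / num_gpus / 24


def implied_baseline_gpu_hours(gpu_hours: float, ratio: float) -> float:
    """GPU-hours implied for the comparison model by the quoted ratio."""
    return gpu_hours * ratio


def weight_size_gb(params: float, bytes_per_param: int) -> float:
    """Size of the raw weights in decimal gigabytes (excludes activations, KV cache)."""
    return params * bytes_per_param / 1e9


if __name__ == "__main__":
    print(f"Wall clock: {wall_clock_days(GPU_HOURS, NUM_GPUS):.1f} days")
    print(f"Implied OLMo-2-7B budget: "
          f"{implied_baseline_gpu_hours(GPU_HOURS, RATIO_VS_OLMO):,.0f} GPU-hours")
    print(f"bfloat16 weights: {weight_size_gb(PARAMS, BYTES_PER_BF16):.1f} GB")
```

Under those assumptions the run works out to a bit under 8 days of wall-clock time, and the weights alone take about 9.6 GB in bfloat16.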
Benchmarks (lm-eval-harness):
Beats OLMo-2-7B-Instruct and Mistral-Nemo-12B-Instruct on average
Competitive with Llama-3.1-8B-Instruct, and beats it on math benchmarks and IFEval
11.04.2025 20:15
We're releasing:
Apriel-5B-Base: pretrained, general-purpose decoder
Apriel-5B-Instruct: chat-style variant for aligned outputs
Trained on 4.5T+ tokens.
huggingface.co/ServiceNow-AI/Apriel-5B-Base
huggingface.co/ServiceNow-AI/Apriel-5B-Instruct
11.04.2025 20:14
SLAM Labs presents Apriel-5B! And it lands right in the green zone.
Speed + Accuracy + Efficiency
This model punches above its weight, beating bigger LLMs while training on a fraction of the compute.
Built with Fast-LLM, our in-house training stack.
11.04.2025 20:14
Stanford Linguistics and Computer Science. Director, Stanford AI Lab. Founder of @stanfordnlp.bsky.social . #NLP https://nlp.stanford.edu/~manning/
Climate & AI Lead @HuggingFace, TED speaker, WiML board member, TIME AI 100 (She/her/Dr)
So far I have not found the science, but the numbers keep on circling me.
Views my own, unfortunately.
Professor, Programmer in NYC.
Cornell, Hugging Face 🤗
The AI community building the future!
https://Answer.AI & https://fast.ai founding CEO; previous: hon professor @ UQ; leader of masks4all; founding CEO Enlitic; founding president Kaggle; various other stuff…
I like tokens! Lead for OLMo data at @ai2.bsky.social (Dolma) w/ @kylelo.bsky.social. Open source is fun. Opinions are sampled from my own stochastic parrot
more at https://soldaini.net
I make sure that OpenAI et al. aren't the only people who are able to study large scale AI systems.
Working towards the safe development of AI for the benefit of all at Université de Montréal, LawZero and Mila.
A.M. Turing Award Recipient and most-cited AI researcher.
https://lawzero.org/en
https://yoshuabengio.org/profile/
Unlock work experiences of the future. Join ServiceNow Research as we advance the state-of-the-art in Enterprise AI. https://www.servicenow.com/research/ #ServiceNowResearch #LifeAtNow #Hiring
Passionate about AI and its impact on society • VP, Research, ServiceNow • Associate industrial member, Mila • Adjunct professor, Polytechnique Montréal • Co-Founder, Imagia & Element AI
Researcher, coder, entrepreneur, kind. CS PhD, ex-Google, ElementAI co-founder. En français: @fr.beaudoin.social Current project: https://numeno.ai #FreeOurFeeds
Assistant Professor @Mila-Quebec.bsky.social
Co-Director @McGill-NLP.bsky.social
Researcher @ServiceNow.bsky.social
Alumni: @StanfordNLP.bsky.social, EdinburghNLP
Natural Language Processor #NLProc
Research Scientist at DeepMind. Opinions my own. Inventor of GANs. Lead author of http://www.deeplearningbook.org . Founding chairman of www.publichealthactionnetwork.org
AI x storytelling
AI Engineering: https://amazon.com/dp/1098166302
Designing ML Systems: http://amazon.com/dp/1098107969
@chipro
Applied Research Scientist working on LLMs at @ServiceNow. Opinions are my own.
Sr Mgr & Research Scientist @ServiceNowRSRCH, Montreal
AI Ecosystem Director @ServiceNow @ServiceNowResearch @BigCodeProject #TheAIAlliance - formerly @IntelAI @ActianCorp @HPE - All posts are my own opinion.
Visiting Researcher at @ServiceNowRSRCH | PhD student in @mcgillu and @Mila_Quebec | Prev. @RecursionPharma
https://aarashfeizi.github.io/