We have Nvidia B200s ready to go for you in Hugging Face Inference Endpoints
I tried them out myself and the performance is amazing.
On top of that, we just got a fresh batch of H100s as well. At $4.5/hour, the H100 is a clear winner on price/perf compared to the A100.
06.10.2025 08:44
We just refreshed our analytics in @hf.co endpoints. More info below!
21.03.2025 17:45
Morning workout at the @hf.co Paris office is imo one of the best perks.
13.03.2025 11:24
Gemma 3 is live
You can deploy it directly from endpoints with optimally selected hardware and configuration.
Give it a try
12.03.2025 11:28
Apparently, mom is a better engineer than I am.
24.12.2024 13:03
https://github.com/ErikKaum/bitbubble
today as part of a course, I implemented a program that takes a bit stream like so:
10001001110111101000100111111011
and decodes the intel 8088 assembly from it like:
mov si, bx
mov bx, di
only works on the mov instruction, register to register.
code: github.com/ErikKaum/bit...
21.12.2024 21:17
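The decoding described in the post above can be sketched roughly as follows. This is not the code from the linked repo: it is a minimal illustration in Python, `decode_mov` is a hypothetical name, and it assumes the input is a string of '0'/'1' characters holding only 2-byte register-to-register `mov` instructions.

```python
# Register tables from the 8086/8088 encoding, indexed by the 3-bit field.
REGS_8 = ["al", "cl", "dl", "bl", "ah", "ch", "dh", "bh"]   # W = 0
REGS_16 = ["ax", "cx", "dx", "bx", "sp", "bp", "si", "di"]  # W = 1

MOV_REG_RM = 0b100010  # opcode bits for "mov register/memory to/from register"


def decode_mov(bits: str) -> list[str]:
    """Decode a '0'/'1' string into register-to-register mov instructions.

    Hypothetical helper for illustration; anything else raises ValueError.
    """
    if len(bits) % 16 != 0:
        raise ValueError("expected a whole number of 2-byte instructions")
    out = []
    for i in range(0, len(bits), 16):
        first = int(bits[i:i + 8], 2)
        second = int(bits[i + 8:i + 16], 2)
        opcode = first >> 2
        d = (first >> 1) & 1          # 1: REG field is the destination
        w = first & 1                 # 1: 16-bit registers, 0: 8-bit
        mod = second >> 6             # 0b11 means register-to-register
        reg = (second >> 3) & 0b111
        rm = second & 0b111
        if opcode != MOV_REG_RM or mod != 0b11:
            raise ValueError("only register-to-register mov is supported")
        table = REGS_16 if w else REGS_8
        dst, src = (reg, rm) if d else (rm, reg)
        out.append(f"mov {table[dst]}, {table[src]}")
    return out


print("\n".join(decode_mov("10001001110111101000100111111011")))
# mov si, bx
# mov bx, di
```

The D bit decides which of the two register fields is the destination, which is why the first instruction reads `mov si, bx` even though `bx` sits in the REG field.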
YouTube video by Founders, Inc.
before you give up, give this video a chance.
Ambition is a paradox.
You should always aim higher, but that easily becomes a state where you're never satisfied. Just reached 10k MRR. Now there's the next goal of 20k.
Sharif has a good talk on this: emotional runway.
How do you deal with this paradox?
video: www.youtube.com/watch?v=zUnQ...
17.12.2024 10:12
There's some deep wisdom in that as well!
08.12.2024 14:38
Qui-Gon Jinn sharing some insightful prompting wisdom
08.12.2024 09:03
Exactly.
Suppose we have an algorithm that is guaranteed to give output according to a structure, with the caveat that it might run out of tokens.
Should this still be classified as structured generation?
06.12.2024 09:42
[emoji]
05.12.2024 20:22
CUDA libraries..? So they have access to GPUs as well?
05.12.2024 19:33
A video series on how to develop, profile and compare CUDA kernels would be such a banger.
And allow a lot more tinkerers to enter the field.
05.12.2024 19:32
Hell yeah
How would you classify the edge case when running out of tokens?
E.g. if it goes into a "\n" loop and runs out of tokens.
05.12.2024 19:29
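To make the edge case in the post above concrete, here is a toy sketch. It does not use a real model or any structured-generation library: `newline_loop` is a hypothetical stand-in for a decoder that only ever emits "\n" after some prefix, and JSON is assumed as the target structure. Every emitted character is legal JSON whitespace, yet the result is only a valid prefix and not a complete document.

```python
import json


def is_complete_json(text: str) -> bool:
    """Return True if text parses as a complete JSON document."""
    try:
        json.loads(text)
        return True
    except json.JSONDecodeError:
        return False


def newline_loop(prefix: str, max_tokens: int) -> str:
    """Hypothetical degenerate decoder: after the prefix it emits only "\n"
    (allowed by the JSON grammar) until the token budget is used up."""
    return prefix + "\n" * max_tokens


truncated = newline_loop('{"name":', max_tokens=16)

# Each step was grammar-legal, but the budget ran out before the value.
print(is_complete_json(truncated))           # False
# The same text can still be completed, so it is a valid prefix.
print(is_complete_json(truncated + '"x"}'))  # True
```

Whether that counts as structured generation depends on whether the guarantee is read as "every step follows the structure" or "the final output parses".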
Hah, fair!
01.12.2024 11:01
Interesting, for me it's snappy as hell. Maybe things aren't cached as well in Costa Rica?
01.12.2024 10:15
pro tip for the borrow-checker: using .clone() everywhere is okay
01.12.2024 10:13
it's this time of the year
01.12.2024 10:12
Or then you can let the model run free in a constrained environment.
I'm tinkering on this: bsky.app/profile/erik...
27.11.2024 15:53
Hugging Face Inference Endpoints now support CPU deployment for llama.cpp
Why is this a huge deal? Llama.cpp is well known for running very well on CPU. If you're running small models like Llama 1B or embedding models, this will definitely save tons of money
27.11.2024 11:01
Nice! This is so neat
26.11.2024 22:23
Let's go! We are releasing SmolVLM, a smol 2B VLM built for on-device inference that outperforms all models at similar GPU RAM usage and token throughput.
SmolVLM can be fine-tuned on a Google Colab and run on a laptop! Or process millions of documents with a consumer GPU!
26.11.2024 15:57
Is it just me, or does it feel intuitive that chat bars sit at the bottom of the page and search bars at the top?
I've noticed that Perplexity positions the question at the top and generates the text below.
Is it because they want to position themselves more as a search engine?
26.11.2024 13:54
The hope I have with Bluesky is that I, as a user, can do moderation more efficiently than I could on Twitter
25.11.2024 18:18
Feeds and starter packs helped me a lot, at least. E.g.: bsky.app/profile/did:...
25.11.2024 13:19
Indeed, the beauty of open source
24.11.2024 14:28
Can't wait to have that feature!
It's kinda mind blowing that it's not a thing on other social media platforms
24.11.2024 10:25
code boxes with syntax highlighting
24.11.2024 10:19