
Erik

@erikkaum.bsky.social

SWE @hf.co

821 Followers  |  133 Following  |  37 Posts  |  Joined: 31.10.2024  |  2.1402

Latest posts by erikkaum.bsky.social on Bluesky

The New and Fresh analytics in Hugging Face Endpoints: We're on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co/blog/endpoin...

21.03.2025 17:46 | 👍 0 | 🔁 0 | 💬 0 | 📌 0

We just refreshed 🍋 our analytics in @hf.co endpoints. More info below!

21.03.2025 17:45 | 👍 8 | 🔁 3 | 💬 1 | 📌 0

Morning workout at the @hf.co Paris office is imo one of the best perks.

13.03.2025 11:24 | 👍 3 | 🔁 0 | 💬 0 | 📌 0
Deploy google/gemma-3-27b-it | Inference Endpoints by Hugging Face: Deploy google/gemma-3-27b-it for image-text-to-text in 1 click.

Link to deploy:
endpoints.huggingface.co/huggingface/...

12.03.2025 11:29 | 👍 0 | 🔁 0 | 💬 0 | 📌 0

Gemma 3 is live 🔥

You can deploy it directly from endpoints with optimally selected hardware and configuration.

Give it a try 👇

12.03.2025 11:28 | 👍 7 | 🔁 3 | 💬 1 | 📌 0

Apparently, mom is a better engineer than I am.

24.12.2024 13:03 | 👍 4 | 🔁 0 | 💬 0 | 📌 0
https://github.com/ErikKaum/bitbubble

today as part of a course, I implemented a program that takes a bit stream like so:

10001001110111101000100111111011

and decodes the Intel 8088 assembly from it like:

mov si, bx
mov bx, di

only works on the mov instruction, register to register.

code: github.com/ErikKaum/bit...
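
For readers who want to see the idea, here is a minimal Python sketch of that decoding step. It is not the code from the linked repo; the function name `decode_mov` and the table names are made up for illustration. It assumes every instruction is exactly two bytes: an opcode byte whose top six bits are `100010` followed by the D and W flags, then a ModRM byte with `mod = 11` (register to register).

```python
# Register names indexed by the 3-bit REG or R/M field.
REGS_8 = ["al", "cl", "dl", "bl", "ah", "ch", "dh", "bh"]   # used when W = 0
REGS_16 = ["ax", "cx", "dx", "bx", "sp", "bp", "si", "di"]  # used when W = 1

MOV_REG_REG = 0b100010  # top six bits of the opcode byte


def decode_mov(bits: str) -> list[str]:
    """Decode a string of '0'/'1' characters into register-to-register movs."""
    if len(bits) % 16 != 0:
        raise ValueError("expected a whole number of 2-byte instructions")
    if not bits:
        return []
    data = int(bits, 2).to_bytes(len(bits) // 8, "big")

    out = []
    for i in range(0, len(data), 2):
        opcode, modrm = data[i], data[i + 1]
        if opcode >> 2 != MOV_REG_REG:
            raise ValueError(f"unsupported opcode byte {opcode:#04x}")
        d = (opcode >> 1) & 1      # 1: REG is the destination, 0: REG is the source
        w = opcode & 1             # 1: 16-bit registers, 0: 8-bit registers
        mod = modrm >> 6
        reg = (modrm >> 3) & 0b111
        rm = modrm & 0b111
        if mod != 0b11:
            raise ValueError("only register-to-register mode (mod = 11) is handled")
        table = REGS_16 if w else REGS_8
        dst, src = (reg, rm) if d else (rm, reg)
        out.append(f"mov {table[dst]}, {table[src]}")
    return out


print(decode_mov("10001001110111101000100111111011"))
# ['mov si, bx', 'mov bx, di']
```

The bit stream from the post splits into the bytes 0x89 0xDE 0x89 0xFB, which is how the two instructions above fall out.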

21.12.2024 21:17 | 👍 1 | 🔁 0 | 💬 0 | 📌 0
YouTube video by Founders, Inc.: before you give up, give this video a chance.

Ambition is a paradox.

You should always aim higher, but that easily becomes a state where you're never satisfied. Just reached 10k MRR. Now there's the next goal of 20k.

Sharif has a good talk on this: emotional runway.

How do you deal with this paradox?

video: www.youtube.com/watch?v=zUnQ...

17.12.2024 10:12 | 👍 1 | 🔁 0 | 💬 0 | 📌 0

There's some deep wisdom in that as well!

08.12.2024 14:38 | 👍 1 | 🔁 0 | 💬 0 | 📌 0

Qui Gon Jinn sharing some insightful prompting wisdom 👌🏼

08.12.2024 09:03 | 👍 11 | 🔁 3 | 💬 1 | 📌 0

Exactly.

Suppose we have an algorithm that is guaranteed to give output according to a structure, with the caveat that it might run out of tokens.

Should this still be classified as structured generation?

06.12.2024 09:42 | 👍 1 | 🔁 0 | 💬 1 | 📌 0

🤔

05.12.2024 20:22 | 👍 1 | 🔁 0 | 💬 0 | 📌 0

CUDA libraries..? So they have access to GPUs as well? 👀

05.12.2024 19:33 | 👍 1 | 🔁 0 | 💬 1 | 📌 0

A video series on how to develop, profile and compare CUDA kernels would be such a banger.

And allow a lot more tinkerers to enter the field.

05.12.2024 19:32 | 👍 0 | 🔁 0 | 💬 0 | 📌 0

Hell yeah 🔥

How would you classify the edge case when running out of tokens?

E.g. if it goes into a "\n" loop and runs out of tokens.
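
To make that edge case concrete, here is a toy Python sketch. Everything in it is hypothetical: the three-state grammar, the `generate` function and the `prefer` ranking (a stand-in for model logits) are made up for illustration and are not any library's API. Every token it emits is allowed by the grammar, yet a model that keeps preferring "\n" never reaches the closing bracket before the budget runs out.

```python
# Toy grammar: '[' followed by any number of '\n', then '1', then ']'.
# Each state maps an allowed token to the next state.
TRANSITIONS = {
    "start": {"[": "open"},
    "open": {"\n": "open", "1": "item"},
    "item": {"]": "done"},
}


def generate(prefer: list[str], max_tokens: int) -> tuple[str, bool]:
    """Constrained decoding: always emit the highest-ranked token the grammar allows.

    Returns the generated text and whether the structure was completed.
    """
    state, out = "start", []
    for _ in range(max_tokens):
        if state == "done":
            break
        allowed = TRANSITIONS[state]
        token = next(t for t in prefer if t in allowed)
        out.append(token)
        state = allowed[token]
    return "".join(out), state == "done"


# A model that loves newlines: valid prefix, but the structure is never closed.
print(generate(["\n", "[", "1", "]"], max_tokens=5))  # ('[\n\n\n\n', False)

# A model that prefers the digit finishes well within the budget.
print(generate(["1", "[", "]", "\n"], max_tokens=5))  # ('[1]', True)
```

In the first run the output never violates the grammar at any step, but it is only a valid prefix, which is exactly the classification question raised in the post.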

05.12.2024 19:29 | 👍 2 | 🔁 0 | 💬 1 | 📌 0

Hah, fair!

01.12.2024 11:01 | 👍 1 | 🔁 0 | 💬 0 | 📌 0

Interesting, for me it's snappy as hell, maybe things aren't cached as well in Costa Rica? 🤔

01.12.2024 10:15 | 👍 0 | 🔁 0 | 💬 1 | 📌 0

pro tip for the borrow-checker, using .clone() everywhere is okay 🙌

01.12.2024 10:13 | 👍 0 | 🔁 0 | 💬 1 | 📌 0

it's this time of the year 😍

01.12.2024 10:12 | 👍 2 | 🔁 0 | 💬 0 | 📌 0

Or you can let the model run free in a constrained environment.

I'm tinkering on this: bsky.app/profile/erik...

27.11.2024 15:53 | 👍 2 | 🔁 0 | 💬 0 | 📌 0

Hugging Face Inference Endpoints now support CPU deployment for llama.cpp 🚀 🚀

Why is this a huge deal? Llama.cpp is well known for running very well on CPU. If you're running small models like Llama 1B or embedding models, this will definitely save tons of money 💰 💰

27.11.2024 11:01 | 👍 23 | 🔁 6 | 💬 3 | 📌 1

Nice! This is so neat 🙌🏽

26.11.2024 22:23 | 👍 1 | 🔁 0 | 💬 0 | 📌 0

Let's go! We are releasing SmolVLM, a smol 2B VLM built for on-device inference that outperforms all models at similar GPU RAM usage and token throughput.

SmolVLM can be fine-tuned on a Google Colab and run on a laptop! Or process millions of documents with a consumer GPU!

26.11.2024 15:57 | 👍 104 | 🔁 22 | 💬 4 | 📌 4

Is it just me or does it intuitively align that chat bars are at the bottom of the page and search bars at the top?

I've noticed that Perplexity positions the question at the top and generates the text below.

Is it because they want to position themselves more as a search engine?

26.11.2024 13:54 | 👍 1 | 🔁 0 | 💬 0 | 📌 0

The hope I have with Bluesky is that I, as a user, can do moderation more efficiently than I could on Twitter 🤞🏼

25.11.2024 18:18 | 👍 12 | 🔁 0 | 💬 0 | 📌 0

Feeds and starter packs helped me a lot, at least. E.g.: bsky.app/profile/did:...

25.11.2024 13:19 | 👍 1 | 🔁 0 | 💬 1 | 📌 0

Indeed, the beauty of open source 🔥

24.11.2024 14:28 | 👍 1 | 🔁 0 | 💬 0 | 📌 0

Can't wait to have that feature!

It's kinda mind blowing that it's not a thing on other social media platforms 🤷🏼‍♂️

24.11.2024 10:25 | 👍 1 | 🔁 0 | 💬 1 | 📌 0

code boxes with syntax highlighting 😍

24.11.2024 10:19 | 👍 4 | 🔁 0 | 💬 1 | 📌 0

typical engineer writing copy

in plain english i'd say "2 conversions at the same time"

23.11.2024 17:38 | 👍 4 | 🔁 0 | 💬 0 | 📌 0

@erikkaum is following 20 prominent accounts