Ivar Flakstad's Avatar

Ivar Flakstad

@ivarf.bsky.social

ML Engineer at ๐Ÿค—

219 Followers  |  151 Following  |  19 Posts  |  Joined: 28.10.2024  |  2.053

Latest posts by ivarf.bsky.social on Bluesky

Howdy all. I'm unfortunately not going to be with my employer for much longer due to team relocation. If anyone has any info on roles that would allow me to continue my Rust compiler work (in New York City), they'd be greatly appreciated.

02.07.2025 17:31 โ€” ๐Ÿ‘ 92    ๐Ÿ” 45    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1
Preview
Building Tensors From Scratch in Rust: Part 1, Core Structure and Indexing A Blog post by Kyle Birnbaum on Hugging Face

I'm writing an article series about creating tensors from scratch in Rust. #tensors #machine-learning #ml #ai

huggingface.co/blog/KeighBe...

12.06.2025 23:56 โ€” ๐Ÿ‘ 5    ๐Ÿ” 3    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

๐Ÿฆ€ Hello World!

The Rust project now has an official presence on Bluesky! โœจ

We'll be posting the same on our Mastodon and Bluesky accounts, so you won't miss anything on either platform.

05.04.2025 10:51 โ€” ๐Ÿ‘ 1480    ๐Ÿ” 287    ๐Ÿ’ฌ 32    ๐Ÿ“Œ 25
fleetwood.dev fleetwood.dev

Want an in depth exploration of the different hardware architectures within AI?
Of course you do :)
Another great article by Chris Fleetwood:
fleetwood.dev/posts/domain...

09.03.2025 13:05 โ€” ๐Ÿ‘ 1    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Performance leap: TGI v3 is out. Processes 3x more tokens, 13x faster than vLLM on long prompts. Zero config !

10.12.2024 10:08 โ€” ๐Ÿ‘ 19    ๐Ÿ” 6    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1

True, but at the same time my man JJB famously said ยซOoh! Ooh! Mooie! Woohoo! Aah!ยป
So yeah

08.12.2024 13:46 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Chart Title: Model Hardware vs Energy per GigaFLOP.
Vertical Axis: mJ/GFLOP(Log)
Horizontal Axis: Hardware Type(CPU, CPU + GPU, CPU + ANE)
CPU: min 6.9 1st quartile 11.7 median 13.4 3rd quartile 35.6 max 53.1
CPU + GPU: 4.6 4.6 4.7 6.2 9.6
CPU + ANE: 0.9 1.0	1.1 1.4 1.8

Chart Title: Model Hardware vs Energy per GigaFLOP. Vertical Axis: mJ/GFLOP(Log) Horizontal Axis: Hardware Type(CPU, CPU + GPU, CPU + ANE) CPU: min 6.9 1st quartile 11.7 median 13.4 3rd quartile 35.6 max 53.1 CPU + GPU: 4.6 4.6 4.7 6.2 9.6 CPU + ANE: 0.9 1.0 1.1 1.4 1.8

Preliminary data shows the Apple Neural Engine uses ~94% less energy than the CPU and ~75% less than the GPU ๐Ÿคฏ

On the On-Device team at Hugging Face, we've been profiling energy usage for CoreML models. Hereโ€™s some data I collected:

05.12.2024 20:08 โ€” ๐Ÿ‘ 4    ๐Ÿ” 1    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0

I, for one, donโ€™t immediately see anything wrong with what youโ€™ve said here.
There are perhaps some exaggerations here and there to drive home your points, but the best thread/rant on the subject (from the side of outraged bluesky users) that Iโ€™ve seen

28.11.2024 17:57 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

It's Sunday morning so taking a minute for a nerdy thread (on math, tokenizers and LLMs) of the work of our intern Garreth

By adding a few lines of code to the base Llama 3 tokenizer, he got a free boost in arithmetic performance ๐Ÿ˜ฎ

[thread]

24.11.2024 11:05 โ€” ๐Ÿ‘ 272    ๐Ÿ” 34    ๐Ÿ’ฌ 5    ๐Ÿ“Œ 5

I guess youโ€™ll have to engage fervently with that content to bring it back. Good luck ๐Ÿซก

21.11.2024 17:02 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Sky tweet

21.11.2024 08:34 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image 21.11.2024 00:51 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

I was just about to tag you hehe

20.11.2024 21:53 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Nils Olav - Wikipedia

In Norway we let penguins lead ๐Ÿซก

en.m.wikipedia.org/wiki/Nils_Olav

20.11.2024 21:46 โ€” ๐Ÿ‘ 4    ๐Ÿ” 0    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0

If you want to dive into async allocators a bit more:
open.spotify.com/episode/2YGI...

20.11.2024 19:13 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Our hardware is usually async in many different ways, but our default programming approach usually isnโ€™t.

For example we approach allocating memory as a sync operation, but it usually isnโ€™t. We could be doing stuff while allocating. Async allocators has a host of fun problems though :)

20.11.2024 19:12 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

when you try to convert your text into smaller pieces but all it gives you is Elvish, thatโ€™s a tolkienizer

20.11.2024 17:50 โ€” ๐Ÿ‘ 1011    ๐Ÿ” 104    ๐Ÿ’ฌ 35    ๐Ÿ“Œ 17
fleetwood.dev fleetwood.dev

RoPE can be confusing, so hereโ€™s a great write up by my buddy Chris Fleetwood on the topic:
fleetwood.dev/posts/you-co...

18.11.2024 15:42 โ€” ๐Ÿ‘ 3    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

I think youโ€™re applying your own (better) logic and improving on what he actually means. Your point has merit, his does not.
Hold people accountable to their exact phrasing.

17.11.2024 22:36 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

He specifically said to stop worrying about climate goals and instead funnel money into AI, no?

Thatโ€™s what you should either agree with or not.
Applying AI in various fields is something else. Sounds good.

17.11.2024 20:52 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Generalisations outside what exists/can be inferred in the training set?
That is unfortunately impossible simply by how training works.

I think continuing to fund ML research is essential. But there are a limited amount of geniuses out there. Wild spending will not improve anything.

17.11.2024 20:27 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Hah I thought you blocked me for agreeing with you because my comments disappeared. Phew.
I guess theyโ€™re gone because the parent comments were removed.

17.11.2024 19:47 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

MLE here.
I donโ€™t want to put you down or anything, but I think you should look into the specifics a little closer.
It is correct that it can only solve the types of problems it has seen.
If the model is able to generalise beyond that then weโ€™ve achieved AGE. Which we have not.

17.11.2024 19:27 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

I work with AI. Specifically the actual implementation of them.
The way LLMs approach to a problem is the exact same approach as finishing a poem.
In other words it does not have the concept of problem solving, it is simply finishing text to the best of its abilities.

17.11.2024 19:23 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0

Why would I follow you if I didnโ€™t want rants and tangents?
Go ahead :)

17.11.2024 13:41 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

It is!
When he first shared the findings some time back I spent some time thinking about how to extract something valuable from it but came up short. All Iโ€™m left with is that itโ€™s fascinating.

I feel like maybe it could tell us something about how to choose optimal precision, but ๐Ÿคท

17.11.2024 13:35 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

@ivarf is following 19 prominent accounts