FCLC (London 🦁)

@fclc.bsky.social

HPC, BLAS, I make things FAST
Standing on the shoulders of giants TLDR; 🇨🇦🐧🧑🏼‍💻🚴🏎️🧗🏼 💩posting. Haver of opinions that are all my own. I mainly do #HPC #BLAS #AI #RVV and #clusters. Proud French Canadian, you'll hear about it (I help with HPC.social)

930 Followers  |  555 Following  |  738 Posts  |  Joined: 04.08.2023

Latest posts by fclc.bsky.social on Bluesky

a white woman with blonde wavy hair wearing a puffy dress is fist bumping a tall red ghostly woman creature

me accepting that my demons are a part of me and that they deserve love too

02.08.2025 23:08 - 👍 565    🔁 118    💬 3    📌 3
alt-J - In Cold Blood (Official Audio)
YouTube video by alt-J

youtu.be/vlMePO0qywA?...

03.08.2025 07:24 - 👍 0    🔁 0    💬 0    📌 0
foreground: bench power supply. it's unremarkable. it's a cheap looking one, the kind you find on aliexpress or ebay or whatever.

background: with the blissful expression only stock photo actors are able to pull off, here's a technician-looking guy, staring off into the mid distance. he's holding a laptop, which he's not looking at. the laptop has some panels removed, exposing its ram and whatever. our guy is *holding a medical stethoscope* against the body of the laptop. thankfully he is wearing safety glasses

dear lady who's holding a soldering iron by the tip, and soldering a motherboard with no solder. i have a guy for you

31.07.2025 00:09 - 👍 123    🔁 22    💬 12    📌 1

One of the biggest things missing from Bsky continues to be Polls.

I use socials in no small part as a way to aggregate and share information. Polls are *easily* the easiest, lowest friction way of doing that.

@jay.bsky.team please please please can that functionality be added?

25.07.2025 15:46 - 👍 4    🔁 0    💬 1    📌 0

*preparing for RISCV vector, IME and AME meetings*

"Just another Matrix Mondayyyyyyy"🎢

24.07.2025 17:35 - 👍 6    🔁 0    💬 0    📌 0

For users of #OpenFOAM, what is your most used solver?

#hpc #CFD

22.07.2025 14:31 - 👍 1    🔁 2    💬 2    📌 0

💥Spack v1.0 is out!💥

This is a huge milestone. We reworked the core to add compiler dependencies, and we're introducing a stable package API.

🚀1.0 also adds concurrent builds, better includes, and much more -- read it all in the release notes!

github.com/spack/spack/...

20.07.2025 10:45 - 👍 41    🔁 16    💬 0    📌 5

Hey folks! Will be in the Bay Area from ~25 of July to 2nd of August! If we can meet up, we should!!

20.07.2025 01:34 - 👍 4    🔁 0    💬 0    📌 0

Remember when defragmenting a hard drive was a thing that was important to do and you paid for software to do it?

19.07.2025 12:02 - 👍 5    🔁 1    💬 3    📌 0
Video thumbnail
19.07.2025 12:14 - 👍 2    🔁 1    💬 0    📌 0

Come see us at #PEARC25!

16.07.2025 15:13 - 👍 5    🔁 3    💬 0    📌 0

all constant floats aren't floats James!!!!!
given
float foo = (1/3)*3
and
const float bar = (1/3)*3

foo != bar

!!!!!!!!!!!!!!!!!!!!!!

13.07.2025 00:51 - 👍 2    🔁 0    💬 2    📌 0

I have been convinced that Go is mostly an unserious language that should never have been allowed to escape the lab.

13.07.2025 00:40 - 👍 7    🔁 0    💬 3    📌 0

Solar power is now California's largest source of electricity. ☀️🔌💡

12.07.2025 14:21 - 👍 1810    🔁 407    💬 24    📌 33

seems intel has since redacted this section...

11.07.2025 16:48 - 👍 7    🔁 1    💬 2    📌 0

Help a journalist out? I'm looking to talk to researchers, engineers, students or entrepreneurs who are considering leaving the U.S. because they lost funding or are worried about visas or anything like that. I'm at mimsical.94 on Signal

03.07.2025 19:07 - 👍 94    🔁 104    💬 2    📌 1
CppNorth, The Canadian C++ Conference 2025: Why are software engineers so hard to re... View more about this event at CppNorth, The Canadian C++ Conference 2025

CppNorth 2025: Are Devs Obsolete? 🤔

Join Heather Crawford & John Pavan: "Why are software engineers so hard to replace?" Explore why tech from COBOL to AI, meant to replace us, has only made developers more essential.
🔗 sched.co/21xPC
Tickets: CppNorth.ca
🍁 Join us in Toronto, July 20-23! #cpp #dev

26.06.2025 16:27 - 👍 1    🔁 2    💬 0    📌 0

Glenn posted his ISC recap, and as always, it's excellent. The excerpt on mixed precision and Ozaki did have me giggling whilst reading.

I think it's relevant to add some high level context on *why* we vendors decrease FP64; it's not just chasing AI, it's the sheer *cost* of FP64.

The short ...

24.06.2025 14:51 - 👍 17    🔁 3    💬 1    📌 0
Request for Information–Future Generation High Performance Computing Center | HPC @ LLNL This website enables public access to Request for Information No. HPC-007 (RFI) pertaining to a Future Generation High Performance Computing Center. The RFI points of contact are LLNS Contract Analyst...

We've got a request for information out on where we want to take Livermore Computing and other #HPC centers in the next five years.

hpc.llnl.gov/fg-hpcc-rfi

Check it out and send us your thoughts.

25.06.2025 23:21 - 👍 14    🔁 8    💬 2    📌 0
GitHub - tenstorrent/tt-npe: A simple network-on-chip performance estimator (NPE) for Tenstorrent Tensix-based devices

github.com/tenstorrent/...

24.06.2025 22:21 - 👍 2    🔁 0    💬 1    📌 0

WAIT

24.06.2025 22:21 - 👍 1    🔁 0    💬 1    📌 0
picture of me holding a box containing a Tenstorrent Blackhole accelerator card

picture of Tenstorrent Blackhole accelerator card on my desk

i am going to take @tenstorrent.bsky.social's beautiful matrix-multiplying machine and make it do bizarre and horrible things

24.06.2025 19:10 - 👍 55    🔁 1    💬 4    📌 0

The answer is decoupling the operation from where it's run.

We can continue to run FP64 DGEMM on vector for those that need it; we've been doing it for decades.

If you have a need for "more than fp32", Ozaki to the rescue!

Part of the power of the scheme is selecting how much precision you need
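
To make "selecting how much precision you need" concrete, here is a minimal scalar sketch of the slicing idea. It is not the Ozaki scheme itself, which works on whole matrices and scales slices so that their products are exact on low-precision matrix hardware. The function name `split_into_slices` and the 11-bit slice width (chosen to mimic an FP16 significand) are assumptions made for illustration only.

```python
import math

def split_into_slices(x, bits_per_slice, num_slices):
    """Split x into num_slices floats, each with at most bits_per_slice significant bits."""
    slices = []
    remainder = x
    for _ in range(num_slices):
        if remainder == 0.0:
            slices.append(0.0)
            continue
        # remainder = mantissa * 2**exponent, with 0.5 <= |mantissa| < 1
        mantissa, exponent = math.frexp(remainder)
        # Keep only the leading bits_per_slice bits of the mantissa (truncate toward zero).
        leading = math.trunc(mantissa * 2 ** bits_per_slice)
        piece = math.ldexp(leading, exponent - bits_per_slice)
        slices.append(piece)
        # Exact subtraction: piece is a prefix of remainder's bits.
        remainder -= piece
    return slices

x = math.pi
for n in range(1, 6):
    approx = sum(split_into_slices(x, 11, n))
    print(n, abs(x - approx))
```

Each extra slice recovers more of the original value, so the number of slices acts as the precision knob: fewer slices means less work and a coarser result.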

24.06.2025 21:24 - 👍 1    🔁 0    💬 1    📌 0

Depends on training requirements; training still needs some FP32, even if only for accumulation!

24.06.2025 21:07 - 👍 0    🔁 0    💬 0    📌 0
ISC'25 recap I had the pleasure of attending the 40th annual ISC High Performance conference this month in Hamburg, Germany. It was a delightful way to t...

Here's my notes from ISC'25 in Hamburg: blog.glennklockwood.com/2025/06/isc2...

#HPC

24.06.2025 06:01 - 👍 32    🔁 7    💬 7    📌 2

A nice summary, and a call to action, from the organisers of the PASC'25 Mini-Symposium on Application Perspective on SYCL, a Modern Programming Model for Performance and Portability
youtu.be/4K612eNB6cI
#APEROL #SYCL

24.06.2025 16:30 - 👍 3    🔁 2    💬 0    📌 0

Satoshi said this during his talk, but his explanation felt overly simplistic, and I didn't fully buy it. Felix's description makes more sense to me. This quadratic relationship between precision and floorspace is brutal.

#HPC

24.06.2025 16:15 - 👍 5    🔁 1    💬 0    📌 0

But what you net out is you're still losing out on performance/area vs other designs that don't.

Balance in all things is relevant, but I can't put on FP64 (or FP128) just for the sake of it.

I suspect the trend we'll see is FP64 (sometimes FP128) on scalars, FP32-64 on vectors, <=FP32 on matrix

24.06.2025 14:51 - 👍 6    🔁 0    💬 1    📌 1

You have techniques for getting some of that back (for example, muxing half the multipliers, giving an extra scalar unit in the middle of your multiplier tree, etc.) that would allow you to fit 2 FP32s in the existing FP64 unit while only costing 10% more.

you can play the same tricks with FP16

24.06.2025 14:51 - 👍 2    🔁 0    💬 1    📌 0

... version is that doubling the precision of a given FPU quadruples the amount of area needed for that FPU.

That's to say, to fit 1 FP64 unit, I need to give up 4 FP32s. I need to give up 16 FP16/BF16/TF19s.

And compared to MXFP8? Approx. factor 64.
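
The arithmetic behind those numbers can be sketched in a few lines, assuming FPU area scales with the square of the total bit width, which is this post's rule of thumb and not a measured figure. The helper names `relative_area` and `units_per_fp64` are hypothetical, and real designs depend on significand width, exponent handling and layout.

```python
def relative_area(bits, reference_bits=64):
    """Area of one FPU relative to an FP64 unit, assuming area grows as bits squared."""
    return (bits / reference_bits) ** 2

def units_per_fp64(bits):
    """How many units of the given width fit in the area of one FP64 unit."""
    return round(1 / relative_area(bits))

# Formats are grouped by nominal width, as the post does for FP16/BF16/TF19.
for name, bits in [("FP64", 64), ("FP32", 32), ("FP16", 16), ("FP8", 8)]:
    print(f"{name}: {units_per_fp64(bits)} unit(s) per FP64 unit of area")
```

Under that assumption each halving of width buys four times the units, which is where the 4, 16 and roughly 64 figures come from.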

24.06.2025 14:51 - 👍 7    🔁 0    💬 2    📌 1
