Maximilian Scholz

@scholzmx.bsky.social

PhD researcher, building infrastructure (and pokemon) for Bayesian workflows, simulating everything. Music, cooking, exercise enthusiast. http://fediscience.org/@scholzmx

606 Followers  |  183 Following  |  1,122 Posts  |  Joined: 03.10.2023

Posts by Maximilian Scholz (@scholzmx.bsky.social)

Astra... Show me on this Mac mini where the ai agent touched you.

02.03.2026 13:21 - 👍 5    🔁 0    💬 1    📌 0

I feel like over the last 2 months I've slowly slipped into a place where I just do things rather than trying to get others to see why it would be a good thing to do things. It's liberating.

28.02.2026 12:29 - 👍 0    🔁 0    💬 0    📌 0

Richard, what did you do to the owls?

28.02.2026 12:26 - 👍 2    🔁 0    💬 0    📌 0

If you bitshift this data.frame just right, the serialised binary can be interpreted as NN weights

27.02.2026 15:59 - 👍 1    🔁 0    💬 0    📌 0
┏━━━━━━━━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━━┓
┃ Model         ┃ Tasks ┃ Passed ┃ Pass Rate ┃ Avg Latency ┃ Input (M) ┃ Output (M) ┃ Cache (M) ┃ Total Cost ┃
┑━━━━━━━━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━━┩
│ glm-5         │   206 │     27 │     13.1% │       91.1s │     0.00M │      0.00M │     0.00M │      $0.00 │
│ gpt-oss-120b  │   205 │     32 │     15.6% │       68.3s │     2.35M │      0.20M │     0.00M │      $0.33 │
│ minimax-m2.5  │   205 │     30 │     14.6% │       96.3s │    14.11M │      0.15M │     0.84M │      $4.43 │
│ nemotron-3-n… │   206 │     27 │     13.1% │       90.7s │     0.00M │      0.00M │     0.00M │      $0.00 │
│ step-3.5-fla… │   206 │     27 │     13.1% │       61.5s │     0.00M │      0.00M │     0.00M │      $0.00 │
│ trinity-larg… │   170 │     78 │     45.9% │      367.3s │    53.65M │      1.73M │     0.00M │     $15.14 │
│ trinity-mini  │   204 │     57 │     27.9% │      115.7s │     1.58M │      1.83M │     0.00M │      $0.35 │
└───────────────┴───────┴────────┴───────────┴─────────────┴───────────┴────────────┴───────────┴────────────┘

Well, some parts are working and some not so much but we are measuring *something*. Now on to make it better and figure out how to fund running this on non-free models...

24.02.2026 20:33 - 👍 2    🔁 1    💬 0    📌 0

Because I stumbled over o/u stuff again. My friggin pipe!

23.02.2026 22:41 - 👍 0    🔁 0    💬 0    📌 0

Randomly stumbled over this again and am reminded again why "30 days of bleed" is the best title for any blog post written about #over/under

23.02.2026 22:39 - 👍 3    🔁 0    💬 2    📌 0
Cover for Pumpkin Spice a TTRPG showing a witch pouring coffee in a pumpkin spice coffee cup.

So proud to announce my collaboration on "Pumpkin Spice" a cozy, magical TTRPG, published by #AcheronGames

Check it out on BackerKit (link in bio) and discover all the amazing perks of backing!

Pour yourself a cup of tea and enjoy the vibe! 🪴🍵

#PumpkinSpiceRPG

23.02.2026 19:32 - 👍 2588    🔁 496    💬 30    📌 13
Screenshot of a terminal window showing benchmark results:

────────────────────────── Model Performance ──────────────────────────
┏━━━━━━━━━━━━━━━━┳━━━━━━━┳━━━━━━━━┳━━━━━━━━━━━┳━━━━━━━━━━━━━┓
┃ Model          ┃ Tasks ┃ Passed ┃ Pass Rate ┃ Avg Latency ┃
┑━━━━━━━━━━━━━━━━╇━━━━━━━╇━━━━━━━━╇━━━━━━━━━━━╇━━━━━━━━━━━━━┩
│ minimax-m2.5   │   106 │     15 │     14.2% │      102.9s │
│ step-3.5-flash │   194 │     26 │     13.4% │       60.5s │
└────────────────┴───────┴────────┴───────────┴─────────────┘

I was supposed to submit a Stan design proposal but got sidetracked... Also, it's a good thing that API keys can have limits... so I've heard.

23.02.2026 18:54 - 👍 0    🔁 0    💬 0    📌 0

porngrind never left

23.02.2026 15:28 - 👍 2    🔁 0    💬 0    📌 0

Turns out unpaid overtime costs us around 400k jobs in Germany.

23.02.2026 12:36 - 👍 2    🔁 0    💬 0    📌 0

Started working on my own R-focused LLM benchmark and all of a sudden the zai pro plan didn't feel infinite anymore >.>

21.02.2026 21:38 - 👍 0    🔁 0    💬 0    📌 0

Thinking that more companies need a jester. Someone who is allowed to make fun of the king and point at all the ugly and dumb things everyone else silently agreed not to talk about. Someone who can call out the emperor's invisible clothes and not be lynched but taken seriously instead.

21.02.2026 13:21 - 👍 2    🔁 0    💬 1    📌 0

Don't be shy.

21.02.2026 11:55 - 👍 1    🔁 0    💬 0    📌 0

@hadley.nz I have something in mind I want to tinker with and need some showcase packages for that :)

20.02.2026 22:59 - 👍 0    🔁 0    💬 0    📌 0

In your opinion, which R packages have the highest code quality and should serve as examples to learn from? #rstats

20.02.2026 22:09 - 👍 8    🔁 1    💬 2    📌 1

I'd have a couch to crash on for 2 ;)

20.02.2026 22:03 - 👍 0    🔁 0    💬 2    📌 0

You should just come visit Hamburg :*

20.02.2026 20:10 - 👍 1    🔁 0    💬 1    📌 0

Not to throw shade, but not being exclusively surrounded by high-achievers for the first time since high school is doing wonders for my mental health. Also, many recurring comments from the past make much more sense now.

19.02.2026 08:47 - 👍 2    🔁 0    💬 0    📌 0

If the pen is mightier than the sword, the eraser is by extension also mightier. Few understand this.

17.02.2026 16:44 - 👍 0    🔁 0    💬 1    📌 0

Ma'am, this is an eraser.

17.02.2026 15:56 - 👍 2    🔁 0    💬 1    📌 0

It's also kinda slow and they quietly removed the "access to new frontier models" point from the plus plan.

11.02.2026 22:15 - 👍 4    🔁 0    💬 1    📌 0

TIL about the narcissist-aspie trap 👀

11.02.2026 22:12 - 👍 0    🔁 0    💬 0    📌 0

That's why I only work with raw commit hashes.

11.02.2026 16:51 - 👍 2    🔁 0    💬 0    📌 0

We have the industry to see methods of times past in the wild. It's like Jurassic Park.

10.02.2026 17:11 - 👍 1    🔁 0    💬 0    📌 0

What kind of crazy are we talking? Do I get boons?

09.02.2026 23:45 - 👍 1    🔁 0    💬 1    📌 0

Is it?
github.com/discourse/di...

09.02.2026 23:14 - 👍 2    🔁 0    💬 1    📌 0
Screenshot of Claude writing: "why is my PATH fucked?"

mood

09.02.2026 20:07 - 👍 0    🔁 0    💬 0    📌 0

Well you found me, congratulations.
Was it worth it?
The only thing you've managed to break so far is my heart.
This isn't brave.
It's murder.
What did I ever do to you?
You don't even care, do you?
Please proceed into android hell.

09.02.2026 10:58 - 👍 1    🔁 0    💬 0    📌 0
A black and white drawing of a corpse that reads “If I just try harder, then maybe it won’t kill me.”

08.02.2026 21:00 - 👍 1181    🔁 270    💬 4    📌 3