Michael Griffiths @griffiths.ai

Every line of dialogue in that movie is important to the plot. It's amazing. Fabulous movie.

21.07.2025 02:12 — 👍 3 🔁 0 💬 0 📌 0

Shape constrained splines are nice too, as a way to introduce theory-based constraints.

08.07.2025 13:00 — 👍 2 🔁 0 💬 0 📌 0

This is a fairly technical but highly relevant paper on how we can model complex systems at various levels of detail without losing causal content. Think gas: instead of tracking every molecule, we can focus on big-picture properties like temperature and pressure. www.auai.org/uai2017/proc...

08.07.2025 09:30 — 👍 151 🔁 33 💬 9 📌 3

Damn right.

07.07.2025 15:44 — 👍 0 🔁 0 💬 0 📌 0

From the outside, it just looks like Llama missed a trick focusing on MoE instead of reasoning. A bunch of people used that as evidence that the strategy was bad and convinced Z to blow things up. So far the announced moves have been an incoherent land grab at Z's expense.

03.07.2025 01:02 — 👍 0 🔁 0 💬 0 📌 0

Yeah, the degrowth stuff needs to be killed thoroughly and without mercy. No.

03.07.2025 00:01 — 👍 6 🔁 0 💬 0 📌 0

While true, these tech advances generally benefit society. The Luddites lost their jobs and their kids made less money for generations - but cheaper textiles were a huge benefit to many others.

18.06.2025 23:56 — 👍 0 🔁 0 💬 0 📌 0

My experience is that it felt like 4 for the first year.

15.06.2025 00:44 — 👍 1 🔁 0 💬 0 📌 0

I should! I just finished a P-spec for a system that looked something like this -- generate P spec from code, use that to find errors, turn error traces into tests in project language. You need to be a little careful but it's very fun.

02.06.2025 19:42 — 👍 1 🔁 0 💬 0 📌 0

Validity In Psychology Research: Types & Examples In psychology research, validity refers to the extent to which a test or measurement tool accurately measures what it's intended to measure. It ensures that the research findings are genuine and not d...

I took a number of methods/stats courses in undergrad, and the *only* discipline that covered this was, oddly, psychology. There, they spent a lot of time on validity (e.g. www.simplypsychology.org/validity.html) and then introduced stats as part of it.

30.05.2025 18:17 — 👍 1 🔁 0 💬 0 📌 0

It's great. Sonnet 3.7/4 is also good. I've been using TLA+ for months now to check my own code, and to explain system dynamics in e.g. bug reports. Growing adoption should increase your target market, not decrease it.

30.05.2025 18:17 — 👍 2 🔁 0 💬 1 📌 0

Systems Correctness Practices at Amazon Web Services – Communications of the ACM

Great to see this (cacm.acm.org/practice/sys...) from @marcbrooker.bsky.social

My experience is that LLMs make using TLA+ much easier. For instance, I just wrote up a bug report last week that outlined the old/buggy behavior with a TLA+ spec. It made the dynamics much clearer.

30.05.2025 13:57 — 👍 1 🔁 0 💬 0 📌 0

Reminds me of "in the long run, we're all dead"

30.05.2025 00:21 — 👍 1 🔁 0 💬 0 📌 0

(Not always true, sometimes it's much harder)

20.04.2025 00:42 — 👍 2 🔁 0 💬 0 📌 0

Yeah, that's true. Verification is often much easier however.

20.04.2025 00:42 — 👍 3 🔁 0 💬 1 📌 0

This is one of the great uses of LLMs, though. Turns into (often) a 2-minute job

20.04.2025 00:28 — 👍 3 🔁 0 💬 1 📌 0

Mmmm, they don't. Even if they lose money on average per query, that's dominated by long outlier conversations.

27.01.2025 23:37 — 👍 1 🔁 0 💬 0 📌 0

I agree it's a big deal. I grew up in Santa Rosa and recall the fire that burned down >7% of the city's housing stock. You could drive through neighborhoods and see melted cars and lone brick chimneys. It's sad to see the same kind of thing in LA.

09.01.2025 01:24 — 👍 1 🔁 0 💬 0 📌 0

I do think licensing limits available staff and drives compensation up, so provider cost seems like a part of it. The important takeaway is the incentive structure of insurance conglomerates.

13.12.2024 14:34 — 👍 1 🔁 0 💬 0 📌 0

Interesting take that the capped-profit structure pushed insurance companies, in order to maximize profit, into self-dealing and weakened their incentive to control costs.

13.12.2024 14:28 — 👍 1 🔁 0 💬 1 📌 0

Reinforcement Learning: An Overview This manuscript gives a big-picture, up-to-date overview of the field of (deep) reinforcement learning and sequential decision making, covering value-based RL, policy-gradient methods, model-based met...

An updated intro to reinforcement learning by Kevin Murphy: arxiv.org/abs/2412.05265! Like their books, it covers a lot and is quite up to date with modern approaches. It is also pretty unique in coverage; I don't think a lot of this is synthesized anywhere else yet.

09.12.2024 14:27 — 👍 270 🔁 73 💬 9 📌 5

How much money do we think means-testing this is going to save? Like, how many ebike purchases would there have been for people making >3x poverty level?

08.12.2024 02:37 — 👍 1 🔁 0 💬 1 📌 0

Yeah. And insurance company policy changes get attention, but not the fraudulent billing or practices that lead to insurance companies cracking down.

05.12.2024 18:49 — 👍 1 🔁 0 💬 0 📌 0

Nice to see alternatives to TLA+ pop up - this case study of Fizzbee makes it look like a nice contender for certain use cases.

05.12.2024 17:41 — 👍 0 🔁 0 💬 0 📌 0

Depends if you want to make the declaration of the variable more obvious. You can do

result <-
  df |> f() |> g() |> h()

or

result <- (
  df |> f() |> g() |> h()
)

... when you want people to focus on the `result` variable vs. the chain logic.

04.12.2024 21:00 — 👍 0 🔁 0 💬 1 📌 0

Quite - the lack of empathy about his death is something that bothers me.

04.12.2024 20:58 — 👍 1 🔁 0 💬 0 📌 0

Each transaction inside DSQL runs in a customized Postgres engine inside a Firecracker MicroVM, dedicated to your database. When you connect to DSQL, we make sure there are enough of these MicroVMs to serve your load, and scale up dynamically if needed. We add MicroVMs in the AZs and regions your connections are coming from, keeping your SQL query processor engine as close to your client as possible to optimize for latency.

Neat! brooker.co.za/blog/2024/12...

04.12.2024 18:33 — 👍 1 🔁 0 💬 0 📌 0

Posting some evergreens for the new crowd. Did you know you can differentiate RANSAC?

If you fix the # of iterations, RANSAC is an argmax over hypotheses. You turn the inlier count into your policy for hypothesis selection, and train with policy gradient (DSAC, CVPR17).

github.com/vislearn/DSA...
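The idea in this post can be sketched on a toy problem. What follows is not the DSAC code from the linked repository; it is a minimal illustration under stated assumptions: 2D line fitting instead of camera pose, a sigmoid-smoothed inlier count as the score, and a single learnable softmax sharpness `alpha` standing in for network weights. The data and all function names (`line_through`, `soft_inlier_count`, `selection_policy`, `reinforce_gradient`, and so on) are hypothetical.

```python
import math
import random

# Toy data, invented for illustration: five points on y = 2x + 1 plus two outliers.
POINTS = [(0.0, 1.0), (1.0, 3.0), (2.0, 5.0), (3.0, 7.0), (4.0, 9.0),
          (1.5, 9.0), (2.5, -2.0)]
TRUE_LINE = (2.0, 1.0)  # ground-truth (slope, intercept) used by the task loss


def line_through(p, q):
    """Minimal-set solver: the line through two points with distinct x."""
    slope = (q[1] - p[1]) / (q[0] - p[0])
    return slope, p[1] - slope * p[0]


def soft_inlier_count(line, points, threshold=0.5, beta=10.0):
    """Smooth stand-in for the inlier count (sigmoid written via tanh for stability)."""
    slope, intercept = line
    total = 0.0
    for x, y in points:
        residual = abs(y - (slope * x + intercept))
        total += 0.5 * (1.0 + math.tanh(0.5 * beta * (threshold - residual)))
    return total


def selection_policy(scores, alpha):
    """Softmax over hypothesis scores; alpha is the single learnable parameter here."""
    logits = [alpha * s for s in scores]
    top = max(logits)
    weights = [math.exp(l - top) for l in logits]
    norm = sum(weights)
    return [w / norm for w in weights]


def expected_loss(scores, losses, alpha):
    """Task loss averaged over the selection policy."""
    return sum(p * l for p, l in zip(selection_policy(scores, alpha), losses))


def exact_gradient(scores, losses, alpha):
    """d E[loss] / d alpha = sum_j pi_j * loss_j * (s_j - E[s])."""
    probs = selection_policy(scores, alpha)
    mean_score = sum(p * s for p, s in zip(probs, scores))
    return sum(p * l * (s - mean_score) for p, l, s in zip(probs, losses, scores))


def reinforce_gradient(scores, losses, alpha, rng, samples=1):
    """Policy-gradient estimate: sample a hypothesis, weight grad log pi by its loss."""
    probs = selection_policy(scores, alpha)
    mean_score = sum(p * s for p, s in zip(probs, scores))
    baseline = sum(p * l for p, l in zip(probs, losses))  # variance reduction only
    total = 0.0
    for j in rng.choices(range(len(scores)), weights=probs, k=samples):
        total += (losses[j] - baseline) * (scores[j] - mean_score)
    return total / samples


def make_hypotheses(points, rng, iterations=16):
    """Fixed iteration count, as in the post: one line per random minimal set."""
    return [line_through(*rng.sample(points, 2)) for _ in range(iterations)]


if __name__ == "__main__":
    rng = random.Random(0)
    lines = make_hypotheses(POINTS, rng)
    scores = [soft_inlier_count(line, POINTS) for line in lines]
    losses = [abs(a - TRUE_LINE[0]) + abs(b - TRUE_LINE[1]) for a, b in lines]
    alpha = 0.1
    for _ in range(200):  # plain SGD on alpha using the sampled estimate
        alpha -= 0.01 * reinforce_gradient(scores, losses, alpha, rng)
    print("alpha:", alpha, "expected loss:", expected_loss(scores, losses, alpha))
```

With the hypothesis set fixed, the hard argmax becomes a categorical distribution, so the expected task loss is differentiable in whatever parameters feed the scores. The sampled estimate is the score-function form of the same derivative that `exact_gradient` computes in closed form.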

28.11.2024 15:42 — 👍 79 🔁 7 💬 2 📌 1

Yes! My quality of life goes way down when FRED doesn't have something and I have to try to extract it from Eurostat or the OECD. Or even BLS for things FRED doesn't pick up.

28.11.2024 11:57 — 👍 438 🔁 23 💬 16 📌 3

That feels like a timing thing? E.g., you built your workflow and became comfortable with it prior to LLMs, so of course there is no gap.

28.11.2024 10:45 — 👍 0 🔁 0 💬 0 📌 0


Latest posts by griffiths.ai on Bluesky
