Stephen Martin's Avatar

Stephen Martin

@stephensrmmartin.bsky.social

Probabilistic modeling and machine learning nerd Social Psych PhD, quantitative methods Data Scientist Manager, Comscore - Innovation Research Team.

252 Followers  |  611 Following  |  64 Posts  |  Joined: 25.09.2023  |  1.9587

Latest posts by stephensrmmartin.bsky.social on Bluesky

Llama 3.1 was an excellent local model for its time. That was very well received in the local model world.
But Gemma, Mistral models, and now Qwen models have far surpassed it.

Llama 4 was DOA. Too big for most local model users. Too small if you're going to use an API anyway.
Qwen has the ๐Ÿ‘‘ now.

11.08.2025 17:59 โ€” ๐Ÿ‘ 3    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

It lets you move up a level of abstraction for better or worse. I am more able now to be a broader ideas person, which is heavily informed by years of experience and expertise in bespoke stats, analysis, modeling, and coding, etc. Still solving problems, but in a less nitty gritty, more support way.

05.08.2025 03:52 โ€” ๐Ÿ‘ 3    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Some memory issues are too stubborn for post-boot memtesters. Though I had one that was awful - Never manifested in memtest86, it only manifest when the machine was under heavy load, so I had to boot minimally and stress test while checking.
Do you have XMP/EXPO/DCOP enabled?

04.02.2025 04:41 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Have you done a proper memtest?
Filesystem errors + freezing sounds like memory could be a culprit.

04.02.2025 03:52 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Bah nvm. As soon as I posted this I saw you took the same screenshot haha.

25.01.2025 20:38 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

From their own GitHub - I suspect it doesn't use one.

25.01.2025 20:38 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

It's still my gold standard. If it doesn't work in Stan, it is unlikely to work at all.
It's the perfect level of abstraction. Not so abstract that it's hard to know what it's really doing, not so low level that you have to use a ton of boilerplate and specify every little thing. It's transparent.

11.01.2025 03:23 โ€” ๐Ÿ‘ 3    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

But seriously, I cannot imagine a world where Disney would be ok with a model trained on Disney content being used to generate Disney like content. It could actually spur some changes in what is acceptable use of copyrighted works.
I'm not anti AI, but obviously there are problems with copyright.

29.12.2024 18:04 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

If someone made LLM or image generation models that just produced Disney-like scripts and characters, would *that* suddenly spur some legislation behind acceptable use of copyrighted material to protect artists, writers, bloggers, coders, etc?
Use the system to fight the system?

29.12.2024 17:59 โ€” ๐Ÿ‘ 3    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image Post image

This boss just casually breaking a billion in Jaws pinball. Didn't even care for the applause at the end. How far has this gent actually gotten??

30.11.2024 04:32 โ€” ๐Ÿ‘ 5    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

MELSM**

24.11.2024 20:17 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

I'd like to thank everyone who made a starter pack. The stats/academic/ML research community was so strong on the site that shall not be named. I hadn't worked up the energy to find everyone from that community, but now bsky is looking a whole lot like the best of times on Xitter.

24.11.2024 20:11 โ€” ๐Ÿ‘ 5    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Likewise - use location scale modeling methods if homogeneity is wrong. LSMs, MELMS, LM-MELSMs, etc. But this is much easier with Bayes. Use wide-tail distributions. There are a lot of ways that models can be improved to better match the apparent DGP without even getting into process changes.

24.11.2024 20:09 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

This was why I wrote about "DOCOs" - "Data otherwise considered outliers". People will remove outliers; when really you should only remove outliers if it's truly a data encoding error, otherwise change your model to accommodate the process that produce the DOCO.

24.11.2024 20:09 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

I think this mentality is very common in Bayesian methods, perhaps in part because switching assumptions doesn't require a deep dive into how to produce accurate test statistics; the posterior is still tractable and equally interpretable.

24.11.2024 20:09 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Well this was incredibly helpful, thank you. Still trying to reconnect with all my twitter circle...

24.11.2024 02:32 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

A list of all the stats/modeling/ML/data starter packs I've seen (26+ and counting):

23.11.2024 19:57 โ€” ๐Ÿ‘ 44    ๐Ÿ” 12    ๐Ÿ’ฌ 6    ๐Ÿ“Œ 3

Stalker 2 + VR works on Linux.
Welp, I'll never see the light of day again.

22.11.2024 03:55 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

And wouldn't most answer "Not well", which is consistent with their attitude toward the efficiency commission?

13.11.2024 18:47 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Does anyone in academia claim that academia is run efficiently?
I recall one lab took the better part of a year to get a trash can for their lab because the department was unwilling to secure the funds from their own grant.

13.11.2024 18:45 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Simulation remains one of the single best methods I've learned for either stats or coding. A mix of statistical theory, simulation, good code practices, and domain knowledge can take you really far on hard tasks.
Had a huge win today because of that skill. Feels good.

12.10.2024 04:10 โ€” ๐Ÿ‘ 4    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

1. I dunno? I see a lot of docs including info about returned objects. Certainly better than python anyway.
2. I still am not sure why to include both. But I'm more concerned about how to make it seamless if many devs and users used it. So you can do what you want, but attr is better for all.

23.09.2024 05:02 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

By contrast if everyone added metadata attributes and by doing so can announce themselves as inheriting from a metadata type, then anyone can return any type, devs wouldn't need monads or to handle metadata outputs, etc.
Makes it all more functional without diving into monads.

21.09.2024 19:02 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

If everyone were to adopt this method, then everyone would have to either export lists or export a list of metadata plus their actual object. We'd then want monads to handle this, but most R users won't understand monads.
What of people who use s4? Output matrices? Vectors?
Etc.

21.09.2024 19:01 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0

Yes, please do this. It's a much better practice that doesn't kill composability.
Use attr for metadata. Make a class for things with that metadata. Add methods for those classes.

20.09.2024 23:26 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0

We're moving into a new rental this week and it has a dishwasher. I hadn't thought about the dense egg smell that I'm going to be bathed with upon opening it. Ugh

12.08.2024 03:45 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

I was just talking about this with my wife. I despise this smell but love eggs. We don't even have a dishwasher, I just have to rewash so many plates.

12.08.2024 03:39 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
GitHub - stephensrmmartin/lpd8mk2: Linux driver for sending and receiving configuration from the Akai LPD8 Mkii (Mk2) device Linux driver for sending and receiving configuration from the Akai LPD8 Mkii (Mk2) device - stephensrmmartin/lpd8mk2


github.com/stephensrmma...

Linux-friendly configuration tool for the Akai LPD8 Mk2 midi controller. This was a fun 3-evening hackathon.
All this, because I wanted to configure my midi pads as foot pedals for my multi-effect raspberry pi project, and I don't like windows.

10.05.2024 05:50 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

There we go.

06.05.2024 07:27 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Using Wireshark to listen to Sysex midi signals sent by the official windows/osx editor, so I can implement it for Linux. Run experiments and see what hex code means what.

06.05.2024 06:05 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

@stephensrmmartin is following 20 prominent accounts