@darsnack.bsky.social
NeuroAI Scholar @ CSHL https://www.darsnack.info Previously maintaining FluxML to procrastinate Previously EE PhD at UW-Madison, comp. eng. / math at Rose-Hulman
Here's a version that lets me look at individual layers and heads (a layer just being the mean of all of its heads). Notice some heads really care about specific tokens that others don't.
19.02.2026 13:16 — 👍 22 🔁 1 💬 1 📌 0
You wrote “average attention score as the prompt progresses” so I was just clarifying for myself average over what
19.02.2026 12:39 — 👍 0 🔁 0 💬 1 📌 0
So the opacity is a running average score?
19.02.2026 12:31 — 👍 0 🔁 0 💬 1 📌 0
What do LLMs see?
I wrote a lil' tool that extracts the attention matrices out of open models and creates this typing visual, with each token's opacity changing according to its average attention score as the prompt progresses. Dimmer words are considered less important to the model.
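A minimal sketch of how a per-token opacity could be derived from an attention matrix as the prompt grows. The tool's actual code is not shown here, so everything below is an assumption: the function name `running_opacity` is hypothetical, the matrix is taken to be already averaged over heads and layers, and "average attention score" is read as the mean attention a token has received from every later token typed so far, rescaled so the most-attended token is fully opaque.

```python
from typing import List


def running_opacity(attn: List[List[float]]) -> List[List[float]]:
    """Return one frame per typing step; frame t holds opacities for tokens 0..t.

    attn[i][j] is the attention query token i pays to key token j.
    The matrix is assumed causal (only j <= i is read) and assumed to be
    already averaged over heads and layers.
    """
    n = len(attn)
    received = [0.0] * n  # running sum of attention each token has received
    frames = []
    for t in range(n):
        # Token t has just been "typed": add its attention row to the sums.
        for j in range(t + 1):
            received[j] += attn[t][j]
        # Token j has been visible to queries j..t, so average over t - j + 1 rows.
        means = [received[j] / (t - j + 1) for j in range(t + 1)]
        peak = max(means)
        # Rescale so the most-attended token is fully opaque (an assumed choice).
        frames.append([m / peak if peak > 0 else 0.0 for m in means])
    return frames


# Toy 3-token causal attention matrix (each row sums to 1).
attn = [
    [1.0, 0.0, 0.0],
    [0.6, 0.4, 0.0],
    [0.5, 0.1, 0.4],
]
frames = running_opacity(attn)
```

Each frame can then be mapped to CSS opacity or a text alpha value when rendering the typing animation. Averaging over heads first gives a per-layer view, and averaging over layers as well gives a whole-model view.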
Screenshot of the vancouver metro area rendered by the Stamen Design water color tileset, and it's gorgeous. it looks like a real water color painting
i just discovered Stamen's Watercolor tileset and it's just beautiful! I wish all maps were this beautiful.
Bring more whimsy and beauty to websites
We had this beautiful internet, and now it's filled with slop. Outside of my circles, I don't trust digital interactions with people, and I know they don't trust me either. (Some effects spill over into real life too.)
Was it worth it? Time will tell.
1/ Finally wrote up “The Story of Mendeley”! Most people know the tool, few know about its rise and fall. The Mendeley story provides important clues for how to build self-sustaining AND non-extractive knowledge commons, which is why I think it deserves more attention 🧵
13.02.2026 20:55 — 👍 75 🔁 37 💬 6 📌 4
Highest quality content on here
13.02.2026 15:24 — 👍 1 🔁 0 💬 0 📌 0
On a lark, tested Opus 4.6 on a common academic task: take a 10k word article and shorten it to 6k words for submission to a new journal. I told it to use the command line tool texcount to count words given the known inability to count words. It failed in a fairly funny way…
12.02.2026 00:24 — 👍 12 🔁 2 💬 2 📌 0
And ML agrees! https://arxiv.org/abs/1803.03635 https://proceedings.neurips.cc/paper/2020/hash/322f62469c5e3c7dc3e58f5a4d1ea399-Abstract.html
09.02.2026 13:00 — 👍 2 🔁 0 💬 0 📌 0
And it’s not because of what scientists did. It is a political change that we did not ask for, nor control.
But now that it has happened, we can’t ignore it.
Scoop of ice cream in a shot of espresso, served in a wide-brimmed glass
I'm starting (a)ffogato fridays. it's legal. it's legal for me to have ice cream in a shot of espresso at 9am on a friday. it's legal for me to have a little treat.
07.02.2026 04:07 — 👍 51 🔁 4 💬 6 📌 2
Global warming discourse 10 years ago immunized me against predicting an end to long running trends on the basis of a few data points, because this little blip made people think global warming had stopped. Afterwards it wasn't even visible in the trend.
06.02.2026 19:05 — 👍 32 🔁 2 💬 1 📌 0
The Bluesky mute words panel showing “Waymo” muted for the next 24 hours
07.02.2026 13:56 — 👍 0 🔁 0 💬 0 📌 0
Network propagation on this site is fun. You can watch how misinformation spreads just scrolling through your feed. No sleuthing required.
07.02.2026 13:33 — 👍 0 🔁 0 💬 0 📌 0
Same for programming. One might say languages/communicating succinctly ≈ tool for thinking.
07.02.2026 00:21 — 👍 0 🔁 0 💬 0 📌 0
Elon's decision to shut down USAID is the direct cause of countless deaths. That he isn't persona non grata in many communities after that is a source of shame.
05.02.2026 15:19 — 👍 195 🔁 33 💬 1 📌 0
In almost all (all?) latent diffusion models, the VAE that sandwiches the diffusion model is not a real VAE: the KL loss has a weight of 10^-5 and there is an additional GAN loss that significantly boosts the visual quality. So it's more an adversarial auto-encoder than anything else.
03.02.2026 22:45 — 👍 4 🔁 1 💬 1 📌 0
Okay this JPL article has more details. It actually used image data as context which is pretty impressive!
https://www.nasa.gov/missions/mars-2020-perseverance/perseverance-rover/nasas-perseverance-rover-completes-first-ai-planned-drive-on-mars/
The output as markup makes sense, but what is the input context?
01.02.2026 14:34 — 👍 1 🔁 0 💬 1 📌 0
every epstein file drop underscores how elite power operates through shared socio-economic networks, regardless of people's ideological differences, populist posturing, or public feuds
30.01.2026 23:58 — 👍 30082 🔁 6554 💬 388 📌 309
This study by people from Anthropic itself should raise huge alarm bells about the use of AI in teaching how to code (and later on in coding itself, but esp. in the learning stage).
And remember: this is by the people who make Claude!
tl;dr: not that long, read it
www.anthropic.com/research/AI-...
good afternoon
31.01.2026 00:35 — 👍 12 🔁 1 💬 0 📌 1
Ah I see it does have a built-in labeler. Maybe adjusting these settings will help: https://bsky.app/profile/moderation.bsky.app
30.01.2026 17:31 — 👍 1 🔁 0 💬 0 📌 0
Does bsky have a built-in spam label? I know you can subscribe to labelers (many exist for spam/bots) and when you do, you choose whether labeled accounts just have a label or are hidden. It would be nice to have a “show hidden” UX though.
30.01.2026 17:24 — 👍 0 🔁 0 💬 1 📌 0
The City’s Budget is our future. And you deserve to know how it works.
30.01.2026 14:09 — 👍 6403 🔁 877 💬 109 📌 405
slack notification about AI searches
didn't really need this notification popping up at random in every slack I'm in
29.01.2026 22:35 — 👍 21 🔁 1 💬 2 📌 0
me: How should I invest my savings to avoid the AI bubble?
tech friends: What bubble?
non-tech friends: What savings?