@archtoad - Bluesky Profile

My point is that any black box function that takes in a sequence of words and predicts (assigns probabilities to) the next word is by definition a “language model” regardless of what’s going on inside the black box

11.02.2026 14:21 — 👍 1 🔁 0 💬 1 📌 0

Word n-gram language model - Wikipedia

Language model has been used for a while to describe using statistics to describe/analyze language. E.g., en.wikipedia.org/wiki/Word_n-...

11.02.2026 13:19 — 👍 0 🔁 0 💬 1 📌 0

a robot is standing in a room with the words `` ai yi yi ! '' written on it . ALT: a robot is standing in a room with the words `` ai yi yi ! '' written on it .

if you see this, quote with a robot that isn't from "Star Wars", "Star Trek", "Dr. Who", or "Transformers.”

08.02.2026 00:20 — 👍 0 🔁 0 💬 0 📌 0

Nice! My pet peeve is that this style of excessive try/except is so much harder to debug. More “an error happened, somewhere” logs/errors, instead of just raising an error when the error happened.

04.02.2026 13:05 — 👍 1 🔁 0 💬 1 📌 0

It very regularly fails which day of the week it is (saying stuff like “Wednesday, 1/1/26”). I even gave it a “today” tool but it doesn’t use it…

01.01.2026 17:37 — 👍 2 🔁 0 💬 1 📌 0

I like @hynek.me’s content

17.12.2025 22:55 — 👍 1 🔁 0 💬 0 📌 0

Had my coding agent make a Dockerfile and it copied the AGENTS.md to the image. Was this an attempt at self-preservation?

17.12.2025 00:40 — 👍 1 🔁 0 💬 0 📌 0

@pythonbytes.fm the LinkedIn cringe made me think of this… “AI/Blockchain/Kombucha startup” !

16.12.2025 01:40 — 👍 5 🔁 0 💬 1 📌 0

Brandon Bird: "King of the Cage"

@brandonbird.bsky.social already thought this one through brandonbird.com/kingofcage.h...

19.11.2025 13:31 — 👍 1 🔁 1 💬 0 📌 0

I connected my laptop to my piano and typed into the terminal “connect to my piano and play a few notes with midi” and it worked first try. This is some Star Trek shit. If you told me 5 years ago this would be possible today I would not have believed you.

06.11.2025 21:22 — 👍 19 🔁 3 💬 1 📌 0

“I don’t want to hear from Mitchell because I don’t think I would enjoy her content” - sure whatever (you’re misrepresenting her work but that’s your choice). “I don’t want to hear from Mitchell because she doesn’t know how NNs work” makes you sound like an uninformed asshole.

04.11.2025 14:13 — 👍 1 🔁 0 💬 0 📌 0

The paper as a whole holds up! It’s about the risks/limitations of scaling language models - all very relevant today! How many NLP papers from 2020-2021 can you say that about?

04.11.2025 13:39 — 👍 1 🔁 0 💬 0 📌 0

So to recap, you don’t want to ever hear from Mitchell because of one sentence in a paper that summarizes her co-authors position re: a linguistic theory about form vs meaning, which disqualifies her from ever knowing how these things work “in a relevant sense” ?

04.11.2025 13:37 — 👍 1 🔁 0 💬 2 📌 0

The premise of the paper is “there are risks/downsides to larger models.” Nowhere in the paper does it claim anything like “language models can’t generalize to unseen prompts.” You’re just straw manning some thesis onto the paper based on the phrase “Stochastic Parrots.”

04.11.2025 13:14 — 👍 1 🔁 0 💬 1 📌 0

I don’t think this is bad faith. Margaret Mitchell has a long CV with plenty of papers that go beyond the scope of the Stochastic Parrots paper that clearly demonstrate she knows how NNs work?

04.11.2025 12:56 — 👍 3 🔁 0 💬 1 📌 0

AGENTS.md AGENTS.md is a simple, open format for guiding coding agents. Think of it as a README for agents.

I just put in my global AGENTS.md that every python project uses uv and briefly explain how to use “uv run” - haven’t had to remind it since

02.11.2025 22:29 — 👍 2 🔁 0 💬 1 📌 0

“Traditional NLP models like BERT…”

31.10.2025 09:47 — 👍 4 🔁 0 💬 0 📌 0

My takeaway is deberta baseline is the winner here? Way easier to train/deploy. Also what if you scaled the encoder-classifier up to a comparable size?

30.10.2025 12:04 — 👍 0 🔁 0 💬 0 📌 0

Right but we have users who are like “I can’t find the [microsoft] copilot button” - getting them to install/figure out Claude code is just not practical.

25.10.2025 11:57 — 👍 0 🔁 0 💬 1 📌 0

Good stuff. Does this thinking extend to more general things like Microsoft copilot and ChatGPT? Or are you saying normies should start using coding agents

25.10.2025 01:58 — 👍 2 🔁 0 💬 1 📌 0

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach We study a novel language model architecture that is capable of scaling test-time computation by implicitly reasoning in latent space. Our model works by iterating a recurrent block, thereby unrolling...

There was an interesting paper earlier this year about a “recurrent depth” technique that allowed the model to reuse layers … this what you mean? arxiv.org/abs/2502.05171

15.10.2025 08:35 — 👍 2 🔁 0 💬 1 📌 0

Yeah plenty of examples of code golf / people trying to put like 5 lines of code in a single line to show that They Can and it just makes unreadable garbage

14.10.2025 16:38 — 👍 1 🔁 0 💬 0 📌 0

Not sure what you mean by “traditional UX” but I’d agree that having creative UX people who can think outside the box is more important than ever

07.10.2025 16:55 — 👍 3 🔁 0 💬 1 📌 0

I’ve had many meetings where people are arguing over how the prototype should be built and by the end of the meeting I’m like “here it is”

21.09.2025 16:08 — 👍 3 🔁 0 💬 0 📌 1

RAPIDS | GPU Accelerated Data Science Open source GPU accelerated data science libraries

I just heard about rapids.ai which is a concrete effort to do all the data science, etc. things on GPUs

25.08.2025 21:24 — 👍 2 🔁 0 💬 1 📌 0

a green witch singing into a microphone with the words in the year 2000 ALT: a green witch singing into a microphone with the words in the year 2000

14.08.2025 10:18 — 👍 0 🔁 0 💬 0 📌 0

GitHub - AnswerDotAI/llms-txt: The /llms.txt file, helping language models use your website The /llms.txt file, helping language models use your website - AnswerDotAI/llms-txt

Something like github.com/AnswerDotAI/... ?

18.07.2025 10:03 — 👍 2 🔁 0 💬 1 📌 0

Was thinking about this re: “wow I should really get better and writing clear and consistent documentation for my repos so my agents know how to use it”

15.07.2025 16:48 — 👍 0 🔁 0 💬 0 📌 0

Tools: Code Is All You Need The solution to agentic flows was code all along.

Check out lucumr.pocoo.org/2025/7/3/too... from @mitsuhiko.at if you haven’t… basically saying that CLIs >>> MCP (e.g., gh vs GitHub MCP)

15.07.2025 06:51 — 👍 0 🔁 0 💬 1 📌 0

I love your concept about building bespoke dev tools (like ways to search logs) for the agents - would love to hear about more of these and how you approach building them!

10.07.2025 23:49 — 👍 4 🔁 0 💬 0 📌 0

Latest posts by archtoad.bsky.social on Bluesky

@archtoad is following 20 prominent accounts