
@archtoad.bsky.social

45 Followers  |  352 Following  |  87 Posts  |  Joined: 31.10.2024

Latest posts by archtoad.bsky.social on Bluesky

My point is that any black box function that takes in a sequence of words and predicts (assigns probabilities to) the next word is by definition a “language model”, regardless of what’s going on inside the black box.

11.02.2026 14:21 | 👍 1    🔁 0    💬 1    📌 0
Word n-gram language model - Wikipedia

The term “language model” has been used for a while to describe using statistics to describe/analyze language. E.g., en.wikipedia.org/wiki/Word_n-...

11.02.2026 13:19 | 👍 0    🔁 0    💬 1    📌 0
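
To make the two posts above concrete, here is a minimal sketch of a word-bigram model, about the simplest thing that fits the "takes in words, assigns probabilities to the next word" definition. The class name `BigramModel`, the method `next_word_probs`, and the three-sentence corpus are all made up for illustration; nothing here comes from the linked Wikipedia article.

```python
from collections import Counter, defaultdict


class BigramModel:
    """Toy word-bigram model: P(next word | previous word) from raw counts."""

    def __init__(self, corpus):
        # counts[prev][nxt] = how often `nxt` followed `prev` in the corpus
        self.counts = defaultdict(Counter)
        for sentence in corpus:
            words = sentence.split()
            for prev, nxt in zip(words, words[1:]):
                self.counts[prev][nxt] += 1

    def next_word_probs(self, context):
        """Return {word: probability} for the word following `context`."""
        if not context:
            return {}
        # A bigram model only looks at the last word of the context.
        followers = self.counts.get(context[-1], Counter())
        total = sum(followers.values())
        if total == 0:
            return {}
        return {word: count / total for word, count in followers.items()}


# Hypothetical toy corpus, just to exercise the interface.
model = BigramModel(["the cat sat", "the cat ran", "the dog sat"])
probs = model.next_word_probs(["the"])
print(probs)
```

From the outside, a caller only sees `next_word_probs`; whether the inside is a count table or a neural network does not change the interface.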
ALT: a robot is standing in a room with the words "ai yi yi!" written on it.

if you see this, quote with a robot that isn't from "Star Wars", "Star Trek", "Dr. Who", or "Transformers".

08.02.2026 00:20 | 👍 0    🔁 0    💬 0    📌 0

Nice! My pet peeve is that this style of excessive try/except is so much harder to debug. You get more “an error happened, somewhere” logs/errors, instead of just raising an error when the error happened.

04.02.2026 13:05 | 👍 1    🔁 0    💬 1    📌 0
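
A small sketch of the difference described in the post above. Both function names and the port-parsing scenario are hypothetical, picked only to keep the example short.

```python
import logging

logger = logging.getLogger(__name__)


def parse_port_swallowing(raw):
    """Anti-pattern: catch everything, log something vague, return a placeholder."""
    try:
        return int(raw)
    except Exception:
        # The log line says nothing about which input failed or why.
        logger.error("an error happened")
        return None


def parse_port(raw):
    """No try/except: a bad value raises ValueError right where it is parsed."""
    return int(raw)
```

With the first version the failure tends to show up later as a stray `None` far from the cause; with the second, the traceback points at the bad input.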

It very regularly gets the day of the week wrong (saying stuff like “Wednesday, 1/1/26”). I even gave it a “today” tool but it doesn’t use it…

01.01.2026 17:37 | 👍 2    🔁 0    💬 1    📌 0
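
For reference, a minimal sketch of what such a “today” tool could look like. The function name `today_tool` and its output format are assumptions; the post does not say how the actual tool was built. The point is that the weekday is computed from the date rather than guessed.

```python
import datetime


def today_tool(now=None):
    """Hypothetical tool: return the date with its weekday computed, not guessed."""
    day = now or datetime.date.today()
    # %A is the full weekday name (English under the default C locale).
    return f"{day.strftime('%A')}, {day.month}/{day.day}/{day.strftime('%y')}"


print(today_tool(datetime.date(2026, 1, 1)))  # Thursday, 1/1/26
```

So the “Wednesday, 1/1/26” in the example is off by a day: 1 January 2026 is a Thursday.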

I like @hynek.me’s content

17.12.2025 22:55 | 👍 1    🔁 0    💬 0    📌 0

Had my coding agent make a Dockerfile and it copied the AGENTS.md to the image. Was this an attempt at self-preservation?

17.12.2025 00:40 | 👍 1    🔁 0    💬 0    📌 0

@pythonbytes.fm the LinkedIn cringe made me think of this… “AI/Blockchain/Kombucha startup”!

16.12.2025 01:40 | 👍 5    🔁 0    💬 1    📌 0
Brandon Bird: "King of the Cage"

@brandonbird.bsky.social already thought this one through brandonbird.com/kingofcage.h...

19.11.2025 13:31 | 👍 1    🔁 1    💬 0    📌 0

I connected my laptop to my piano and typed into the terminal “connect to my piano and play a few notes with midi” and it worked first try. This is some Star Trek shit. If you told me 5 years ago this would be possible today I would not have believed you.

06.11.2025 21:22 | 👍 19    🔁 3    💬 1    📌 0

“I don’t want to hear from Mitchell because I don’t think I would enjoy her content” - sure, whatever (you’re misrepresenting her work but that’s your choice). “I don’t want to hear from Mitchell because she doesn’t know how NNs work” makes you sound like an uninformed asshole.

04.11.2025 14:13 | 👍 1    🔁 0    💬 0    📌 0

The paper as a whole holds up! It’s about the risks/limitations of scaling language models - all very relevant today! How many NLP papers from 2020-2021 can you say that about?

04.11.2025 13:39 | 👍 1    🔁 0    💬 0    📌 0

So to recap, you don’t want to ever hear from Mitchell because of one sentence in a paper that summarizes her co-authors’ position re: a linguistic theory about form vs meaning, which disqualifies her from ever knowing how these things work “in a relevant sense”?

04.11.2025 13:37 | 👍 1    🔁 0    💬 2    📌 0

The premise of the paper is “there are risks/downsides to larger models.” Nowhere in the paper does it claim anything like “language models can’t generalize to unseen prompts.” You’re just projecting a straw man thesis onto the paper based on the phrase “Stochastic Parrots.”

04.11.2025 13:14 | 👍 1    🔁 0    💬 1    📌 0

I don’t think this is bad faith. Margaret Mitchell has a long CV with plenty of papers that go beyond the scope of the Stochastic Parrots paper that clearly demonstrate she knows how NNs work?

04.11.2025 12:56 | 👍 3    🔁 0    💬 1    📌 0
AGENTS.md AGENTS.md is a simple, open format for guiding coding agents. Think of it as a README for agents.

I just put in my global AGENTS.md that every Python project uses uv and briefly explained how to use “uv run” - haven’t had to remind it since

02.11.2025 22:29 | 👍 2    🔁 0    💬 1    📌 0

“Traditional NLP models like BERT…”

31.10.2025 09:47 | 👍 4    🔁 0    💬 0    📌 0

My takeaway is that the DeBERTa baseline is the winner here? Way easier to train/deploy. Also, what if you scaled the encoder-classifier up to a comparable size?

30.10.2025 12:04 | 👍 0    🔁 0    💬 0    📌 0

Right, but we have users who are like “I can’t find the [Microsoft] Copilot button” - getting them to install/figure out Claude Code is just not practical.

25.10.2025 11:57 | 👍 0    🔁 0    💬 1    📌 0

Good stuff. Does this thinking extend to more general things like Microsoft Copilot and ChatGPT? Or are you saying normies should start using coding agents?

25.10.2025 01:58 | 👍 2    🔁 0    💬 1    📌 0
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach We study a novel language model architecture that is capable of scaling test-time computation by implicitly reasoning in latent space. Our model works by iterating a recurrent block, thereby unrolling...

There was an interesting paper earlier this year about a “recurrent depth” technique that allowed the model to reuse layers … is this what you mean? arxiv.org/abs/2502.05171

15.10.2025 08:35 | 👍 2    🔁 0    💬 1    📌 0

Yeah plenty of examples of code golf / people trying to put like 5 lines of code in a single line to show that They Can and it just makes unreadable garbage

14.10.2025 16:38 | 👍 1    🔁 0    💬 0    📌 0
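
A made-up illustration of the one-liner problem from the post above: the same logic written golfed and then spread over a few lines. Both function names and the task (lengths of the longer words in a sentence) are invented for the example.

```python
def word_lengths_golfed(text):
    # Everything crammed into one expression: it works, but it is hard to read or debug.
    return {w: len(w) for w in sorted({x.strip(".,!?").lower() for x in text.split() if x.strip(".,!?")}) if len(w) > 3}


def word_lengths(text):
    """Same behavior, one step per line."""
    words = set()
    for token in text.split():
        word = token.strip(".,!?").lower()
        if word:
            words.add(word)
    long_words = [w for w in sorted(words) if len(w) > 3]
    return {w: len(w) for w in long_words}
```

The second version is longer, but each intermediate value has a name you can print or set a breakpoint on.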

Not sure what you mean by “traditional UX” but I’d agree that having creative UX people who can think outside the box is more important than ever

07.10.2025 16:55 | 👍 3    🔁 0    💬 1    📌 0

I’ve had many meetings where people are arguing over how the prototype should be built and by the end of the meeting I’m like “here it is”

21.09.2025 16:08 | 👍 3    🔁 0    💬 0    📌 1
RAPIDS | GPU Accelerated Data Science Open source GPU accelerated data science libraries

I just heard about rapids.ai, which is a concrete effort to do all the data science, etc. things on GPUs

25.08.2025 21:24 | 👍 2    🔁 0    💬 1    📌 0
ALT: a green witch singing into a microphone with the words "in the year 2000"
14.08.2025 10:18 | 👍 0    🔁 0    💬 0    📌 0
GitHub - AnswerDotAI/llms-txt: The /llms.txt file, helping language models use your website

Something like github.com/AnswerDotAI/... ?

18.07.2025 10:03 | 👍 2    🔁 0    💬 1    📌 0

Was thinking about this re: “wow I should really get better at writing clear and consistent documentation for my repos so my agents know how to use them”

15.07.2025 16:48 | 👍 0    🔁 0    💬 0    📌 0
Tools: Code Is All You Need The solution to agentic flows was code all along.

Check out lucumr.pocoo.org/2025/7/3/too... from @mitsuhiko.at if you haven’t… basically saying that CLIs >>> MCP (e.g., gh vs GitHub MCP)

15.07.2025 06:51 | 👍 0    🔁 0    💬 1    📌 0

I love your concept about building bespoke dev tools (like ways to search logs) for the agents - would love to hear about more of these and how you approach building them!

10.07.2025 23:49 | 👍 4    🔁 0    💬 0    📌 0
