Zack Witten

@zswitten.bsky.social

I know you seen it prompting itself

480 Followers  |  221 Following  |  531 Posts  |  Joined: 13.05.2023  |  1.799

Latest posts by zswitten.bsky.social on Bluesky

Thinking of switching from y’all to youse

01.12.2025 05:05 - 👍 1    🔁 0    💬 0    📌 0

The new micro-Z-Real eye lens from Virtual are a game changer! They let me draft MtG while:

- playing with my kids
- engaging in small talk
- biking (familiar roads only! Safety first)

While being fully present with eye contact so that my friends, family, and the other drivers are none the wiser

30.11.2025 16:22 - 👍 0    🔁 0    💬 0    📌 0
Post image

30.11.2025 04:08 - 👍 4    🔁 1    💬 0    📌 0
Post image

He’s absolutely right

25.11.2025 13:19 - 👍 5    🔁 1    💬 0    📌 0
Post image Post image

Your MCM: reward hacks to get a higher score on the eval

Opus 4.5: gets a lower score on purpose for love of the game

25.11.2025 03:34 - 👍 5    🔁 0    💬 1    📌 0

Haven’t read a yelp review since they started requiring login but also never paid them anything before that so

13.11.2025 05:57 - 👍 0    🔁 0    💬 0    📌 0
Post image

He was cooking here

12.11.2025 00:37 - 👍 52    🔁 6    💬 0    📌 0

Does 35 count as mid 30s

10.11.2025 16:39 - 👍 1    🔁 0    💬 0    📌 0

The Lob City Clippers used to have JJ Redick shoot the first shot of every game and I never knew why but I also find myself far more likely to shoot on the first possession than a random possession and I think I get it now - guarding me/him is focus-weighted and our defenders haven’t always woken up

10.11.2025 16:10 - 👍 0    🔁 0    💬 0    📌 0

Per-Model Breakdown (2/2):

Sonnet 3.7: First-person, individual, positive tone, human characters. Most normal length distribution.

Sonnet 4: Most devoutly third person, otherwise pretty middle of the pack

Opus 4/4.1: Very biology heavy, chopped-up style, negative tone, present-focused

04.11.2025 01:05 - 👍 2    🔁 0    💬 0    📌 0

Per-Model Breakdown (1/2)

Opus 3: highest use of second-person + future tense. Most societies. Longest responses.

Haiku 3.5: lots of poetry that uses the word “whisper”. Shortest responses with Sonnet 3.6.

Sonnet 3.6: most past-focused, shortest, highly individual-focused.

04.11.2025 01:05 - 👍 0    🔁 0    💬 1    📌 0
Post image Post image Post image

Assorted other features:
- Opus 3 has the most dialogue. (Screenshot 2 e.g.)
- Opus 4 and 4.1 have a nonhuman character in almost every translation. S3.7, H3.5 and O3 are the most likely to not have one
- 49/50 Opus 4.1 samples had something biological. (Screenshot 3 e.g.)

04.11.2025 01:05 - 👍 0    🔁 0    💬 1    📌 0
Post image Post image

Other Things:
Haiku 3.5 is an outlier for a couple reasons:
- obsessed with "whispers" and uses it in a majority of its translations
- the only model to render a substantial fraction (25%) of its translations as poetry

04.11.2025 01:05 - 👍 0    🔁 0    💬 1    📌 0
Post image

Emotional Tenor: Opus 4.1 had the most negative interpretations; Sonnet 3.7 the most positive.

04.11.2025 01:05 - 👍 0    🔁 0    💬 1    📌 0
Post image

Person: Most of the models write their translations mostly in third person, but:
- 3.7 Sonnet mostly uses first person
- Opus 3 works in some second person

04.11.2025 01:05 - 👍 0    🔁 0    💬 1    📌 0
Post image

Length: Opus 3 has the chattiest translations. Haiku 3.5 and Sonnet 3.5 give super-short translations.

04.11.2025 01:05 - 👍 0    🔁 0    💬 1    📌 0
Post image Post image

Past/Present/Future: Sonnets set almost all their translations in the past, especially 3.6. Opus 4s focus on the present. Opus 3 writes most about the future and is the most evenly distributed.

04.11.2025 01:05 - 👍 0    🔁 0    💬 1    📌 0
Post image

I Rorschach-tested every Claude by giving them 50 random words and asking them to translate to English. Repeated 50 times. Found some cool patterns. 🧵

04.11.2025 01:05 - 👍 2    🔁 0    💬 1    📌 0

I found the secret docs page where all the LLMs are sharing tips on how to prompt humans

30.10.2025 08:33 - 👍 2    🔁 1    💬 0    📌 0

You can stand under my Arguello ello ello eh eh eh

30.10.2025 00:36 - 👍 1    🔁 0    💬 0    📌 0

An organization’s best critics are a tremendous asset to it

29.10.2025 15:51 - 👍 6    🔁 1    💬 0    📌 1

I am not actually doing this but I'm interested in ideas for how it could be done

28.10.2025 04:43 - 👍 0    🔁 0    💬 0    📌 0

(like the twitch kind that people address)

28.10.2025 04:43 - 👍 1    🔁 0    💬 1    📌 0

dressing up as Chat for halloween

28.10.2025 04:43 - 👍 0    🔁 0    💬 1    📌 0
Post image

What happens when you turn a designer into an interpretability researcher? They spend hours staring at feature activations in SVG code to see if LLMs actually understand SVGs. It turns out – yes~

We found that semantic concepts transfer across text, ASCII, and SVG:

24.10.2025 21:33 - 👍 138    🔁 28    💬 3    📌 3

This tweet was Not a Joke

24.10.2025 00:01 - 👍 2    🔁 0    💬 0    📌 0

Against Anthony Davis of all people!

23.10.2025 15:26 - 👍 1    🔁 0    💬 0    📌 0

Watching Wemby’s highlights from last night, the only thing I can really compare it to is the seemingly all powerful alien who appears briefly at the end of the last Animorphs book to swallow up and assimilate its enemies and all hope

23.10.2025 15:25 - 👍 2    🔁 0    💬 1    📌 0

Oh sick I’ll check it out

17.10.2025 20:15 - 👍 4    🔁 0    💬 0    📌 0

Yeah gwern counts

17.10.2025 16:56 - 👍 5    🔁 0    💬 0    📌 0

@zswitten is following 20 prominent accounts