David Rein's Avatar

David Rein

@idavidrein.bsky.social

sentio ergo sum. developing the science of evals at METR. prev NYU, cohere

439 Followers  |  95 Following  |  6 Posts  |  Joined: 27.04.2023  |  1.612

Latest posts by idavidrein.bsky.social on Bluesky

Hey, study author hereโ€”I think this is an overgeneralization.

We find that *experienced, open-source developers working on projects theyโ€™re highly familiar with* are slowed down. This is consistent with many developers being sped up often, e.g. when writing one-off scripts

11.07.2025 00:30 โ€” ๐Ÿ‘ 3    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1

Uber driver is telling me how excited he is that he just got this new car because he totaled his previous one ๐Ÿ˜…

24.01.2025 05:34 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

had a similar experience in africaโ€”zebra and wildebeest were just chilling like right next to lions

28.11.2024 21:44 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Daily life at METR:

25.11.2024 19:35 โ€” ๐Ÿ‘ 3    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Testing, testing

25.11.2024 19:24 โ€” ๐Ÿ‘ 12    ๐Ÿ” 3    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1

If interpretability gave us as much info as transparent cases did, weโ€™d be a lot further along than we currently are

27.04.2023 04:35 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

omg so true, thanks for letting me know!!

27.04.2023 04:33 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

@idavidrein is following 19 prominent accounts