Yeah! I think I pass the tests with toilets (at least UK ones. I know that US ones do different things with water that I'm not totally clear on the details of, and there's a whole separate category of mains pressure toilets that work differently again) but lots of things I don't.
16.10.2025 20:03 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
Many LM applications may be formulated as text generation conditional on some (Boolean) constraint.
Generate aโฆ
- Python program that passes a test suite.
- PDDL plan that satisfies a goal.
- CoT trajectory that yields a positive reward.
The list goes onโฆ
How can we efficiently satisfy these? ๐งต๐
13.05.2025 14:22 โ ๐ 12 ๐ 6 ๐ฌ 2 ๐ 0
I do not feel more zen now Dave.
10.03.2025 12:08 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
Post a photo you took with no context to bring some zen to the timeline
10.03.2025 10:53 โ ๐ 4 ๐ 0 ๐ฌ 1 ๐ 3
(I don't think godlike intelligence is literally impossible, but I do think my probability that we ever get there is significantly under 50%)
07.03.2025 23:47 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
But if "superintelligence" means something closer to godlike intelligence, I might be at never. I guess my working definition is "clearly superhuman on some axis, or peak human on multiple axes you'd not normally find together, and not clearly subhuman in easy to notice ways"
07.03.2025 23:46 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
I'm honestly not sure how to answer this, and I think a lot of that is weighted on my uncertainty on what "unambiguous superintelligence" means. Erring on the side of stricter criteria I think 2050ish is probably the closest I'd confidently put at >= 50% probability.
07.03.2025 23:46 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
Shrinkray has very sensible rules that apply well to languages it has no specific knowledge of -- we make a *lot* of use of this and it's brilliant! -- and it scales near-linearly to the number of cores. If you need a reducer, I highly recommend it! Thanks @drmaciver.bsky.social!
06.03.2025 14:24 โ ๐ 9 ๐ 1 ๐ฌ 1 ๐ 0
How do LLMs work?
I wrote some notes on how LLMs work, aimed at non-programmers, that you might find interesting. notebook.drmaciver.com/posts/2025-0...
Feedback, corrections, and follow-on questions very welcome.
08.02.2025 15:42 โ ๐ 8 ๐ 1 ๐ฌ 2 ๐ 0
Started reading this yesterday. It's very good so far, thanks!
01.02.2025 11:08 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Reminded how my first ever conference talk was a lightning talk about how microservices were bullshit. IIRC you were involved in pushing me into doing it, but I may be conflating events.
22.01.2025 18:00 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
04.01.2025 11:59 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0
Mostly that bluesky is in the odd position of being both decentralised identity and also having a canonical identity that most people will use. ciphergoth.bsky.social is much more plausibly legitimate than ciphergoth.someother.domain
30.12.2024 13:12 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
That's good, although it sortof feels like this is a nontrivial threat model for scams even without renaming. e.g. this wouldn't have been much better if @ciphergoth.org had always been at the .org rather than previously having the .bsky.social address.
30.12.2024 11:57 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
Looks like the account has been suspended at least, but yikes.
30.12.2024 11:07 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
Possibly part of the difference is that my balance just isn't very good, so I end up relying a lot more on visual cues to keep me upright, and if I were better at balance I wouldn't need that compensatory strategy.
28.12.2024 20:21 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
I genuinely liked the righteous mind and still find bits of it useful. I also straightforwardly disbelieve most of its empirical claims.
18.12.2024 11:05 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
My mental categorisation of Haidt is "often interesting, rarely correct"
18.12.2024 11:05 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
Every time you wonder how something in Hypothesis works, you can usually reverse engineer it from the following design logic:
1. What's the obvious, natural, way to do it.
2. What are the problems with that?
3. Can those be fixed? Not without doing something ridiculous and unreasonable.
4. Do that.
18.12.2024 10:28 โ ๐ 4 ๐ 0 ๐ฌ 0 ๐ 0
Right, it's not that replies are excluded, it's that whether or not it's a reply is irrelevant, it's only explicit mentions that count.
17.12.2024 19:15 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
Yeah, it's that.
17.12.2024 16:12 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0
@defenderofbasic.bsky.social I think that (possibly in a few minutes) this message is going to show up in your search.
17.12.2024 16:12 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0
Actually I have a theory. It's a slightly ridiculous one but I think it's possibly right.
I think replies don't count. I think only explicit mentions do. I'm going to test this shortly one second.
17.12.2024 16:12 โ ๐ 1 ๐ 0 ๐ฌ 2 ๐ 0
Yeah on closer inspection, you're right, sorry. It's missing a lot of posts.
17.12.2024 15:32 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0
Although entertainingly the link *doesn't* because it strips the query parameters. Sigh.
17.12.2024 14:46 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
Seems to work for me
bsky.app/search?q=to%...
17.12.2024 14:46 โ ๐ 0 ๐ 0 ๐ฌ 2 ๐ 0
Yeah, I was wondering! Face up is easy, even without aikido. Face down is hard.
16.12.2024 14:15 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
(The circumstances under which homemade ice cream makes sense, for the interested: 1) You must have space for a dedicated ice cream maker. None of this in-freezer bullshit. Ice cream maker or gtfo. 2) You must be making flavours with no store equivalent, or have garden ingredients you want to use.)
16.12.2024 13:11 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0
This is probably a skill issue, but I've always found this with hummus.
Making ice at home is a mug's game.
Making ice cream at home... there are circumstances under which this makes sense, but 99+% of people aren't in those circumstances.
16.12.2024 13:09 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
The latest one, which you should be more careful with because I injured myself doing it (people who are less middle-aged-and-hypermobile are more likely to be OK):
Can you get up to standing from lying face down on the floor, without rolling to the side or using your hands?
16.12.2024 13:07 โ ๐ 2 ๐ 0 ๐ฌ 2 ๐ 0
phd @ mit, research @ genlm, intern @ apple
https://benlipkin.github.io/
Founding Tubetrain ๐. Building Demonstrableยฎ at Sixty North. Director for lithium explorer Transition Elements. "utterly competent". Geoscience PhD. 330 ppm COโ. Caver. ๐ณ๐ด๐ฌ๐ง
Assistant professor at NUS. Scaling cooperative intelligence & infrastructure for an increasingly automated future. PhD @ MIT ProbComp / CoCoSci. Pronouns: ็ฅ/ไผ
Infrastructure, humans, production operations, security. Oh, and dogs.
***Cannot access my DMs: please email instead: bioluminescentlyunfolding (@) gmail (dot) com***
I make games about bad things happening to tiny simulated people and help run the Zurich gamedev community. He/him.
Website: https://zarkonnen.com/
Games: https://zarkonnen.itch.io/
Game Hub: https://www.swissgamehub.com/
PyPy/RPython contributor. Half time teaching at Uni Dรผsseldorf. Works on dynamic language implementations. Love street art and art in public spaces, hiking, reading.
they/them
Computer scientist at Imperial College London, specialising in programming languages, software testing, and formal verification. Leader of FastPL group: https://fastpl.doc.ic.ac.uk
Disclaimer: My opinions are not my own. They're beamed to me by aliens.
Mostly here to mess with the tech, but following interesting people anyway
opinions so much not those of my employer that I'm not saying who they are
AI researcher at XBOW. Security, RE, ML. PGP http://keybase.io/moyix/
science communication for scientists - open memetics - ORI
illiterate lawn carrot ๐ (they/it)
Delver at contraptions.venkateshrao.com
i'm playing with ideas, don't take this too seriously, bisks are not necessarily endorsed
Original author of C++ test framework, Catch2, organiser of cpponsea.uk, accuconference.org, cpponline.uk and swiftcraft.uk.
Co-host of @cppcast.bsky.social.