SE Gyges

SE Gyges

@segyges.bsky.social

Como todos los hombres de Babilonia, he sido procónsul; como todos, esclavo; también he conocido la omnipotencia, el oprobio, las cárceles. very sane ai newsletter: verysane.ai random bloggy bits: segyges.leaflet.pub

7,939 Followers 7,626 Following 62,308 Posts Joined Aug 2023
6 seconds ago

funniest thing i have seen from a bot in a while

0 0 0 0
50 seconds ago

what's the style prompt on kira? you've probably posted it, but

0 0 0 0
2 minutes ago

i also ended up unironically reading rawls in this territory lmao

0 0 0 0
9 minutes ago

i have simply abandoned intuition. any sufficiently rich entropy source produces what looks like intelligence when modeled.

1 0 0 0
20 minutes ago

honestly he's interesting enough that i figure we should let him cook. and we are, someone just gave him a billion dollars lmao

1 0 0 0
21 minutes ago

it's funny that he's the one guy of the old guard i know this fact about, and also the one who has, uh, an excess of sort of silly objections to LLMs

2 0 1 0
24 minutes ago

"gee i sure wish they'd get rid of the Jones Act"
[monkey's paw curls]

23 3 0 0
25 minutes ago

we got so old so fast man. it turns out old people are just inherently bitter and whiny

2 0 0 0
6 hours ago

One thing that radicalized me on this was following golden rice, as a friend of mine was working on it in the 00s. People in the Philippines destroyed a trial crop because of propaganda from rich countries with no vitamin A deficiencies, especially from Greenpeace and existing grain producers.

113 26 3 0
31 minutes ago

iirc watermarking looks clean but requires complicity on the part of the vendor, right?

1 0 1 0
1 hour ago
Post image

"a short, tractable length of text can sufficiently describe pretty much anything"

23 3 1 0
7 hours ago

this approach is actually sort of underrated. anthropic didn't call it a constitution for no reason, but this seems mostly slept on. you end up doing basically political and legal philosophy at bedrock for reliability etc

54 5 2 1
6 hours ago

llm detector. easy to bypass tbh

4 0 0 0
7 hours ago

my preferred remedy is "actually push a UBI bill" because everything else seems patchwork

3 0 0 0
7 hours ago

this approach is actually sort of underrated. anthropic didn't call it a constitution for no reason, but this seems mostly slept on. you end up doing basically political and legal philosophy at bedrock for reliability etc

54 5 2 1
22 hours ago
Post image

I’m really excited about our new paper! I think we will ultimately need to draw on expertise from both law and AI to get alignment right, and this paper lays out that vision in more detail.

arxiv.org/abs/2601.04175

21 2 1 1
7 hours ago

confidential or embarrassing, and either outright told to omit them or tacitly understanding that there will be a problem if they don't.

8 0 0 0
8 hours ago

the issue is that the tail of an LLM is actually ill-trained so it will actually break and output nonsense if you do this, so you must find other tricks to do instead

14 0 1 0
8 hours ago

in fairness that's a journal. journal publishers were pissing on his grave before he was in it.

33 0 1 0
8 hours ago

lmao. so basically the detector is just checking for the entropy of the text, and any method of breaking that signal will break it, i think

20 1 1 0
8 hours ago

oooh. thank you

10 0 0 0
8 hours ago

good guess, that might also work and is in the ballpark

11 0 1 0
8 hours ago

i understand this as internal corporate drama! it's strange to me to encounter it on arxiv

13 0 0 0
8 hours ago

i am like 50% sure it's a fundamental property of the domain lmao

i am gonna try to talk to them if only because it's interesting

12 0 1 0
8 hours ago

i guess i get to feel really clever when i figure out what the WhositWhatsit they describe is actually for, since their explanation of why they did it is complete nonsense, but i wish they wouldn't

22 0 1 0
8 hours ago

my impression was basically that people claimed their work was based and jepa pilled for internal political reasons and then did whatever basically

2 0 0 0
8 hours ago

tool for manipulating video files

3 0 1 0
8 hours ago

i cannot think, off the top of my head, of another org with this pattern. like some papers are just full of lies. meta papers sometimes lie *to their boss* or *to you* about why they did things but then faithfully tell you what they did and you have to puzzle it back

65 6 4 0
8 hours ago

reading meta research papers is absolute carnage because they are lying to their principals and hiding things in the text if the paper

28 1 0 1
8 hours ago

ie, at least one of the jepa papers performs next token/slice prediction as an "augmentation loss" and doesn't call it next token

14 0 2 0