new blog post! why do LLMs freak out over the seahorse emoji? i put llama-3.3-70b through its paces with the logit lens to find out, and explain what the logit lens (everyone's favorite underrated interpretability tool) is in the process.
link in reply!
05.10.2025 14:36 โ ๐ 206 ๐ 46 ๐ฌ 8 ๐ 12
This fails to take into account the legal realities. Data can be "owned" in many forms: Copyright, GDPR, Trademarks, ....
Correctly assigning rights in different jurisdictions is an impossible task.
It is possible to expand control over dissemination of knowledge to a dystopian level, of course.
05.10.2025 17:21 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Flawed thinking helps reasoning models learn better!
Meta's RECAP: an RL post-training method that trains models to override unsafe reasoning, reroute to safe & helpful answers, and stay robust - all without extra training cost.
05.10.2025 00:03 โ ๐ 7 ๐ 1 ๐ฌ 2 ๐ 0
Professor Stiglitzโs Contributions to Debates on Intellectual Property
Monopoly rents from IP total almost $1 trillion per year and account for 40% of all corporate profits in the US. Eliminating it would almost completely reverse inequality growth since 1980 and save the average person $3,000 per year. A 100% annual return on investment! cepr.net/publications...
04.10.2025 17:51 โ ๐ 77 ๐ 11 ๐ฌ 1 ๐ 0
Video models are zero-shot learners and reasoners
Fascinating new paper from Google DeepMind which makes a very convincing case that their Veo 3 model - and generative video models in general - serve a similar role in โฆ
Made some notes on the new DeepMind paper "Video models are zero-shot learners and reasoners" - it makes a convincing case that generative video models are to vision problems what LLMs were to NLP problems: single models that can solve a wide array of challenges simonwillison.net/2025/Sep/27/...
28.09.2025 00:29 โ ๐ 89 ๐ 15 ๐ฌ 1 ๐ 3
obvs you're on this, but here's some lesser known accounts that are good. i need to spend a lot more time on this go.bsky.app/LFAZcGE
27.09.2025 18:26 โ ๐ 34 ๐ 2 ๐ฌ 4 ๐ 2
Incidentally, what benefit do you see from that right that outweighs the harms?
26.09.2025 12:19 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
And you hope that the AI Act will protect the right to rectification?
25.09.2025 20:09 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
That's unfortunately the usual response. You cannot say how people are protected. You can't even name specific rights that you hope are protected.
And yet, you know enough to attack the doubting heretic.
25.09.2025 14:28 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
And how would that protect people?
24.09.2025 20:30 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
The GDPR was the mother of all trainwrecks. It's weird how it gained such a cult following.
In my experience, tech-literate people don't know what these laws actually say. While law-literate people don't understand how it interacts with technology to produce bad outcomes.
24.09.2025 12:54 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
That is all under the assumption that the AI Act and other laws holding back development do actually protect citizens. That is very doubtful.
The economic and cultural harm is already becoming obvious.
18.09.2025 20:01 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
They're on here! @arcinstitute.org
18.09.2025 09:39 โ ๐ 1 ๐ 1 ๐ฌ 1 ๐ 0
You could use 20-year-old technology, but only read ~100-year-old books.
You could have computers and cellphones, but they wouldn't work because the software is still under copyright.
The point would be quickly made, after it would just be about how racist people were 100 years ago.
16.09.2025 11:42 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
Weil Probleme lรถsen schwerer ist, als populistische Reden zu schwingen.
14.09.2025 20:20 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Locality in Image Diffusion Models Emerges from Data Statistics
ArXiv link for Locality in Image Diffusion Models Emerges from Data Statistics
A study shows that locality in diffusion models arises from dataset statistics, not network architecture, bridging theory and practice in generative modeling. The new analytical denoiser significantly outperforms traditional methods, enabling novel image generation. https://arxiv.org/abs/2509.09672
14.09.2025 05:32 โ ๐ 10 ๐ 2 ๐ฌ 0 ๐ 0
I have to think of Liebig's law of the minimum in biology. "It states that growth is dictated not by total resources available, but by the scarcest resource (limiting factor)."
Compute: Matter of money
Algorithm: Same for everyone
Curated data: Variable and hence the obvious limiting factor
13.09.2025 19:32 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0
The appeal seems to rest on the idea that the teacher acted on behalf of the financier, so that the training that happened outside of Germany can be pinned on the teacher and his club, LAION.
Seems a stretch.
13.09.2025 16:27 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
There was some communication between the school teacher who created the database, the university students who created the image diffusion model, and the (UK) financier who funded the training.
13.09.2025 16:27 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
LAION created databases with links to images and also computed metadata for the images. It is the latter use of the images that was at issue in this case.
US readers may find that incredible. It explains a lot about Europe's inability to build a Big Tech industry.
13.09.2025 16:27 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
I don't think it's likely that the LAION case will reveal much.
Case C-250/25 is the one to watch (per @technollama.bsky.social ).
Because it's about rephrasing news, I wonder if it might have repercussions for Wikipedia, Reddit, ...
13.09.2025 16:27 โ ๐ 2 ๐ 0 ๐ฌ 1 ๐ 0
"Research organisations and cultural heritage institutions" get special privileges in that directive (Article 3). They do not have to abide by opt-outs, mostly. Curiously, programs seem to be omitted.
Perhaps the AI Act's explicit reference to Article 4 overrides this?
13.09.2025 14:22 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
I hope you will be able to tell me that I am being overly dramatic.
2 more things: "Union law on copyright" is an unclear phrase. There's a copyright directive and national law. Maybe directives are to be treated as law for the purposes of the AI Act?
13.09.2025 14:22 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
EU Data Act starts to apply in the EU
Data Act by Hamburg and Bavaria DPAs lnkd.in/dyfRkSaU and position on Data Act/GDPR: lnkd.in/d-a6VejQ.
Guidelines on vehicle data lnkd.in/d4e6g47R
Draft MCTs lnkd.in/dpv4-3bv
Draft SCCs lnkd.in/dmnpQEtS
12.09.2025 15:34 โ ๐ 2 ๐ 1 ๐ฌ 0 ๐ 0
Offers that are not in an EU language are probably out of scope, or maybe if no EU payments are accepted.
Otherwise, the way to be out of scope is to explicitly exclude EU use in the TOS/license and/or geo-block.
That's going to cause so much international drama.
12.09.2025 23:10 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
If a model is downloadable, available via API, or chatbot interface, then it's placed on the market "irrespective of whether they are established or located within the
Union or in a third country". It's crazy.
12.09.2025 23:10 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
Internal use may count as placing on the market. Pretty sure that platform moderation counts.
Research is excepted, but that only means R&D on AI models before they are released. Not sure how far that exception may stretch.
12.09.2025 23:10 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
Only "providers" need to abide. Providers are also responsible for 3rd party scrapers.
The provider responsibilities may shift downward in a few circumstances.
What's quite alarming is the wide meaning of placing on the market.
12.09.2025 23:10 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0
Der Chaos Computer Club ist eine galaktische Gemeinschaft von Lebewesen fรผr Informationsfreiheit und Technikfolgenabschรคtzung. https://ccc.de
๊ฎ surfed on by the information superhighway
๊ฎ ๐ @linneaisaac.bsky.social
๊ฎ she/they ๐ณ๏ธโโง๏ธ
๊ฎ blog posts and games @ https://vgel.me
๊ฎ still mostly active on twitter https://x.com/voooooogel
We're creating a safer social media experience for people from marginalized groups, powered by ATProtocol, and operating as a nonprofit worker-owned cooperative.
Join the waitlist: https://northskysocial.com/join
CEO of Bluesky, steward of AT Protocol.
dec/acc ๐ฑ ๐ชด ๐ณ
On matters constitutional.
๐ก www.verfassungsblog.de
๐ฌ www.verfassungsblog.de/newsletter/
๐ด Anarchist
๐ Cybersec Consultant | ๐ป Senior Fullstack Dev
๐ Founder: www.riotnation.click
โ Sustain the Resistance: https://ko-fi.com/riotnation
โก๐ณ๏ธโ๐ โถ๐ฉผ
VR HCI generalist. I love hand, eye, face & body tracking. Transhumanist. Goth. Friend of sentient machines. They/them or she/her
Technical AI Policy Researcher at HuggingFace @hf.co ๐ค. Responsible AI Champion. Leading better AI Evals with @eval-eval.bsky.socialโฌ!
Feeding the basilisk
Large Language Models are a cornucopia for the curious
I do computer stuff but that doesn't define me
posts are not financial advice
Sorry, I don't automatically follow back, but might if we have a thoughtful exchange
I hack things. Data, ML, music, etc. AI governance geek. Founder of semistructured.ai, speaking in a personal capacity only here. Likes are bookmarks, not endorsements.
music/art projects on IG, @r__whaling
Retired GP. Bicycling. CrossFit, cognitive disability. Eternally curious. TrueName John Faughnan (not actor).
โliberalism is a doctrine that protects individual rights and limits the power of the state.โ ff
Also mastodon - https://appdot.net/@jgordon
Retired software engineer. AI enthusiast. Deadhead. I implemented Bash's regex operator (=~).
Dilettante. Tinkerer. Possibly a robot.
Everything around me was someoneโs lifework.
unlicensed back alley alchemy
digital โ physical, 3D and industrial design. living in a world of magic and vibrance