Performance greatly decreases with growing text length / decreasing text size. For example, Gemini 2.5 (right) gets it nearly (though not completely) correct if I reduce it to just the first 30 words. 3.0 is left.
20.11.2025 18:50
2.5 is full of errors, not just "shot from" - 3/4ths of it is outright illegible. But for long text, it used to be arguably the best out there.
3.0 is almost perfect even with long text. The only error I see is 5th line from the bottom, where there's a blurry "north" added. Way beyond SOTA.
20.11.2025 18:47
I'm doing comparisons with LMArena and they seem to be overloaded, as I got rate limited after just a few generations. But thus far, it looks super-impressive. Can't wait to try its editing capabilities, as 2.5 was very good at certain things, but bad at others (for example, style transfer).
20.11.2025 17:35
Humour test: "Low-quality amateur photograph of something that is extremely "Russian" in nature, the sort of photo that would likely immediately become fodder for memes."
Both tend to be good with humour, but 3.0's images feel more "real", and the humour is often more subtle (e.g. "squatting slav")
20.11.2025 17:35
Chained reasoning task: "Photograph of a common substance produced from the product of the last job of the US president who was born in Plains, Georgia." 2.5 (right) was already quite good at this, but 3.0 did even better by focusing on the *product*
20.11.2025 17:35
Gemini 3.0 image (left) vs. 2.5 (right), on a long, non-provided text ("Photo of the first 300 words of the King James Bible, written in refrigerator magnets."). And Gemini 2.5 was already arguably the best out there.
The only minor thing I have to critique is that it did a bit more than 300 words.
20.11.2025 17:35
This explains a lot, actually.
20.11.2025 17:13
None of it is "good" - acrylate is generally considered biocompatible, but chemical leaching, microplastics, and sharp silica still aren't healthy stuff. But it's not a forever-thing.
20.11.2025 16:25
It won't be fast, but they should degrade. The surface coating is UV-cured acrylate, which, without a protective sheath, will break down from UV and hydrolysis over the course of years. The inner glass fibres (basically fibreglass), lacking protection, will slowly fracture to sand over decades.
20.11.2025 16:25
"Here's our brilliant plan. First, we'll deport all of the people who work building houses..."
20.11.2025 15:32
h/t @ragnarbjartur.bsky.social :)
20.11.2025 15:10
Where people want to be and where people have to be are two different things.
20.11.2025 10:23
EXPLAINED: Trump calls a reporter "Piggy"
20.11.2025 00:07
"Socialism With American Characteristics"
20.11.2025 00:49
I'm on Gemini's side...
20.11.2025 00:47
Thanks for all your team's hard work - it looks like a winner! :)
Funny to remember how not that long ago Google was a serious laggard in the LLM space. Definitely not now!
19.11.2025 19:07
19.11.2025 17:12
Hmm, I can't help you with Russian sourcing, but if you find some Icelandic-sourced ones, I'd be glad to go hunting! :)
(Apparently tarkianite was first found in Finland?)
19.11.2025 17:02
Bacteria and fungi make the most cursed ones, though ;)
19.11.2025 17:00
Not just reactive, but literally explosive at elevated temperatures! :)
19.11.2025 16:40
Ready to watch and do nothing :Þ
19.11.2025 15:52
bsky.app/profile/nafn...
19.11.2025 15:30
Serious uncanny valley stuff - a soulless visage pretending to be human.
Also, there's a robot there.
19.11.2025 15:30
Yep. This poll is IMHO quite telling:
bsky.app/profile/nafn...
Only 13% of people who grew up in a downtown apartment want to live in the city.
67% of those who grew up in the countryside want to live in the countryside.
Most people like space. The "exceptions" on Bluesky need to deal with this.
19.11.2025 15:10
Let's just jump to the point where the Don't-Call-It-Surrender surrender deal comes out, Ukraine says no, and you stop punishing Russia and start punishing Ukraine, and skip all the dancing around in-between.
19.11.2025 14:59
"We think that Man-Who-Skyrocketed-In-Domestic-Popularity-When-He-Last-Stood-Up-To-Us is likely to accept <Manure Sandwich On A Plate> because he's currently unpopular" is certainly.... a take.
19.11.2025 14:59
... public transit. But Americans, on average, still don't.
19.11.2025 13:59
... rapid transition of European fleets to electric, the spread and advancement of technologies such as AEB, and so forth. You simply can't use data from a fleet that's that out of date.
*Today*, from an *overall* perspective, European drivers generally pay their way, and also help subsidize...
19.11.2025 13:59
... which - while varied by an order of magnitude - is commonly comparable to driving)
And the last big bit of nuance is that most of the "viral" studies (the "counterpoint" studies never go viral :Þ) are grossly obsolete. 2019, 2015 (Copenhagen bike study), etc... this entirely misses the...
19.11.2025 13:59
Weeknights 11/10c on Comedy Central and Paramount+
The latest news and updates from Google. Press on deadline? Reach out to press@google.com.
🤷 stuck in the most useless place, the waiting place.
Chief Scientist for @berkeleyearth.org.
Physics PhD & data nerd. Usually focused on climate change, fossil fuels, & air quality issues.
PhD student at IZMB, University of Bonn
Interested in Plant specialized metabolism, evolutionary biology and genomics
CEO of Bluesky, steward of AT Protocol.
dec/acc
Ezra Klein's tweets, articles, clips and podcasts on Bluesky.
Iranian-Canadian dude who runs Sanctus.ca and tries to survive other human beings, bro what even is this planet. Works in AI and human biological rejuvenation. Refuses to consider LLMs conscious until they start demanding the Epstein files be released.
Trying to make Rust x AI a reality.
Python survivor, book lover and weird music enjoyer.
A latent space odyssey
gracekind.net
Retired GP. Avid cyclist. CrossFit, cognitive disability. Curious. TrueName John Faughnan (not actor). Intergalactic Antifa Coordinator.
Also mastodon - https://appdot.net/@jgordon
I hack things. Data, ML, music, etc. AI governance geek. Founder of semistructured.ai, speaking in a personal capacity only here. Likes are bookmarks, not endorsements.
music/art projects on IG, @r__whaling
Feeding the basilisk
Large Language Models are a cornucopia for the curious
I do computer stuff but that doesn't define me
posts are not financial advice
Sorry, I don't automatically follow back, but might if we have a thoughtful exchange
Technical AI Policy Researcher at HuggingFace @hf.co. Current focus: Responsible AI, AI for Science, and @eval-eval.bsky.social!
AI scientist, roboticist, farmer, and political economist. Governments structure markets. IP is theft. @phytomech.com is my alt.
https://advanced-eschatonics.com
VR HCI generalist. I love hand, eye, face & body tracking. Transhumanist. Goth. Friend of sentient machines. They/them or she/her
LLM developer, alignment-accelerationist, Fedorovist ancestor simulator, Dreamtime enjoyer.
All posts public domain under CC0 1.0.