's Avatar

@krumeto.bsky.social

Senior Data Scientist at Financial Times. Python/R, ML, NLP, but also the occasional local politics. Dad of two girls. Opinions are my own.

69 Followers  |  285 Following  |  48 Posts  |  Joined: 01.11.2024  |  2.4212

Latest posts by krumeto.bsky.social on Bluesky

David Greybeard, a wild chimpanzee, gets a handout of bananas from author Goodall, who studies the apes under a National Geographic Society grant. This scene near
her camp represents a triumph for Miss Goodall; at first the animals fled if she came
within 500 yards. Knapsack holds camera and notebook. She carries a whistle in her
pocket to summon searchers in case of accident in the rugged Tanganyika hills.

David Greybeard, a wild chimpanzee, gets a handout of bananas from author Goodall, who studies the apes under a National Geographic Society grant. This scene near her camp represents a triumph for Miss Goodall; at first the animals fled if she came within 500 yards. Knapsack holds camera and notebook. She carries a whistle in her pocket to summon searchers in case of accident in the rugged Tanganyika hills.

Had never read Jane Goodall's original 1963 article in Nat Geo on the wild chimpanzees in Tanzania until now. It's a wonderful blend of science and journalism, and well worth your time.

www.nationalgeographic.com/pdf/jane-goo...

01.10.2025 20:15 β€” πŸ‘ 23    πŸ” 8    πŸ’¬ 2    πŸ“Œ 1
Post image

EU–INC is the single best thing Europe could do to catch-up in the AI race

A simple unified pan-European startup structure, with modern employee ownership and simple access to capital, able to tap into Europe’s full talent pool.

‼️ but it’s at high risk of not seeing the light of day. You can helpπŸ‘‡

27.09.2025 10:52 β€” πŸ‘ 36    πŸ” 13    πŸ’¬ 3    πŸ“Œ 0
Post image

A cool use of AI in newsgathering, this. US companies are talking more and more about the risks rather than the benefits of AI in their SEC filings (while still being super-optimistic in earnings calls). By @melissahei.bsky.social @chriscook.news & @claradoodle.bsky.social www.ft.com/content/e93e...

23.09.2025 11:45 β€” πŸ‘ 46    πŸ” 17    πŸ’¬ 1    πŸ“Œ 2

Andor would want you to cancel your Disney+ subscription.

It's the least you can do for the Rebellion.

20.09.2025 04:15 β€” πŸ‘ 13281    πŸ” 2947    πŸ’¬ 487    πŸ“Œ 133
Video thumbnail
16.09.2025 14:40 β€” πŸ‘ 10326    πŸ” 4365    πŸ’¬ 79    πŸ“Œ 232
Introducing gpt-realtime Released a few days ago (August 28th), gpt-realtime is OpenAI's new "most advanced speech-to-speech model". It looks like this is a replacement for the older gpt-4o-realtime-preview model that was rel...

An update on this: I've confirmed that "gpt-realtime has a mix of data specific enough to itself that its not really 4o or 5" - see quote from OpenAI at bottom of simonwillison.net/2025/Sep/1/i...

02.09.2025 16:46 β€” πŸ‘ 13    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
The perils of vibe coding I was interviewed by Elaine Moore for this opinion piece in the Financial Times, which ended up in the print edition of the paper too! I picked up a copy …

I was interviewed in the Financial Times about "the perils of vibe coding", and it made it into the print edition! simonwillison.net/2025/Aug/29/...

29.08.2025 18:04 β€” πŸ‘ 115    πŸ” 8    πŸ’¬ 4    πŸ“Œ 1
Post image Post image

Big news - Anthropic agrees to settle with authors in copyright lawsuit.

As tech bros like to say, β€˜we’re only just getting started’.

www.reuters.com/sustainabili...

26.08.2025 19:01 β€” πŸ‘ 28    πŸ” 12    πŸ’¬ 3    πŸ“Œ 0
Preview
Leaked Memo: Anthropic CEO Says the Company Will Pursue Gulf State Investments After All β€œUnfortunately, I think β€˜No bad person should ever benefit from our success’ is a pretty difficult principle to run a business on,” wrote Anthropic CEO Dario Amodei in a note to staff obtained by WIRE...

β€œIt’s perfectly consistent to advocate for a policy of β€˜No one is allowed to do x,’ but then if that policy fails and everyone else does X, to reluctantly do x ourselves.” πŸ€”

X=help dictators

www.wired.com/story/anthro...

22.07.2025 01:34 β€” πŸ‘ 63    πŸ” 16    πŸ’¬ 4    πŸ“Œ 4

A link to the Bulgarian model, in case somebody is interested - huggingface.co/HPLT/hplt_be...

18.07.2025 12:21 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

That was my theory going into the piece, but it turned out to be not only false but seemingly the opposite: the biggest employment gains young male grads have made in the past year have been in software and engineering jobs

18.07.2025 11:31 β€” πŸ‘ 104    πŸ” 16    πŸ’¬ 3    πŸ“Œ 0

Hah, missed this that other say

Elon: "It is surprisingly hard to avoid both woke libtard cuck and mechahitler!"

(Narrator: this was not surprising at all)

15.07.2025 18:38 β€” πŸ‘ 48    πŸ” 2    πŸ’¬ 6    πŸ“Œ 0

That's fun!

14.07.2025 15:25 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Advertising, with guardrails and healthy metrics, funds a healthy media ecosystem.

13.07.2025 18:20 β€” πŸ‘ 3    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0
Preview
Hoping Bluesky shines, and the cesspool, X, sinks to the bottom - The Boston Globe Bluesky, a relatively new social media platform in the mold of what's now Elon Musk's site, is prioritizing sports content to grow.

Sunday media. www.bostonglobe.com/2025/07/12/s...

13.07.2025 04:26 β€” πŸ‘ 217    πŸ” 40    πŸ’¬ 32    πŸ“Œ 12
Post image

A very good, and rather sad, Big Read on children and reading from @emmavj.bsky.social

on.ft.com/4krOKZA How to get children reading again

04.07.2025 07:20 β€” πŸ‘ 260    πŸ” 97    πŸ’¬ 42    πŸ“Œ 32
Preview
ChatGPT referrals to news sites are growing, but not enough to offset search declines | TechCrunch Not surprisingly, organic traffic has also declined, dropping from over 2.3 billion visits at its peak in mid-2024 to now under 1.7 billion.

ChatGPT referrals to news sites are growing, but not enough to offset search declines

03.07.2025 10:46 β€” πŸ‘ 15    πŸ” 5    πŸ’¬ 2    πŸ“Œ 0
Post image

‼️Sentence Transformers v5.0 is out! The biggest update yet introduces Sparse Embedding models, encode methods improvements, Router module for asymmetric models & much more. Sparse + Dense = πŸ”₯ hybrid search performance!

Details in 🧡

01.07.2025 14:00 β€” πŸ‘ 17    πŸ” 5    πŸ’¬ 1    πŸ“Œ 0

Wild.

30.06.2025 00:29 β€” πŸ‘ 51    πŸ” 13    πŸ’¬ 4    πŸ“Œ 3
At no point in this study did I use AI. This includes the literature review, logical organization, writing, analysis, and proofreading. I do not cite any studies that use AI, to the extent that it is possible to know.

At no point in this study did I use AI. This includes the literature review, logical organization, writing, analysis, and proofreading. I do not cite any studies that use AI, to the extent that it is possible to know.

Yesterday I used this "AI -Free Statement" in a conference talk for the first time. Still trying to figure out the exact language.

Feel free to borrow, modify, etc.

27.06.2025 12:12 β€” πŸ‘ 2375    πŸ” 621    πŸ’¬ 80    πŸ“Œ 79

Honestly, copyright never seemed like the right approach to this to me, so I'm not surprised.

Copyright law was designed in the era of books, and only slightly updated for the internet. It was meant to stop people literally *copying* a work. It has no appropriate tools to deal with generative AI.

26.06.2025 14:12 β€” πŸ‘ 6    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0

Once again, it must be said as loudly as possible: AI can't do the things described in this paragraph, and there is no pathway to it ever being able to do them.

25.06.2025 11:41 β€” πŸ‘ 2030    πŸ” 656    πŸ’¬ 79    πŸ“Œ 33
Preview
The copyright war between the AI industry and creatives We have surely gone beyond being able to give the tech sector the benefit of the doubt

The copyright war between the AI industry and creatives https://on.ft.com/3T450Vm | opinion

23.06.2025 04:22 β€” πŸ‘ 44    πŸ” 10    πŸ’¬ 3    πŸ“Œ 4

It’s fascinating to see Microsoft and Apple go in such different directions when it comes to AI.

Microsoft went all-in and wrote β€œSparks of AGI” www.microsoft.com/en-us/resear...

Apple has done the bare minimum and just wrote β€œThe Illusion of Thinking” ml-site.cdn-apple.com/papers/the-i...

08.06.2025 12:40 β€” πŸ‘ 66    πŸ” 16    πŸ’¬ 6    πŸ“Œ 4
Post image

South Korea gender divide update 😲

Young men lean right by 50 points
(74% conservative vs 24% centre-left)

Young women lean left by 22 points
(58% centre-left vs 36% cons)

04.06.2025 10:01 β€” πŸ‘ 1127    πŸ” 372    πŸ’¬ 51    πŸ“Œ 183

One of the things I most resent about AI is the way it has inserted itself, unwelcomed, between me and my students, especially the ones I don't know all that well yet. I hate that every time I read a piece of student writing a small piece of me is thinking "is this a robot or a person?"

02.06.2025 16:07 β€” πŸ‘ 1246    πŸ” 233    πŸ’¬ 30    πŸ“Œ 22
Agents are models using tools in a loop I was going slightly spare at the fact that every talk at this Anthropic developer conference has used the word "agents" dozens of times, but nobody ever stopped to provide …

Anthropic's Hannah Moran finally addressed the elephant in the room at this conference when she subtly dropped "Agents are models using tools in a loop" during the intro to the "Prompting for Agents" workshop simonwillison.net/2025/May/22/...

22.05.2025 19:28 β€” πŸ‘ 37    πŸ” 7    πŸ’¬ 1    πŸ“Œ 5

Buried in the tax bill: "no State or political subdivision thereof may enforce any law or regulation regulating artificial intelligence models, artificial intelligence systems, or automated decision systems during the 10-year period beginning on the date of the enactment of this Act."

22.05.2025 14:22 β€” πŸ‘ 2078    πŸ” 1070    πŸ’¬ 123    πŸ“Œ 302
Preview
disco-eth/EuroSpeech Β· Datasets at Hugging Face We’re on a journey to advance and democratize artificial intelligence through open source and open science.

EuroSpeech: Massive Multilingual Parliamentary Speech Corpus

- 78,100+ hours across 22 European languages
- 50,500+ hours of quality-filtered data (CER < 20%)
- Robust alignment algorithm for non-verbatim texts
- Dramatically expands resources for 19+ languages

huggingface.co/datasets/dis...

21.05.2025 07:58 β€” πŸ‘ 36    πŸ” 11    πŸ’¬ 0    πŸ“Œ 1
Post image

Today, we’re announcing the preview release of ty, an extremely fast type checker and language server for Python, written in Rust.

In early testing, it's 10x, 50x, even 100x faster than existing type checkers. (We've seen >600x speed-ups over Mypy in some real-world projects.)

13.05.2025 17:00 β€” πŸ‘ 332    πŸ” 84    πŸ’¬ 14    πŸ“Œ 14

@krumeto is following 20 prominent accounts