accidentally caused Nano Banana's system prompt to leak, but in a fun way
06.10.2025 03:15 β π 84 π 11 π¬ 6 π 4@mpbontenbal.bsky.social
Complex beings as us humans can not be summarised in a few lines but I am here for #AI #climate #EUtech #EU_politics. Lecturer in AI & IT. Some posts in Dutch. πͺπΊ
accidentally caused Nano Banana's system prompt to leak, but in a fun way
06.10.2025 03:15 β π 84 π 11 π¬ 6 π 4Do AI reasoning models abstract and reason like humans?
New paper on this from my group:
arxiv.org/abs/2510.02125
π§΅ 1/10
Seems to me a great pelican, if not the best, but yeah 6 min is too much.
06.10.2025 20:06 β π 1 π 0 π¬ 0 π 0Performing great on the benchmark says little about model quality.
06.10.2025 19:53 β π 1 π 0 π¬ 1 π 0We all have one, but are too ashamed to admit it.
My excuse is 'We do it for the kids'.
Lees ook βActive Measuresβ van Thomas Rid over de 100+ jaar ervaring van de KGB.
05.10.2025 17:53 β π 0 π 0 π¬ 0 π 0Sorry, that was too quick. Including the tail it was 3 hours!
05.10.2025 15:01 β π 1 π 0 π¬ 0 π 0The demonstration passed my house. The red line demo lasted 2, 5 hours!
05.10.2025 14:44 β π 2 π 0 π¬ 1 π 0and the addition of chapter 5, the first of 3 chapters covering the ins and outs of building MCP servers. If you subscribe to @oreilly.bsky.social's learning platform, you can spend your weekend with the book now here: learning.oreilly.com/library/vie...
03.10.2025 23:00 β π 5 π 2 π¬ 1 π 0I haven't posted about Model Spec's in a while, but Dean gave me a shoutout on my earlier writing on them, so its time to say definitively again that every frontier lab should have a model spec. It builds long term trust with users, developers and regulators.
02.10.2025 16:03 β π 9 π 2 π¬ 1 π 0Yes agree that would be nice addition to this figure!
02.10.2025 10:59 β π 0 π 0 π¬ 0 π 0Every week there are new models, but to get further in AI as a community we need open source models, that also disclose the data they are trained on.
#learnAI
Given the βstrategic autonomyβ debates here in πͺπΊ, I do not see a full takeover happen.
01.10.2025 21:40 β π 1 π 0 π¬ 0 π 0We're announcing a new update to MTEB: RTEB
It's a new multilingual text embedding retrieval benchmark with private (!) datasets, to ensure that we measure true generalization and avoid (accidental) overfitting.
Details in our blogpost below π§΅
OpenAI employees are very excited about how well their new AI tool can create fake videos of people doing crimes and have definitely thought through all the implications of this
30.09.2025 23:24 β π 10804 π 3296 π¬ 220 π 596Imagine with Claude
generative UI? like for real? as the user clicks buttons the UI manifests itself π€―
youtu.be/dGiqrsv530Y
if my coding agent is the example... then my commerce agent will buy all sorts of stuff that I do not need... ;-)
29.09.2025 18:34 β π 3 π 0 π¬ 0 π 0Wrote up my initial impressions of the brand new Claude Sonnet 4.5 - I think it may live up to Anthropic's claims of being the "best coding model in the world", for the next few weeks at least!
simonwillison.net/2025/Sep/29/...
he will not leave. not by himself.
29.09.2025 13:15 β π 0 π 0 π¬ 0 π 0In a future of AGI, LLM's have their role. It might be an important role, but LLM alone are not enough. (Same with self-driving cars, CNN or ViT alone are not enough, but are one part of the solution).
29.09.2025 07:14 β π 0 π 0 π¬ 0 π 0En de raad van state is niet politiek, dus kan zich niet verdedigen. Ze weet dat ze feitenvrij wat kan roepen.
29.09.2025 06:05 β π 2 π 0 π¬ 0 π 0Artikel van @marchijink.bsky.social.
29.09.2025 05:35 β π 0 π 0 π¬ 0 π 0Groter is niet altijd beter in de wereld van AI www.nrc.nl/nieuws/2025/... (10x gratis). Met @maartengr.bsky.social )
29.09.2025 05:23 β π 0 π 0 π¬ 1 π 0Want to visualize the response format constraints on the LLM when working in a Jupyter notebook?
Then you might be interested in my new project `litelines`.
Litelines lets you visualize the selected path by the LLM.
It supports a Pydantic schema as a response format, as well as regular expressions.
I believe Sutton is right. There is a lot of value in LLM's and many other ML architectures, but it falls short of true intelligence.
28.09.2025 15:46 β π 2 π 0 π¬ 1 π 0Mooie oefening voor mijn it studenten: stel eisen op voor de btw berekening van 1) een losse appel 2) een voorverpakte salade waar ook kip en croutoms in zitten. 3) een broodje met gegrilde groentes. Succes!
28.09.2025 07:54 β π 0 π 0 π¬ 0 π 0πͺπΊπͺπΊπͺπΊ
27.09.2025 22:12 β π 1 π 0 π¬ 0 π 0And new paper out: Pleias 1.0: the First Family of Language Models Trained on Fully Open Data
How we train an open everything model on a new pretraining environment with releasable data (Common Corpus) with an open source framework (Nanotron from HuggingFace).
www.sciencedirect.com/science/arti...
@berthub.eu
27.09.2025 12:15 β π 1 π 0 π¬ 0 π 0