Very interesting paper that shows that LLMs are good. But not good enough. I wish the following from the conclusion had made it into the abstract:
08.10.2025 06:49
@sherbold.bsky.social
https://www.fim.uni-passau.de/ai-engineering/
02.10.2025 14:23
Just accepted at TMLR:
We found evidence of copyright violations by LLMs even when we asked questions that were not part of the training data. Indeed, we found that the amount of memorized content was independent of whether the questions were part of the training data.
openreview.net/forum?id=ddo...
This just in: Leading AI firm discovers confidence thresholds. More on this exciting development in news at 11.
openai.com/index/why-la...
(Honestly, OpenAI!?)
Scientific impact and achievement, redefined:
Huge congrats to #Fraunhofer IIS on winning an #Emmy for their JPEG XS compression standard […]
re
(I miss IRC)
(Now I feel old)
Dear all,
please enjoy your complimentary "European Professor goes on Holiday" message.
See you in September.
Yours sincerely,
A European Professor
Good news (for me!): my gender bias paper from 2023 still replicates with GPT-5.
Bad news (for everyone!): my gender bias paper from 2023 still replicates with GPT-5.
arxiv.org/pdf/2308.14921
hkotek.com/blog/gender-...
I wonder what my PhD students will think, once they discover that "someone" glued the three laws to the wall in the hallway.
06.08.2025 14:01
Newton's Laws of Graduation, Part 2 - The Second Law
04.08.2025 18:47
Newton's Laws of Graduation, Part 3 - The Third Law
06.08.2025 12:50
Success, a luxury problem, and its solution:
Our quiz is a huge success and incredibly popular on YouTube, now with over 100,000 views.
We can no longer answer all the feedback and comments individually.
We are writing a follow-up article to answer the most important questions.
It is official, our two long papers at #ACL2025 have now been published. Joint work with Arne Rubehn (Concept Embeddings) and with Frederic Blum and @sherbold.bsky.social (Automated Language Affiliation).
aclanthology.org/2025.acl-lon...
aclanthology.org/2025.acl-lon...
My debut as a TV show moderator - now live on YouTube.
We had a lot of fun with how the five professors answered questions on topics ranging from '90s music to counting peas to the size of Asian countries.
The only drawback: it is only available in German.
P.S. The humans won.
How does AI fare against professorial expertise? The quiz show, moderated by @sherbold.bsky.social, is now online in full length. Anyone who would like to compete against the AI themselves beforehand can do so via the online quiz: www.digital.uni-passau.de/beitraege/20...
#KeepCALLM #5gegenKI
That was so much fun. I look forward to the video.
18.07.2025 08:22
I'll just leave that quote here ...
16.07.2025 10:34
Yesterday: Let's try to ground AI models in reality.
Now: Let's try to ground reality on AI models.
Fixes a lot of issues. I am impressed.
They should call it LLM as a Physicist, then it gets accepted by the community ... right? (Looking at you, everybody trusting LLM as a judge!)
Happy to share that we published MAMUT at @tmlrorg.bsky.social. We defined multiple data augmentation approaches to get more diverse mathematical data and show that this improves pre-training.
Congrats to my student Jonathan Drechsel on his first publication!
www.fim.uni-passau.de/en/ai-engine...
No need, I can already feel the @icseconf.bsky.social paper bidding approaching.
11.07.2025 16:40
Starting the weekend on a Friday at 4pm with an empty inbox feels kind of strange. Good, but strange.
11.07.2025 14:09
How much energy is needed to generate an image?
Up to 4.08 Wh, like charging your phone to 40%!
In our new study we tested 17 models & 9,000+ runs.
Other key finds:
Model energy use varies up to 46x
Resolution matters, prompts don't
π οΈ Quantization β savings
Preprint: lnkd.in/dKWWAETW
Can #LLMs open up new access to the law? Brian Valerius, Professor of AI in Criminal Law (#KI im #Strafrecht), discusses this with attorney Sven Galla, who already uses AI in practice.
Thursday, July 10, 6 pm
Lecture hall 13 (Hörsaal 13)
More info: www.digital.uni-passau.de/generative-s...
#KeepCALLM
The truly impressive thing about Zoom is that whenever they update the UI, it gets worse.
04.07.2025 07:02
New pre-print: If you are wondering which models are good for non-code software engineering tasks, take a look at this work from my student Fabian Pena.
Also: Look at it if you want to know how to use Bayesian stats for ranking models.
arxiv.org/abs/2506.10833
Reviews so far this year: 11 journal papers, 8 conference papers, and 6 registered report protocols.
And I already feel like I decline almost all incoming requests...
Congrats especially also to our main author Frederic Blum!
25.06.2025 20:21
Wow, our ACL paper with @lingulist.de has been selected for an oral presentation at the ACL main conference - less than 10 percent of the accepted papers get this honor 🤯
25.06.2025 20:20
“Over four months, LLM users consistently underperformed at neural, linguistic, and behavioral levels. These results raise concerns about the long-term educational implications of LLM reliance and underscore the need for deeper inquiry into AI's role in learning.” arxiv.org/abs/2506.08872
16.06.2025 07:49
The big lawsuit from Disney and Co has finally arrived. This will be interesting: arstechnica.com/ai/2025/06/i...
11.06.2025 19:00