Steffen Herbold sherbold - Bluesky Statics

A new study led by @sherbold.bsky.social from our university is investigating who is liable when #AI is misused to create child sexual abuse material. The authors reach a clear conclusion: "Anyone who develops AI must implement clear protective mechanisms – both technical and legal."

14.01.2026 10:25 — 👍 1 🔁 1 💬 0 📌 0

Announcing the #ICML2026 policy for self-ranking in reviews!
1. Authors rank their submissions
2. Reviews are submitted
3. The "Isotonic Mechanism" is run on rankings and review scores
4. Large discrepancies are flagged to ACs and SAC

CC @wjsu.bsky.social

Post: blog.icml.cc/2026/01/12/i...

12.01.2026 15:06 — 👍 6 🔁 6 💬 1 📌 1

This is not even considering the dilemma group leaders face when they are forced to rank their possibly very different, but still all high-quality publications of multiple group members ...

12.01.2026 16:14 — 👍 1 🔁 0 💬 0 📌 0

So authors wo submit a lot of low-quality papers will get special attention for their good papers if the scores are low.

Authors with a single paper will not get special attention when their scores are low.

Cool. I see no way how this could backfire 😐

12.01.2026 16:14 — 👍 0 🔁 0 💬 1 📌 0

Proud at my student Anamaria Mojica-Hanke for leading this work. It was a great collaboration with Thomas Goger (prosecutor at Bavaria's cybercrime unit), Brian Valerius (professor for AI in criminal law) and his student Svenja Wölfel.

09.01.2026 14:45 — 👍 0 🔁 0 💬 0 📌 0

tl;dr: The really bad content moderation of X/Grok is exactly what could lead to criminal prosecution for aiding and abetting CSAM generation. Slow take-down and making the content available via X adds more legal peril.

(Note: analysis based on German criminal law.)

09.01.2026 14:45 — 👍 0 🔁 0 💬 1 📌 0

Criminal Liability of Generative Artificial Intelligence Providers for User-Generated Child Sexual Abuse Material The development of more powerful Generative Artificial Intelligence (GenAI) has expanded its capabilities and the variety of outputs. This has introduced significant legal challenges, including gray a...

While the Grok-caused CSAM scandal is happening over on X, our work discussing the possible criminal liability of X (and others when publishing generative models) has been accepted the International Conference on AI Engineering.

The preprint is already online: arxiv.org/abs/2601.03788

09.01.2026 14:45 — 👍 7 🔁 0 💬 1 📌 1

Fun Christmas party of our research group yesterday. Fortunately, we found the exit!

03.12.2025 08:18 — 👍 0 🔁 0 💬 0 📌 0

Vacancies

We have 4 open PhD positions in the Future of Software Engineering (FUSE) lab! Topics:
- code review efficiency
- predictive software testing
- automated code refactoring
- engineering productivity metrics

(1/2)

25.11.2025 12:43 — 👍 5 🔁 1 💬 1 📌 0

YouTube video by Universität Passau Forschung mit Strahlkraft in die Region und darüber hinaus

Talente auf der Bühne, spannende Projekte, lebendiger internationaler Austausch und verborgene Talente: Rückblick auf die Forschungskommunikation 2025 u.a. mit @lingulist.de, @sherbold.bsky.social, @haeussler.bsky.social, @mgrani.bsky.social, @hedwigeisenbarth.bsky.social, @passaudpe.bsky.social:

17.11.2025 13:13 — 👍 3 🔁 4 💬 0 📌 0

The whos, whats, and whys of issues related to personal data and data protection in open-source projects on GitHub - Empirical Software Engineering Data protection regulations such as the General Data Protection Regulation (GDPR) in the European Union and the California Consumer Privacy Act (CCPA) in the US affect how software may handle the pers...

“The whos, whats, and whys of issues related to personal data and data protection in open-source projects on GitHub” by Anne Hennig, Lukas Schulte, Steffen Herbold, Oksana Kulyk, and Peter Mayer will be published in #EMSE! It examines discussions on personal data and data protection on #GitHub. 1/2

07.11.2025 09:58 — 👍 4 🔁 2 💬 1 📌 0

Passau study shows: AI passes as second corrector in exams Researchers at the University of Passau have had human examiners compete against OpenAI's ChatGPT – and were themselves surprised by some of the results. The study has been published in the renowned N...

🤖🎓 How good is #ChatGPT as a university correction assistant? Researchers from our university investigated this question – and were surprised by some of the results. The findings have been published in #ScientificReports @natureportfolio.nature.com

Original study: www.nature.com/articles/s41... 🧪

27.10.2025 08:38 — 👍 2 🔁 1 💬 0 📌 0

MAMUT: A Novel Framework for Modifying Mathematical Formulas for the Generation of Specialized Da...

Jonathan Drechsel, Anja Reusch, Steffen Herbold

Action editor: Hongsheng Li

https://openreview.net/forum?id=khODmRpQEx

#embeddings #notation #representations

20.10.2025 20:18 — 👍 2 🔁 1 💬 0 📌 0

Professor Tomas Sauer (standing) and Professor Christoph Heinzl demonstrate the visualization of a large industrial CT data set in a three-dimensional representation.

🚀 Shaping the future of 4D imaging

How to study the hidden dynamics of materials that transform themselves? Our university is part of the new #MSCA Doctoral Network #XCELERATE, pioneering 4D X-ray tomography methods.

Two fully funded doctoral positions are available at our university:

20.10.2025 07:10 — 👍 1 🔁 1 💬 1 📌 0

Sometimes reviewers still manage to surprise me.

A reviewer suggested, we should do a field study for something were we argue that is criminal.

We are now planning do address this and wondering if we the reviewer rather wants us to commit crimes or to become criminal investigators 🤨

17.10.2025 10:04 — 👍 1 🔁 0 💬 0 📌 0

And now some poor engineer at OpenAI has to design this emoji and add new training data. Good times.

10.10.2025 07:21 — 👍 1 🔁 0 💬 0 📌 0

Very interesting paper that shows that LLMs are good. But not good enough. I wish the following from the conclusion would have made it into the abstract:

08.10.2025 06:49 — 👍 1 🔁 2 💬 0 📌 0

😂

02.10.2025 14:23 — 👍 0 🔁 0 💬 0 📌 0

Studying memorization of large language models using answers to... Large Language Models (LLMs) are capable of answering many software related questions and supporting developers by generating code snippets. These capabilities originate from training on massive...

Just accepted at TMLR:

We found evidence of copyright violations by LLMs even when we ask questions that were not part of the training. Indeed, we found that the amount of memorized content was independent from the questions being part of the training or not.

openreview.net/forum?id=ddo...

10.09.2025 07:12 — 👍 6 🔁 2 💬 0 📌 0

Why language models hallucinate OpenAI’s new research explains why language models hallucinate. The findings show how improved evaluations can enhance AI reliability, honesty, and safety.

This just in: Leading AI firm discovers confidence thresholds. More on this exciting development in news at 11.

openai.com/index/why-la...

(Honestly, OpenAI!?)

09.09.2025 06:31 — 👍 1 🔁 0 💬 0 📌 0

Original post on mastodon.social

Scientific impact and achievement, redefined:
Huge congrats to #Fraunhofer IIS on winning an #Emmy for their JPEG XS compression standard 🏆🎉 […]

04.09.2025 07:58 — 👍 2 🔁 1 💬 0 📌 0

re

(I miss IRC)

(Now I feel old)

01.09.2025 06:38 — 👍 2 🔁 0 💬 0 📌 0

Dear all,

please enjoy your complementary "European Professor goes on Holiday" message.

See you in September.

Yours sincerely,
A European Professor

08.08.2025 18:04 — 👍 6 🔁 0 💬 0 📌 0

Good news (for me!) my gender bias paper from 2023 still replicates with GPT-5.
Bad news (for everyone!) my gender bias paper from 2023 still replicates with GPT-5.
arxiv.org/pdf/2308.14921
hkotek.com/blog/gender-...

08.08.2025 01:19 — 👍 152 🔁 45 💬 1 📌 3

I wonder what my PhD students will think, once they discover that "someone" glued the three laws to the wall in the hallway. 🙃

06.08.2025 14:01 — 👍 2 🔁 1 💬 0 📌 0

Newton's Laws of Graduation, Part 2 - The Second Law

04.08.2025 18:47 — 👍 46 🔁 10 💬 1 📌 1

Newton's Laws of Graduation, Part 3 - The Third Law 😆

06.08.2025 12:50 — 👍 46 🔁 9 💬 3 📌 0

Success, a luxury problem, and its solution:
🎉 Our quiz is a huge success and incredibly popular on YouTube with now over 100,000 views.
😐 We cannot answer all the feedback and comments individually anymore.
😀 We write a follow up article to answer the most important questions.

29.07.2025 09:32 — 👍 3 🔁 0 💬 0 📌 0

Partial Colexifications Improve Concept Embeddings Arne Rubehn, Johann-Mattis List. Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2025.

It is official, our two long papers at #ACL2025 have now been published. Common work with Arne Rubehn (Concept Embeddings), and Frederic Blum and @sherbold.bsky.social (Automated Language Affiliation).

aclanthology.org/2025.acl-lon...
aclanthology.org/2025.acl-lon...

23.07.2025 10:40 — 👍 4 🔁 1 💬 0 📌 0

My debut as TV-Show moderator - now live on Youtube.

We had a lot of fun with how the five professors answered questions on topics ranging from 90's music, counting peas, size of Asian countries, etc.

The only drawback: it is only available in German.

P.S. The humans won.

21.07.2025 09:21 — 👍 5 🔁 1 💬 1 📌 0

Posts by Steffen Herbold (@sherbold.bsky.social)