@tuetschek.bsky.social
Teaching computers to talk at Charles University. (Computational) linguistics, politics, climate, public transit. He/him.
Picture of the One Pillar Pagoda in Hanoi, a pagoda raised up over a green pond surrounded by greenery
The registration page for #INLG2025 is now live! Join us in Vietnam at the Oct 29 - Nov 2 for the best conference on #NaturalLanguageGeneration
2025.inlgmeeting.org/registration...
Curious to see what will be presented? Check out this list of accepted papers! 2025.inlgmeeting.org/accepted-pap...
Check out the slides from our SCAI'2025 #convsearch workshop collocated with @ijcai.org #IJCAI2025 on LLMs, retrieval & QA, recommendations, negotiations, evaluation and transparency
scai.info/scai-2025
@patuchen.bsky.social @maik-froebe.bsky.social @tuetschek.bsky.social @mila-quebec.bsky.social
Our paper "OpeNLGauge: An Explainable Metric for NLG Evaluation with Open-Weights LLMs" has been accepted to #INLG2025 conference!
You can read the preprint here: arxiv.org/abs/2503.11858
It's fine by me if they generate it, as long as it works and they know how... but I've been getting loads of roughly plausible but non-functional code, with hallucinated API calls etc. ๐. Not that many emojis though (in docs only).
03.08.2025 16:22 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0FreshTab: Sourcing Fresh Resources for Table-to-Text Generation Evaluation
by @navitas.bsky.social, โช@oplatek.bsky.socialโฌ, โช@zdenekkasner.bsky.socialโฌ, @tuetschek.bsky.social .bsky.socialโฌ
ReproHum #0669-08: Reproducing Sentiment Transfer Evaluation
by @navitas.bsky.social, M. Lango, @patuchen.bsky.social, @tuetschek.bsky.social
Challenge to reproduce human evaluations from NLP papers, testing the reproducibility of evaluation studies
OpeNLGauge: An Explainable Metric for NLG Evaluation with Open-Weights LLMs
by @ivankartac.bsky.social, M. Lango, @tuetschek.bsky.social
arxiv.org/abs/2503.11858
Open-source NLG evaluation metric that explains errors and matches human judgments without proprietary models
#ACL2025NLP in Vienna ๐ฆ๐น starts today with 23 ๐คฏ @ufal-cuni.bsky.social folks presenting their work both at the main conference and workshops. Check out our main conference papers today and on Wednesday ๐
28.07.2025 07:27 โ ๐ 22 ๐ 8 ๐ฌ 1 ๐ 1ICML found hidden prompts in accepted papers. They have released a statement icml.cc/Conferences/...
Yes, itโs unacceptable. So is using an LLM to review a paper. Peer review is so broken.
#Eurovision is tonight, and here's a hilarious fun fact about it: Israel has started a massive offensive against civilian populations in Gaza with the explicit aim of conquering the entire territory and ethnically cleansing its population, and Eurovision has aggressively refused to give a shit.
17.05.2025 22:48 โ ๐ 2764 ๐ 1042 ๐ฌ 10 ๐ 20It is a little weird to me countries arenโt more aggressively, formally trying to take advantage of the U.S. science brain drain. Once in a lifetime opportunity to buy low on Non-Dumbass Americans with PhDs who just wanna look into microscopes and quietly cure ass cancer as our country eats shit.
04.05.2025 20:54 โ ๐ 35035 ๐ 4855 ๐ฌ 1251 ๐ 425Slides and links to papers at bit.ly/mlprague25-od ๐ค
02.05.2025 19:25 โ ๐ 2 ๐ 2 ๐ฌ 0 ๐ 0The ๐Machine Learning Prague 2025๐ is happening right now! Today, @patuchen.bsky.social and @navitas.bsky.social presented their posters on text generation with LLMs. Also, don't miss @tuetschek.bsky.social's invited talk tomorrow at 11 a.m.
29.04.2025 14:08 โ ๐ 11 ๐ 5 ๐ฌ 1 ๐ 0๐จ NEW WORKSHOP ALERT ๐จ
We're thrilled to announce the first-ever Tokenization Workshop (TokShop) at #ICML2025 @icmlconf.bsky.social! ๐
Submissions are open for work on tokenization across all areas of machine learning.
๐
Submission deadline: May 30, 2025
๐ tokenization-workshop.github.io
โis my calculator horny?โ our tech columnist asks. โi entered 5318008 into it and turned it upside down. what i saw surprised meโ
24.04.2025 18:24 โ ๐ 16640 ๐ 3813 ๐ฌ 162 ๐ 48How do LLMs compare to human crowdworkers in annotating text spans? ๐ง๐ค
And how can span annotation help us with evaluating texts?
Find out in our new paper: llm-span-annotators.github.io
Arxiv: arxiv.org/abs/2504.08697
Participate in the ๐ CRAC 2025 Shared Task on Multilingual Coreference Resolutionโ ufal.mff.cuni.cz/corefud/crac25
If you have not already done so, register first. ๐ Then start discovering how words refer to each other in 1๏ธโฃ7๏ธโฃ languages. This year includes a new โจLLMโจ track ๐ฎ.
A 3-year full-time post-doc position in Prague! I'll be grateful for reposts. Feel free to get in touch if you have questions. linguistlist.org/issues/36/10...
24.03.2025 16:10 โ ๐ 35 ๐ 38 ๐ฌ 0 ๐ 0Itโs kind of quaint to think the big worry a few years ago was that AI chatbots would destroy humanity. Weโre quite capable of doing that without their help.
22.03.2025 11:09 โ ๐ 141 ๐ 11 ๐ฌ 6 ๐ 1๐จโ๐ป๐ฉโ๐ป Pod vedenรญm @ufal-cuni.bsky.social #MFFUK @unikarlova.cuni.cz se zaฤรญnรก budovat rodina velkรฝch jazykovรฝch modelลฏ pro vลกechny evropskรฉ jazyky. V Karolinu dnes odstartoval mezinรกrodnรญ projekt @openeurollm.bsky.social. ๐
www.ukforum.cz/rubriky/aktu...
Sadly they're not "exposed" to the millions of people who don't follow politics and consume shitty media ๐
27.02.2025 20:30 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0The CVPR program chairs acted on the reviewing crisis in ML/CV conferences. This is a first!
Papers of authors who acted as irresponsable reviewers have been desk rejected. 19 papers are affected in 2025.
This has also been announced for ICCV 2025.
The 6th International Workshop on Designing Meaning Representation (#DMR2025) will be in Prague, Aug 4-5, right after #ACL2025 in Vienna!
Submit your work on meaning representations: annotation, parsing, multilinguality, neuro-symbolic methods & more. Details: dmr2025.github.io/index
Ok here's the theory so far for how the LLM-generated ARR submission experiment went down. Researchers generated papers, then created fake reviewer profiles by identifying existing papers from a real author st the set of papers would maximize expertise affinity score wrt the LLM-generated paper.
12.02.2025 00:37 โ ๐ 27 ๐ 7 ๐ฌ 2 ๐ 1Iโm doing the same. Itโs important to do what we can. Iโve changed to Signal where possible, Proton mail, Startpage browser/search or , Qwant search on Firefox. This is a good site for European alternatives: european-alternatives.eu
23.02.2025 11:04 โ ๐ 69 ๐ 24 ๐ฌ 3 ๐ 2The image shows a ChatGPT conversation where someone requested a recipe for pancakes using whole rolled oats without a blender, with measurements in grams. The recipe ingredients are listed as: - 150g whole rolled oats - 250ml milk (dairy or plant-based) - 2 large eggs (approximately 100g) - 30g melted butter (or neutral oil) + extra for cooking - 20g sugar (optional, adjust to taste) - 8g baking powder (about 2 tsp) - 2g salt (about 1/2 tsp) - 5ml vanilla extract (optional)
This image shows a webpage from "the kitchn" website featuring a recipe for "Easy Oatmeal Pancakes." The recipe appears to be highly rated with 5 stars from 64 reviews. The page shows a navigation path of RECIPES > BREAKFAST > PANCAKES, and there's a banner at the top advertising "Our Laziest, Most Delicious Dinners Ever (Ready in 30 Minutes or Less)." The image also shows part of a photo featuring what appears to be melted butter being whisked into a mixture. There's a Google sign-in prompt overlaying part of the page.
The rise of ChatGPT isn't just about AIโit's about how cluttered the web has become. Compare these recipes: ChatGPT gives me ingredients instantly, while a cooking website shows me an auto-playing video and a prompt to log into Google so I can see an ad
09.02.2025 09:29 โ ๐ 99 ๐ 12 ๐ฌ 13 ๐ 5We've been making the media rounds!
๐๐บ @hajicjan.bsky.social talked about the new OpenEuroLLM project on Czech TV's Studio 6 www.ceskatelevize.cz/porady/10969...
๐๐ป @tuetschek.bsky.social discussed #LLMs on Czech Radio radiozurnal.rozhlas.cz/proc-umela-i...).
Good that they see Trump for what he is, but it's ridiculous they rank Putin so low ๐
01.02.2025 18:19 โ ๐ 5 ๐ 0 ๐ฌ 1 ๐ 0