Madelon Hulsebos @madelonhulsebos

I'm hiring PhDs through the ELLIS PhD program for research on table representation learning to democratize insights from tabular data!

Info: trl-lab.github.io/open-positio...
Apply: apply.ellis.eu (31 Oct)

Join us in beautiful Amsterdam at @cwi-amsterdam.bsky.social & @ellisamsterdam.bsky.social ✨

01.10.2025 18:48 — 👍 5 🔁 1 💬 0 📌 0

This exciting workshop is organized by @effylix.bsky.social, @lennartpurucker.bsky.social, Peter Baile Chen, Frank Hutter and me.

With talks by Marine Le Morvan (Inria), Floris Geerts (University of Antwerp) and @akhtarmubashara.bsky.social (ETH Zurich), and more TBA.

We hope to meet you there!!

23.09.2025 10:44 — 👍 0 🔁 0 💬 0 📌 0

Announcement for the EurIPS'25 Workshop on AI for Tabular Data. More information about the workshop can be found at: https://sites.google.com/view/eurips25-ai-td/home.

🚀 Excited to announce the AI for Tabular Data workshop at EurIPS 2025 in Copenhagen!

CfP: sites.google.com/view/eurips2... (papers due 20 Oct)

Join us @euripsconf.bsky.social to discuss neural tabular models and systems for predictive ML, tabular reasoning and retrieval, table synthesis and more ✨

23.09.2025 10:44 — 👍 7 🔁 3 💬 1 📌 1

VLDB 2025 - Industrial Track Papers List of accepted industrial track papers.

Also hope attendees will enjoy the tutorial program, which I helped compile this year, with exciting tutorials on topics such as vector search, data discovery, graph databases, AI and relational data! Full list at vldb.org/2025/?papers...

01.09.2025 11:05 — 👍 1 🔁 0 💬 0 📌 0

Off to London 🇬🇧 for @vldb.bsky.social!

Looking forward to the panel discussion on neural models and tabular data Tue 10:45 🔥. I’ll share my thoughts on the state of affairs, promising paradigms, and future directions for TRL.

Will share my input and some take-aways of the discussions post conf.

01.09.2025 11:05 — 👍 4 🔁 0 💬 1 📌 0

We show that metrics like sacrebleu and bertscore aren't fit for tabular QA eval of LLMs as scores are inseparable. We also find a large gap between multiple-choice eval as in TQA-Bench and an LLM-judge which aligns with human annotation.

Bottomline: LLMs aren't robust for real-world multi-table QA

30.07.2025 22:24 — 👍 0 🔁 0 💬 0 📌 0

TRL Lab

In our contribution led by @cowolff.bsky.social from the TRL Lab (trl-lab.github.io) we wondered: how well do LLMs reason over tabular data, really? We find that LLMs don't ack nor handle tables with disturbances (eg missing vals or duplicates) necessitating explicit prompting or cleaning pipelines.

30.07.2025 22:24 — 👍 1 🔁 0 💬 1 📌 0

How well do LLMs reason over tabular data, really? Large Language Models (LLMs) excel in natural language tasks, but less is known about their reasoning capabilities over tabular data. Prior analyses devise evaluation strategies that poorly reflect an...

The full program of the workshop is at: table-representation-learning.github.io/ACL2025/. Besides excellent contributed work (proceedings: aclanthology.org/2025.trl-1.0...), we'll have invited talks by Dan Roth, Tao Yu, Edward Choi and Julian Eisenschlos!

30.07.2025 22:24 — 👍 0 🔁 0 💬 1 📌 0

Excited to be at ACL! Join us at the Table Representation Learning workshop tomorrow in room 2.15 to talk about tables and AI.

We also present a paper showing the sensitivity of LLMs in tabular reasoning to e.g. missing vals and duplicates, by @cowolff.bsky.social at 16:50: arxiv.org/abs/2505.07453

30.07.2025 22:24 — 👍 6 🔁 3 💬 1 📌 0

The paper's called:
"How well do LLMs reason over tabular data, really?" 📊

We dig into two important questions:
1️⃣ Are general-purpose LLMs robust with real-world tables?
2️⃣ How should we actually evaluate them? (2/4)

25.07.2025 15:06 — 👍 1 🔁 1 💬 1 📌 0

Very excited about this! Aligns well with my 🇪🇺 focus this year.

17.07.2025 10:43 — 👍 5 🔁 0 💬 0 📌 0

Proceedings of the Workshop on Data Management for End-to-End Machine Learning | ACM Conferences

Join us for discussions and talks on data management aspects for end-to-end ML on 27 June at @deem-workshop.bsky.social in Berlin. Keynotes by @pinartozun.bsky.social and @gaelvaroquaux.bsky.social 🤩

Check the full schedule deem-workshop.github.io#schedule & proceedings dl.acm.org/doi/proceedi...

16.06.2025 07:49 — 👍 8 🔁 5 💬 0 📌 0

Very interesting! I’ve seen synthetic data, with certain patterns (eg TabPFN with SCMs), come real far, but not entirely random compositions of tokens. Do you understand or have an hypothesis for what leads to this observation?

26.06.2025 11:17 — 👍 0 🔁 0 💬 0 📌 0

🚨What is SOTA on tabular data, really? We are excited to announce 𝗧𝗮𝗯𝗔𝗿𝗲𝗻𝗮, a living benchmark for machine learning on IID tabular data with:

📊 an online leaderboard (submit!)
📑 carefully curated datasets
📈 strong tree-based, deep learning, and foundation models

🧵

23.06.2025 10:14 — 👍 13 🔁 8 💬 1 📌 0

Let’s collaborate on democratizing insights from tabular data in Amsterdam! ✨

PhD directions: 1) fundamental techniques for tabular foundation models, 2) reliable mechanisms for AI-powered tabular data analysis.

Sharing w/ friends appreciated! ⬇️

05.06.2025 15:36 — 👍 2 🔁 3 💬 1 📌 0

Proceedings of the Workshop on Data Management for End-to-End Machine Learning | ACM Conferences

Join us for discussions and talks on data management aspects for end-to-end ML on 27 June at @deem-workshop.bsky.social in Berlin. Keynotes by @pinartozun.bsky.social and @gaelvaroquaux.bsky.social 🤩

Check the full schedule deem-workshop.github.io#schedule & proceedings dl.acm.org/doi/proceedi...

16.06.2025 07:49 — 👍 8 🔁 5 💬 0 📌 0

Absolutely! Please send me an email then we can arrange a chat :)

09.06.2025 06:23 — 👍 1 🔁 0 💬 1 📌 0

Let’s collaborate on democratizing insights from tabular data in Amsterdam! ✨

PhD directions: 1) fundamental techniques for tabular foundation models, 2) reliable mechanisms for AI-powered tabular data analysis.

Sharing w/ friends appreciated! ⬇️

05.06.2025 15:36 — 👍 2 🔁 3 💬 1 📌 0

Open positions | TRL Lab

🏹 Job alert: 2 fully-funded PhD Positions at Table Representation Learning Lab - @ellisamsterdam.bsky.social‬

📍 Amsterdam 🇳🇱
📅 Apply by June 30
🔗 More info: https://bit.ly/4519pj1

05.06.2025 15:00 — 👍 1 🔁 3 💬 0 📌 1

"Can LLMs really reason over tabular data, really?"
That’s the title and central question of my first paper in my new role as a PhD student, which has been accepted to the 4th Table Representation Learning Workshop @ ACL 2025! arxiv.org/pdf/2505.07453

🧵Here’s what we found:

28.05.2025 10:03 — 👍 2 🔁 1 💬 1 📌 0

Open positions | TRL Lab

Eager to contribute to democratizing insights from tabular data? We have 2 new PhD openings! ✨

1) Fundamental Techniques in Table Representation Learning
2) Reliable AI-powered Tabular Data Analysis Systems

⏰ Apply by: 30 June 2025
📅 Start: Fall/Winter 2025
🔗 Info: trl-lab.github.io/open-positions

22.05.2025 18:56 — 👍 4 🔁 3 💬 0 📌 0

Open positions | TRL Lab

Eager to contribute to democratizing insights from tabular data? We have 2 new PhD openings! ✨

1) Fundamental Techniques in Table Representation Learning
2) Reliable AI-powered Tabular Data Analysis Systems

⏰ Apply by: 30 June 2025
📅 Start: Fall/Winter 2025
🔗 Info: trl-lab.github.io/open-positions

22.05.2025 18:56 — 👍 4 🔁 3 💬 0 📌 0

This is really cool!! 👏

24.04.2025 06:25 — 👍 0 🔁 0 💬 0 📌 0

TRL reading group | TRL Lab

Curious to hear other thoughts on this nice survey. We’ll continue doing this going forward, and everyone is welcome to join: trl-lab.github.io/trl-reading-.... Also for the monthly seminar btw ✨

18.04.2025 10:21 — 👍 3 🔁 0 💬 0 📌 0

TRL reading group discussion: "Large Language Models(LLMs) on Tabular Data: Prediction, Generation, and Understanding - A Survey" | TRL Lab Discussion protocol and results of the TRL reading group on the paper

We’re running a TRL reading group, and captured some reflections on the survey “Large Language Models on Tabular Data: Prediction, Generation, and Understanding“: trl-lab.github.io/blog/2025/re... by @daniel-gomm.bsky.social @effylix.bsky.social @zeyuzhang.bsky.social @cowolff.bsky.social and me!

18.04.2025 10:19 — 👍 8 🔁 1 💬 1 📌 0

We're currently finalizing some quite detailed insights on this (can share the paper next week, and will send some plots on DM). TLDR is that an LLM-as-a-judge is best, for now, for evaluating free-form text against precise answers. Methods like sacrebleu and bert-score do not give a clear signal...

17.04.2025 14:02 — 👍 2 🔁 0 💬 1 📌 1

This is also an issue for evaluation.. one approach could be to make a second LLM call to extract the concise answer / value from generated text.

17.04.2025 13:52 — 👍 1 🔁 0 💬 1 📌 0

This talk is today, 4-5pm UvA lab42 Amsterdam and Zoom. Looking forward 🤩

11.04.2025 10:21 — 👍 1 🔁 0 💬 0 📌 0

Good news for authors of submissions to the Table Representation Learning workshop @ ACL: we're extending the deadline a little bit. Papers are due 21 April. No exceptions from there :).

08.04.2025 09:55 — 👍 1 🔁 1 💬 0 📌 0

Details about the seminar talk titled TabICL: A Tabular Foundation Model for In-Context Learning on Large Data by Marine Le Morvan

Excited to share the new monthly Table Representation Learning (TRL) Seminar under the ELLIS Amsterdam TRL research theme! To recur every 2nd Friday.

Who: Marine Le Morvan, Inria (in-person)
When: Friday 11 April 4-5pm (+drinks)
Where: L3.36 Lab42 Science Park / Zoom

trl-lab.github.io/trl-seminar/

02.04.2025 09:42 — 👍 12 🔁 3 💬 0 📌 1

Madelon Hulsebos

Latest posts by madelonhulsebos.bsky.social on Bluesky

@madelonhulsebos is following 20 prominent accounts