Madelon Hulsebos's Avatar

Madelon Hulsebos

@madelonhulsebos.bsky.social

Faculty at CWI & ELLIS Amsterdam https://trl-lab.github.io. Prev at UC Berkeley and the University of Amsterdam. Research on AI and tabular data to democratize insights from structured data. https://www.madelonhulsebos.com

1,349 Followers  |  486 Following  |  82 Posts  |  Joined: 28.10.2024  |  2.3538

Latest posts by madelonhulsebos.bsky.social on Bluesky

I'm hiring PhDs through the ELLIS PhD program for research on table representation learning to democratize insights from tabular data!

Info: trl-lab.github.io/open-positio...
Apply: apply.ellis.eu (31 Oct)

Join us in beautiful Amsterdam at @cwi-amsterdam.bsky.social & @ellisamsterdam.bsky.social โœจ

01.10.2025 18:48 โ€” ๐Ÿ‘ 5    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

This exciting workshop is organized by @effylix.bsky.social, @lennartpurucker.bsky.social, Peter Baile Chen, Frank Hutter and me.

With talks by Marine Le Morvan (Inria), Floris Geerts (University of Antwerp) and @akhtarmubashara.bsky.social (ETH Zurich), and more TBA.

We hope to meet you there!!

23.09.2025 10:44 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Announcement for the EurIPS'25 Workshop on AI for Tabular Data. More information about the workshop can be found at: https://sites.google.com/view/eurips25-ai-td/home.

Announcement for the EurIPS'25 Workshop on AI for Tabular Data. More information about the workshop can be found at: https://sites.google.com/view/eurips25-ai-td/home.

๐Ÿš€ Excited to announce the AI for Tabular Data workshop at EurIPS 2025 in Copenhagen!

CfP: sites.google.com/view/eurips2... (papers due 20 Oct)

Join us @euripsconf.bsky.social to discuss neural tabular models and systems for predictive ML, tabular reasoning and retrieval, table synthesis and more โœจ

23.09.2025 10:44 โ€” ๐Ÿ‘ 7    ๐Ÿ” 3    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1
VLDB 2025 - Industrial Track Papers List of accepted industrial track papers.

Also hope attendees will enjoy the tutorial program, which I helped compile this year, with exciting tutorials on topics such as vector search, data discovery, graph databases, AI and relational data! Full list at vldb.org/2025/?papers...

01.09.2025 11:05 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Off to London ๐Ÿ‡ฌ๐Ÿ‡ง for @vldb.bsky.social!

Looking forward to the panel discussion on neural models and tabular data Tue 10:45 ๐Ÿ”ฅ. Iโ€™ll share my thoughts on the state of affairs, promising paradigms, and future directions for TRL.

Will share my input and some take-aways of the discussions post conf.

01.09.2025 11:05 โ€” ๐Ÿ‘ 4    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

We show that metrics like sacrebleu and bertscore aren't fit for tabular QA eval of LLMs as scores are inseparable. We also find a large gap between multiple-choice eval as in TQA-Bench and an LLM-judge which aligns with human annotation.

Bottomline: LLMs aren't robust for real-world multi-table QA

30.07.2025 22:24 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
TRL Lab

In our contribution led by @cowolff.bsky.social from the TRL Lab (trl-lab.github.io) we wondered: how well do LLMs reason over tabular data, really? We find that LLMs don't ack nor handle tables with disturbances (eg missing vals or duplicates) necessitating explicit prompting or cleaning pipelines.

30.07.2025 22:24 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
How well do LLMs reason over tabular data, really? Large Language Models (LLMs) excel in natural language tasks, but less is known about their reasoning capabilities over tabular data. Prior analyses devise evaluation strategies that poorly reflect an...

The full program of the workshop is at: table-representation-learning.github.io/ACL2025/. Besides excellent contributed work (proceedings: aclanthology.org/2025.trl-1.0...), we'll have invited talks by Dan Roth, Tao Yu, Edward Choi and Julian Eisenschlos!

30.07.2025 22:24 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Excited to be at ACL! Join us at the Table Representation Learning workshop tomorrow in room 2.15 to talk about tables and AI.

We also present a paper showing the sensitivity of LLMs in tabular reasoning to e.g. missing vals and duplicates, by @cowolff.bsky.social at 16:50: arxiv.org/abs/2505.07453

30.07.2025 22:24 โ€” ๐Ÿ‘ 6    ๐Ÿ” 3    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

The paper's called:
"How well do LLMs reason over tabular data, really?" ๐Ÿ“Š

We dig into two important questions:
1๏ธโƒฃ Are general-purpose LLMs robust with real-world tables?
2๏ธโƒฃ How should we actually evaluate them? (2/4)

25.07.2025 15:06 โ€” ๐Ÿ‘ 1    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Very excited about this! Aligns well with my ๐Ÿ‡ช๐Ÿ‡บ focus this year.

17.07.2025 10:43 โ€” ๐Ÿ‘ 5    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Proceedings of the Workshop on Data Management for End-to-End Machine Learning | ACM Conferences

Join us for discussions and talks on data management aspects for end-to-end ML on 27 June at @deem-workshop.bsky.social in Berlin. Keynotes by @pinartozun.bsky.social and @gaelvaroquaux.bsky.social ๐Ÿคฉ

Check the full schedule deem-workshop.github.io#schedule & proceedings dl.acm.org/doi/proceedi...

16.06.2025 07:49 โ€” ๐Ÿ‘ 8    ๐Ÿ” 5    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Very interesting! Iโ€™ve seen synthetic data, with certain patterns (eg TabPFN with SCMs), come real far, but not entirely random compositions of tokens. Do you understand or have an hypothesis for what leads to this observation?

26.06.2025 11:17 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

๐ŸšจWhat is SOTA on tabular data, really? We are excited to announce ๐—ง๐—ฎ๐—ฏ๐—”๐—ฟ๐—ฒ๐—ป๐—ฎ, a living benchmark for machine learning on IID tabular data with:

๐Ÿ“Š an online leaderboard (submit!)
๐Ÿ“‘ carefully curated datasets
๐Ÿ“ˆ strong tree-based, deep learning, and foundation models

๐Ÿงต

23.06.2025 10:14 โ€” ๐Ÿ‘ 13    ๐Ÿ” 8    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Letโ€™s collaborate on democratizing insights from tabular data in Amsterdam! โœจ

PhD directions: 1) fundamental techniques for tabular foundation models, 2) reliable mechanisms for AI-powered tabular data analysis.

Sharing w/ friends appreciated! โฌ‡๏ธ

05.06.2025 15:36 โ€” ๐Ÿ‘ 2    ๐Ÿ” 3    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Proceedings of the Workshop on Data Management for End-to-End Machine Learning | ACM Conferences

Join us for discussions and talks on data management aspects for end-to-end ML on 27 June at @deem-workshop.bsky.social in Berlin. Keynotes by @pinartozun.bsky.social and @gaelvaroquaux.bsky.social ๐Ÿคฉ

Check the full schedule deem-workshop.github.io#schedule & proceedings dl.acm.org/doi/proceedi...

16.06.2025 07:49 โ€” ๐Ÿ‘ 8    ๐Ÿ” 5    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Absolutely! Please send me an email then we can arrange a chat :)

09.06.2025 06:23 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Letโ€™s collaborate on democratizing insights from tabular data in Amsterdam! โœจ

PhD directions: 1) fundamental techniques for tabular foundation models, 2) reliable mechanisms for AI-powered tabular data analysis.

Sharing w/ friends appreciated! โฌ‡๏ธ

05.06.2025 15:36 โ€” ๐Ÿ‘ 2    ๐Ÿ” 3    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Open positions | TRL Lab

๐Ÿน Job alert: 2 fully-funded PhD Positions at Table Representation Learning Lab - @ellisamsterdam.bsky.socialโ€ฌ

๐Ÿ“ Amsterdam ๐Ÿ‡ณ๐Ÿ‡ฑ
๐Ÿ“… Apply by June 30
๐Ÿ”— More info: https://bit.ly/4519pj1

05.06.2025 15:00 โ€” ๐Ÿ‘ 1    ๐Ÿ” 3    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 1

"Can LLMs really reason over tabular data, really?"
Thatโ€™s the title and central question of my first paper in my new role as a PhD student, which has been accepted to the 4th Table Representation Learning Workshop @ ACL 2025! arxiv.org/pdf/2505.07453

๐ŸงตHereโ€™s what we found:

28.05.2025 10:03 โ€” ๐Ÿ‘ 2    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Open positions | TRL Lab

Eager to contribute to democratizing insights from tabular data? We have 2 new PhD openings! โœจ

1) Fundamental Techniques in Table Representation Learning
2) Reliable AI-powered Tabular Data Analysis Systems

โฐ Apply by: 30 June 2025
๐Ÿ“… Start: Fall/Winter 2025
๐Ÿ”— Info: trl-lab.github.io/open-positions

22.05.2025 18:56 โ€” ๐Ÿ‘ 4    ๐Ÿ” 3    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Open positions | TRL Lab

Eager to contribute to democratizing insights from tabular data? We have 2 new PhD openings! โœจ

1) Fundamental Techniques in Table Representation Learning
2) Reliable AI-powered Tabular Data Analysis Systems

โฐ Apply by: 30 June 2025
๐Ÿ“… Start: Fall/Winter 2025
๐Ÿ”— Info: trl-lab.github.io/open-positions

22.05.2025 18:56 โ€” ๐Ÿ‘ 4    ๐Ÿ” 3    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

This is really cool!! ๐Ÿ‘

24.04.2025 06:25 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
TRL reading group | TRL Lab

Curious to hear other thoughts on this nice survey. Weโ€™ll continue doing this going forward, and everyone is welcome to join: trl-lab.github.io/trl-reading-.... Also for the monthly seminar btw โœจ

18.04.2025 10:21 โ€” ๐Ÿ‘ 3    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
TRL reading group discussion: "Large Language Models(LLMs) on Tabular Data: Prediction, Generation, and Understanding - A Survey" | TRL Lab Discussion protocol and results of the TRL reading group on the paper

Weโ€™re running a TRL reading group, and captured some reflections on the survey โ€œLarge Language Models on Tabular Data: Prediction, Generation, and Understandingโ€œ: trl-lab.github.io/blog/2025/re... by @daniel-gomm.bsky.social @effylix.bsky.social @zeyuzhang.bsky.social @cowolff.bsky.social and me!

18.04.2025 10:19 โ€” ๐Ÿ‘ 8    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

We're currently finalizing some quite detailed insights on this (can share the paper next week, and will send some plots on DM). TLDR is that an LLM-as-a-judge is best, for now, for evaluating free-form text against precise answers. Methods like sacrebleu and bert-score do not give a clear signal...

17.04.2025 14:02 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1

This is also an issue for evaluation.. one approach could be to make a second LLM call to extract the concise answer / value from generated text.

17.04.2025 13:52 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

This talk is today, 4-5pm UvA lab42 Amsterdam and Zoom. Looking forward ๐Ÿคฉ

11.04.2025 10:21 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Good news for authors of submissions to the Table Representation Learning workshop @ ACL: we're extending the deadline a little bit. Papers are due 21 April. No exceptions from there :).

08.04.2025 09:55 โ€” ๐Ÿ‘ 1    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Details about the seminar talk titled TabICL: A Tabular Foundation Model for In-Context Learning on Large Data by Marine Le Morvan

Details about the seminar talk titled TabICL: A Tabular Foundation Model for In-Context Learning on Large Data by Marine Le Morvan

Excited to share the new monthly Table Representation Learning (TRL) Seminar under the ELLIS Amsterdam TRL research theme! To recur every 2nd Friday.

Who: Marine Le Morvan, Inria (in-person)
When: Friday 11 April 4-5pm (+drinks)
Where: L3.36 Lab42 Science Park / Zoom

trl-lab.github.io/trl-seminar/

02.04.2025 09:42 โ€” ๐Ÿ‘ 12    ๐Ÿ” 3    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 1

@madelonhulsebos is following 20 prominent accounts