Fabian David Schmidt's Avatar

Fabian David Schmidt

@fdschmidt.bsky.social

PhD candidate at Uni of Wรผrzburg working on multilinguality & multimodality | prev. visited visit Mila & LTL@UniCambridge https://fdschmidt93.github.io

187 Followers  |  54 Following  |  5 Posts  |  Joined: 20.11.2024  |  1.7503

Latest posts by fdschmidt.bsky.social on Bluesky

Only 10 days left to submit your work to our International Workshop on News Recommendation and Analytics! ๐Ÿš€

โ–ถ๏ธ More details: research.idi.ntnu.no/NewsTech/INR...

๐Ÿ“† Submission deadline: July 17th, 2025 AoE

๐Ÿ“ Event co-located with @recsys.bsky.social
in Prague on September 26th (tentative)!

07.07.2025 11:51 โ€” ๐Ÿ‘ 4    ๐Ÿ” 5    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 1
Post image

๐Ÿ“ข Introducing Walk&Retrieve, a simple yet effective zero-shot #RAG framework based on #knowledgegraph walks!

Arxiv : arxiv.org/abs/2505.16849
GitHub: github.com/MartinBoeckl...

Joint work w/ @martinboeckling.bsky.social @heikopaulheim.bsky.social

Details ๐Ÿ‘‡

23.05.2025 12:48 โ€” ๐Ÿ‘ 6    ๐Ÿ” 4    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Title slide: Processing Trans Languaging - Vagrant Gautam (they/xe), Saarland University, with a very brightly patterned background featuring colourful people and math symbols.

Title slide: Processing Trans Languaging - Vagrant Gautam (they/xe), Saarland University, with a very brightly patterned background featuring colourful people and math symbols.

Come to my keynote tomorrow at the first official @queerinai.com workshop at #NAACL2025 to hear about how trans languaging is complex and cool, and how this makes it extra difficult to process computationally. I will have SO many juicy examples!

03.05.2025 20:52 โ€” ๐Ÿ‘ 44    ๐Ÿ” 14    ๐Ÿ’ฌ 3    ๐Ÿ“Œ 0
Diagram illustrating a hypothesis about knowledge unlearning in language models. The left side shows a training corpus with varying frequencies of facts, such as 'Montreal is a city in Quebec' (high frequency) and 'Atlantis is a city in the ocean' (lower frequency). The center shows a language model being trained on this data, then undergoing unlearning. The right side demonstrates the 'Forget Quality' results, where the model more effectively unlearns the less frequent fact ('Atlantis is in Greece') while retaining the more frequent knowledge. Labels A, B, and C mark key points in the hypothesis: A (frequency variations in training data), B (influence of frequency), and C (unlearning effectiveness).

Diagram illustrating a hypothesis about knowledge unlearning in language models. The left side shows a training corpus with varying frequencies of facts, such as 'Montreal is a city in Quebec' (high frequency) and 'Atlantis is a city in the ocean' (lower frequency). The center shows a language model being trained on this data, then undergoing unlearning. The right side demonstrates the 'Forget Quality' results, where the model more effectively unlearns the less frequent fact ('Atlantis is in Greece') while retaining the more frequent knowledge. Labels A, B, and C mark key points in the hypothesis: A (frequency variations in training data), B (influence of frequency), and C (unlearning effectiveness).

Check out our new paper on unlearning for LLMs ๐Ÿค–. We show that *not all data are unlearned equally* and argue that future work on LLM unlearning should take properties of the data to be unlearned into account. This work was lead by my intern @a-krishnan.bsky.social
๐Ÿ”—: arxiv.org/abs/2504.05058

09.04.2025 13:30 โ€” ๐Ÿ‘ 32    ๐Ÿ” 5    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1
Post image

๐Ÿ“ฃ Call for Papers is out! ๐Ÿ“ฃ

Working on #news #recsys & their societal, legal, and ethical dimensions?

๐Ÿ‘‰ Submit to the 13th INRA workshop, co-located w/ @recsys.bsky.social in Prague!

๐Ÿ“… Paper deadline: ** July 17th, 2025 **

More info: research.idi.ntnu.no/NewsTech/INR...

#INRA2025 #RecSys2025

05.05.2025 12:37 โ€” ๐Ÿ‘ 3    ๐Ÿ” 5    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 1

Hello! INRA is a forum for researchers and practitioners to discuss technical innovations, societal, ethical, and legal aspects of news recommendation and analytics.

The upcoming 13th edition of our workshop will be co-located w/ @recsys.bsky.social in Prague.

Stay tuned to this channel!

02.05.2025 08:14 โ€” ๐Ÿ‘ 3    ๐Ÿ” 5    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Joint work with Florian Schneider, Chris Biemann, and @gglavas.bsky.social

My first paper on multilingual vision-language, and couldn't be happier how this work turned out!๐Ÿ™‚

21.02.2025 07:45 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 1
Post image

Cross-modal topic matching correlates well with other multilingual vision-language tasks!

๐Ÿค—Images-To-Sentence (given Images, select topically fitting sentence) & Sentences-To-Image (given Sentences, pick topically matching image) probe complementary aspects in VLU

21.02.2025 07:45 โ€” ๐Ÿ‘ 2    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

X-modal to text-only perf. *gap* shows that VL support decreases from high to low-resource language tiers:

Images/Topicโ†’Sentence (for I/T, pick S): narrows with less textual support (left)
Sentencesโ†’Image/Topic (for S, pick I/T): increases with less VL support worse (right)

21.02.2025 07:45 โ€” ๐Ÿ‘ 1    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

Strong vision-language models (VLMs) like GPT-4o-mini maintain good performance for top-150 languages, only to drop to performing no better than chance for the lowest resource languages!

21.02.2025 07:45 โ€” ๐Ÿ‘ 1    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

Introducing MVL-SIB, a massively multilingual vision-language benchmark for cross-modal topic matching in 205 languages!

๐Ÿค”Tasks: Given images (sentences), select topically matching sentence (image).

Arxiv: arxiv.org/abs/2502.12852
HF: huggingface.co/datasets/Wue...

Details๐Ÿ‘‡

21.02.2025 07:45 โ€” ๐Ÿ‘ 4    ๐Ÿ” 5    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
International Association for Safe & Ethical AI Conference โ€” IASEAI 2025 The International Association for Safe and Ethical AI will host its inaugural conference (IASEAI โ€˜25) on Feb 6-7, 2025 at the OECD La Muette Headquarters and Conference Centre in Paris, ahead of the P...

Excited to present today a poster at @OECD in Paris @IASEAIorg based on our upcoming paper "Societal Alignment Frameworks Can Improve LLM Alignment" (stay tuned for the pre-print soon!๐ŸŽŠ). Today (Fri) at 1pm CET. Conference livestream: iaseai.org/conference

07.02.2025 09:13 โ€” ๐Ÿ‘ 8    ๐Ÿ” 2    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
News Without Borders: Domain Adaptation of Multilingual Sentence Embeddings for Cross-lingual News Recommendation Rapidly growing numbers of multilingual news consumers pose an increasing challenge to news recommender systems in terms of providing customized recommendations. First, existing neural news recommende...

โš ๏ธStruggling with multilingual news recommendation?

We introduce NaSE, a news-adapted sentence encoder!๐Ÿ™Œ
โœ…No costly fine-tuning needed
โœ…Perfect for cold-start & few-shot scenarios

Read our ECIR 2025 ๐Ÿ“ฐ: arxiv.org/abs/2406.12634
Try it out @hf.co ๐Ÿค—: huggingface.co/aiana94/NaSE

20.01.2025 12:21 โ€” ๐Ÿ‘ 21    ๐Ÿ” 2    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1

I'm making an unofficial starter pack with some of my colleagues at Mila. WIP for now but here's the link!

go.bsky.app/BHKxoss

20.11.2024 15:19 โ€” ๐Ÿ‘ 69    ๐Ÿ” 29    ๐Ÿ’ฌ 7    ๐Ÿ“Œ 1

@fdschmidt is following 20 prominent accounts