David M. Schmidt

@dmschmidt.bsky.social

#NLProc PhD Student & Research Associate at Bielefeld University. Working on: Question Answering over Linked Data, Semantic Web, Lexical Knowledge & Compositionality in AI. https://davidmschmidt.de

102 Followers  |  673 Following  |  25 Posts  |  Joined: 16.12.2023  |  2.2378

Latest posts by dmschmidt.bsky.social on Bluesky

I am incredibly happy to share that our paper "CompoST: A Benchmark for Analyzing the Ability of LLMs To Compositionally Interpret Questions in a QALD Setting" has been accepted as a research track paper at ISWC @iswc-conf.bsky.social! Stay tuned for the paper and see you all in Japan!

18.07.2025 18:03 | 👍 2    🔁 1    💬 0    📌 0

- and listened to keynotes by well-known researchers like Frank van Harmelen, Natasha Noy and Enrico Motta.
A huge thanks to everyone who made this week such a memorable experience! And if you are a Master's/PhD student or postdoc, I cannot recommend enough that you apply for the next iteration of #ISWS!

16.06.2025 14:27 | 👍 0    🔁 0    💬 0    📌 0

- worked in a research task force on building a reliable LLM-based metadata enrichment pipeline for cultural heritage objects (special thanks to our tutor Valentina Presutti and our whole team), as well as writing a corresponding white paper and presenting our results in the final session

16.06.2025 14:26 | 👍 0    🔁 0    💬 1    📌 0

During the last week, among many other things, I
- summarized the motivation of my work in a 45s "Minute Madness" session
- presented my work during a poster session, getting helpful feedback from students and tutors (special thanks to Aidan Hogan and Stefano De Giorgis)

16.06.2025 14:26 | 👍 0    🔁 0    💬 1    📌 0

The #ISWS2025 experience really managed to combine lots of fun activities, working with leading figures of the Semantic Web field, and intense networking in a unique, wonderful way! It felt like a month's worth of program items had been compressed into one magnificent piece of art.

16.06.2025 14:25 | 👍 0    🔁 0    💬 1    📌 0
International Semantic Web Research Summer School 2025 group photo

David M. Schmidt presenting the group's work on AI-based cultural heritage metadata enrichment

David M. Schmidt presenting his poster on NeoDUDES, a compositional Question Answering system using DUDES

David M. Schmidt in front of the beautiful landscape of Bertinoro

What a week! I just had the incredible opportunity to attend the International Semantic Web Research Summer School 2025 @isws-summerschool.bsky.social. I hoped for an intense week filled with inspiring keynotes, people and opportunities to present my work - and I got so much more than "just" that!

16.06.2025 14:24 | 👍 1    🔁 0    💬 1    📌 0

Social media is acknowledged as an important source of patient experience data to learn about patients' unmet needs, priorities, and preferences. The objective of this study was to evaluate to what extent SOTA LLMs can appropriately summarize posts shared by patients in web-based forums.

15.04.2025 11:07 | 👍 0    🔁 0    💬 0    📌 0
Summarizing Online Patient Conversations Using Generative Language Models: Experimental and Comparative Study
Background: Social media is acknowledged by regulatory bodies (eg, the Food and Drug Administration) as an important source of patient experience data to learn about patients' unmet needs, priorities,...

🎓 Authors: Rakhi Asokkumar Subjagouri Nair, Matthias Hartung, Philipp Heinisch, Janik Jaskolski, Cornelius Starke-Knäusel, Susana Veríssimo, David M. Schmidt, Philipp Cimiano

🔗 Paper: doi.org/10.2196/62909

15.04.2025 11:05 | 👍 0    🔁 0    💬 1    📌 0

🚀 New paper! 🚀

I am happy to announce our paper "Summarizing Online Patient Conversations Using Generative Language Models: Experimental and Comparative Study," which has just been published in JMIR Medical Informatics!

15.04.2025 11:05 | 👍 2    🔁 1    💬 1    📌 0
Research Position - Text Generation/Natural Langua...
The Faculty of Engineering at Bielefeld University is looking for a research assistant to work on th...

NLP/Text Generation
EN: uni-bielefeld.hr4you.org/job/view/4054
DE: uni-bielefeld.hr4you.org/job/view/4053

NLP/Information Extraction
EN: uni-bielefeld.hr4you.org/job/view/4059
DE: uni-bielefeld.hr4you.org/job/view/4057

If you have any questions, do not hesitate to contact me or Philipp directly!

06.03.2025 14:40 | 👍 3    🔁 0    💬 0    📌 0

We currently have two fully-funded open PhD positions in our group with a focus on #NLProc, #InformationExtraction and #TextGeneration. I can really recommend both the group as well as Philipp Cimiano as a supervisor, so take this opportunity!

06.03.2025 14:40 | 👍 2    🔁 0    💬 1    📌 0

🚀 We are #hiring! Are you interested in Natural Language Processing, Text Generation or Information Extraction and want to pursue a PhD? Then you now have the chance to become a part of the Semantic Computing Group at Bielefeld University!

Application Deadline: 20.03.2025

06.03.2025 14:39 | 👍 2    🔁 0    💬 1    📌 0
Spring School 2025: Innovating AI Evaluation – Beyond Accuracy and Precision - SAIL
Join us for the SAIL Spring School 2025! We are excited to invite you to the SAIL Spring School 2025, taking place March 26-28 at the CITEC lecture hall at Bielefeld University! We have put together a...

📣 Spring School 2025: Innovating AI Evaluation – Beyond Accuracy and Precision

📅 March 26–28
📍 CITEC, Bielefeld University

Join us for an exciting line-up of tutorials, discussions, and networking opportunities! 🎓

➡️ More info & program: www.sail.nrw/springschool/

16.01.2025 12:33 | 👍 1    🔁 1    💬 0    📌 0

💡 Interested? Try it yourself!

Tool: ag-sc.techfak.uni-bielefeld.de/ctvis/

03.02.2025 12:49 | 👍 0    🔁 0    💬 0    📌 0

For selecting clinical trials to be compared in systematic reviews, it is important they measure the same outcomes. Therefore, we developed a tool that provides an overview of the clinical trial information about glaucoma and type 2 diabetes and enables users to group them by outcomes.

03.02.2025 12:49 | 👍 0    🔁 0    💬 1    📌 0
Open challenges for the automatic synthesis of clinical trials - BMC Research Notes
Objective: An important criterion for selecting clinical trials to be compared in systematic reviews and meta-analyses is that they measure the same outcomes. However, this represents a challenge as th...

🚀 New month, new paper! 🚀

Our paper "Open challenges for the automatic synthesis of clinical trials" has been published at BMC Research Notes!

🎓 Authors: Olivia Sánchez Graillet, David M. Schmidt, Christian Kullik and Philipp Cimiano

🔗 Paper: doi.org/10.1186/s131...

03.02.2025 12:42 | 👍 2    🔁 1    💬 1    📌 0

💡 Interested? Try it yourself!

Zenodo artifact: doi.org/10.5281/zeno...

GitHub repository: github.com/ag-sc/clinic...

08.01.2025 16:44 | 👍 0    🔁 0    💬 0    📌 0

In this work, we investigate the influence of grammar-constrained decoding (GCD) as well as pointer generators (PG) on the performance of a domain-specific information extraction (IE) system. We investigate whether adding GCD and PG improves the IE results of fine-tuned encoder-decoder models.

08.01.2025 16:43 | 👍 0    🔁 0    💬 1    📌 0
Illustration of the baseline model as well as the two adjustments added to that baseline, grammar-constrained decoding and pointer generator-like behavior. Words in boxes represent single tokens, numbers below those boxes symbolize outputs from the decoder, where higher values stand for a higher probability that this is the best next token as estimated by the model. For greedy decoding, the token with the highest value is chosen. For GCD, a filter is applied before, visualized as gray, crossed-out boxes for tokens that are filtered out. Red boxes show the selected token. (A) Greedy decoding (baseline, basic). (B) Grammar-constrained decoding (GCD). (C) Pointer generators + grammar-constrained decoding (ptr).

🚀 New year, new paper! 🚀

Proud to share that our paper "Grammar-constrained decoding for structured information extraction with fine-tuned generative models applied to clinical trial abstracts" has been published in Frontiers in Artificial Intelligence!

🔗 doi.org/10.3389/frai...

08.01.2025 16:39 | 👍 2    🔁 1    💬 1    📌 0
ag-sc/neodudes: v1.1.2 Mentioning results folder in root README now.

🧑‍💻 Additionally, you can find the code and data of our approach on Zenodo, GitHub and DockerHub:
Zenodo artifact: doi.org/10.5281/zeno...
GitHub repository: github.com/ag-sc/neodud...
DockerHub image: hub.docker.com/r/dvs23/neod...

28.11.2024 16:51 | 👍 0    🔁 0    💬 0    📌 0
Lexicalization Is All You Need: Examining the Impact of Lexical Knowledge in a Compositional QALD System
In this paper, we examine the impact of lexicalization on Question Answering over Linked Data (QALD). It is well known that one of the key challenges in interpreting natural language questions with re...

💡 Missed the talk or want to know more? You can find our paper here:
doi.org/10.1007/978-...
Preprint: doi.org/10.48550/arX...

28.11.2024 16:50 | 👍 0    🔁 0    💬 1    📌 0

At the main conference, I presented our paper "Lexicalization Is All You Need: Examining the Impact of Lexical Knowledge in a Compositional QALD System" as well as an accompanying poster and demo illustrating the strengths of our lexicon-based, compositional question answering approach.

28.11.2024 16:48 | 👍 0    🔁 0    💬 1    📌 0
David M. Schmidt giving a talk on the paper "Lexicalization Is All You Need: Examining the Impact of Lexical Knowledge in a Compositional QALD System" at the 24th International Conference on Knowledge Engineering and Knowledge Management in Amsterdam.

David M. Schmidt presenting a poster and a demo on "Lexicalization Is All You Need: Examining the Impact of Lexical Knowledge in a Compositional QALD System" at the 24th International Conference on Knowledge Engineering and Knowledge Management in Amsterdam.

It has been an exciting week at EKAW 2024 in Amsterdam! Lots of interesting talks, inspiring discussions and entertaining social events! #ekaw2024

28.11.2024 16:47 | 👍 3    🔁 1    💬 1    📌 0

Time for a starter pack on information retrieval: go.bsky.app/MXPJoTn

14.11.2024 20:57 | 👍 44    🔁 19    💬 17    📌 2

A starter pack for #NLP #NLProc researchers! 🎉

go.bsky.app/SngwGeS

04.11.2024 10:01 | 👍 254    🔁 101    💬 45    📌 14
17.11.2024 20:41 | 👍 2    🔁 1    💬 0    📌 0

Hey all! I started a second starter pack with people who didn't make the first one, please let me know if you'd like to be added:

go.bsky.app/JgneRQk

13.11.2024 00:15 | 👍 66    🔁 32    💬 70    📌 8

💬 In addition to the paper presentation at EKAW, the International Conference on Knowledge Engineering and Knowledge Management, we will also take part in the poster session. So drop by if you want to discuss future avenues of question answering research!

17.11.2024 22:39 | 👍 0    🔁 0    💬 0    📌 0
ag-sc/neodudes: v1.1.2 Mentioning results folder in root README now.

💡 Interested? Try it yourself!

Zenodo artifact: doi.org/10.5281/zeno...
GitHub repository: github.com/ag-sc/neodud...
DockerHub image: hub.docker.com/r/dvs23/neod...

17.11.2024 22:39 | 👍 0    🔁 0    💬 1    📌 0
Lexicalization Is All You Need: Examining the Impact of Lexical Knowledge in a Compositional QALD System
In this paper, we examine the impact of lexicalization on Question Answering over Linked Data (QALD). It is well known that one of the key challenges in interpreting natural language questions with re...

🚨 Thrilled to announce a new preprint!

πŸ“ Title: Lexicalization Is All You Need: Examining the Impact of Lexical Knowledge in a Compositional QALD System
πŸ‘©β€πŸŽ“πŸ‘¨β€πŸŽ“ Authors: David M. Schmidt, Mohammad Fazleh Elahi, Philipp Cimiano
πŸ”— Preprint: arxiv.org/abs/2411.03906

17.11.2024 22:37 | 👍 1    🔁 0    💬 1    📌 0
