Announcing our poster session at COLM 2025:
On the Effectiveness and Generalization of Race Representations for Debiasing High-Stakes Decisions
I will talk about biases in LLMs and how to mitigate them. Come say hi!
Poster #43, 4:30 PM
Excited that we are having the first talk in the AI & Scientific Discovery online seminar on Friday at 12pm ET/11am CT/9am PT by the awesome Lei Li from CMU!
🧪 Generative AI for Functional Protein Design
#artificialintelligence #scientificdiscovery
ai-scientific-discovery.github.io
Home-grown at CHAI and @uchicagoci.bsky.social!! The first ever AI-driven game from academia. Give it a go and let us know your rank on the leaderboard!
We're thrilled to announce the upcoming AI & Scientific Discovery online seminar! We have an amazing lineup of speakers.
This series will dive into how AI is accelerating research, enabling breakthroughs, and shaping the future of research across disciplines.
ai-scientific-discovery.github.io
As AI becomes increasingly capable of conducting analyses and following instructions, my prediction is that the role of scientists will increasingly focus on identifying and selecting important problems to work on ("selector"), and effectively evaluating analyses performed by AI ("evaluator").
16.09.2025 15:07
We are proposing the second workshop on AI & Scientific Discovery at EACL/ACL. The workshop will explore how AI can advance scientific discovery. Please use this Google form to indicate your interest (corrected link):
forms.gle/MFcdKYnckNno...
More in the 🧵! Please share! #MLSky
⚡️ Ever asked an LLM-as-Marilyn Monroe about the 2020 election? Our paper calls this concept incongruence, common in both AI and how humans create and reason.
Read my blog to learn what we found, why it matters for AI safety and creativity, and what's next: cichicago.substack.com/p/concept-in...
Prompting is our most successful tool for exploring LLMs, but the term evokes eye-rolls and grimaces from scientists. Why? Because prompting as scientific inquiry has become conflated with prompt engineering.
This is holding us back. 🧵 and new paper with @ari-holtzman.bsky.social.
A first small update: vllm had prevented the package from being installed on Mac. Now you can `pip install hypogenic` on Mac and generate hypotheses with APIs from your laptop.
09.07.2025 13:50
We are making some exciting updates to hypogenic this summer: github.com/ChicagoHAI/h... and will post updates here.
09.07.2025 13:50
When you walk into the ER, you could get a doc:
1. Fresh from a week of not working
2. Tired from working too many shifts
@oziadias.bsky.social has been both and thinks that they're different! But can you tell from their notes? Yes we can! Paper @natcomms.nature.com www.nature.com/articles/s41...
@chachachen.bsky.social @haokunliu.bsky.social @divingwithorcas.bsky.social present posters on human-AI decision making, hypothesis generation, interpretability and fairness at MMLS 2025!
24.06.2025 20:07
This is too cute not to share!
28.05.2025 13:28
I am glad that you found our paper entertaining! This is a great point for my follow-up thread on the implications of concept incongruence. Our main goal is to raise awareness and provide clarity around concept incongruence.
28.05.2025 12:56
New paper alert
Ever asked an LLM-as-Marilyn Monroe who the US president was in 2000? Should the LLM answer at all? We call these clashes Concept Incongruence. Read on! ⬇️
1/n 🧵
1/n Thrilled to share our latest work: HypoEval - Hypothesis-Guided Evaluation for Natural Language Generation!
There's a lot of excitement around using LLMs for automated evaluation, but many methods fall short on alignment or explainability. Let's dive in!
🧑‍⚖️ How well can LLMs summarize complex legal documents? And can we use LLMs to evaluate?
Excited to be in Albuquerque presenting our paper this afternoon at @naaclmeeting 2025!
Although I cannot make #NAACL2025, @chicagohai.bsky.social will be there. Please say hi!
@chachachen.bsky.social GPT and x-rays (Friday 9-10:30)
@mheddaya.bsky.social CaseSumm and LLM 🧑‍⚖️ (Thursday 2-3:30)
@haokunliu.bsky.social @qiaoyu-rosa.bsky.social hypothesis generation (Saturday at 4pm)
1/n
You may know that large language models (LLMs) can be biased in their decision-making, but ever wondered how those biases are encoded internally and whether we can surgically remove them?
Spent a great day at Boulder meeting new students and old colleagues. I used to take this view every day.
Here are the slides for my talk titled "Alignment Beyond Human Preferences: Use Human Goals to Guide AI towards Complementary AI": chenhaot.com/talks/alignm...