π¨ Introducing CausalPFN, a foundation model trained on simulated data for in-context causal effect estimation, based on prior-fitted networks (PFNs). Joint work with Hamid Kamkari, Layer6AI & @rahulgk.bsky.social π§΅[1/7]
π arxiv.org/abs/2506.07918
π github.com/vdblm/Causal...
π£οΈOral@ICML SIM workshop
11.06.2025 13:13 β π 4 π 1 π¬ 1 π 2
Theres lots more to do to understand CFT better, and build on it to create better post-training methods to fine-tune large language models.
Reach out to me or Ethan if you're interested in collaborating on this or pushing this idea to new domains and problems!
23.04.2025 22:44 β π 1 π 0 π¬ 0 π 0
π Weβve also open-sourced OpenMedText, integrating 121K biomedical articles & 29 medical textbooks to push future research in domain-adaptive fine-tuning in biomedicine.
23.04.2025 22:44 β π 1 π 0 π¬ 1 π 0
π§ We "negative" and "adaptive" prompts, confirming that the semantic content of prompts changes and impacts fine-tuning effectiveness.
23.04.2025 22:44 β π 0 π 0 π¬ 1 π 0
π Results: On medical benchmarks, CFT improves accuracy by ~2.25% over CPT; in finance, it boosts performance by ~4.32%! Importantly, these gains scale effectively with larger models. π
Check out Appendix E.1 for preliminary results on GEMINI Flash 1.5M!
23.04.2025 22:44 β π 0 π 0 π¬ 1 π 0
π₯ We tested this idea in biomedical (using newly curated OpenMedText dataset of journals & textbooks!) and financial dataβCFT significantly outperforms continued pretraining (CPT) and instruction fine-tuning (IFT) in zero-shot settings.
23.04.2025 22:44 β π 0 π 0 π¬ 1 π 0
π Instead of using Q&A as in instruction tuning, CFT uses reflective instructions (e.g., "Reflect on how what you will see changes what you know...") motivated by how humans learn.
23.04.2025 22:44 β π 0 π 0 π¬ 1 π 0
π‘Contextual finetuning (CFT) uses contextual prompts during fine-tuning to adaptively change the semantic understanding that LLMs leverage during the process of learning new information.
23.04.2025 22:44 β π 0 π 0 π¬ 1 π 0
π Problem: Language models struggle with rapidly evolving info and context in fields like medicine & finance. We need ways to teach LLMs new information and control how they absorb this knowledge.
π Insight: Why not explain and teach LLMs how to learn?
23.04.2025 22:44 β π 0 π 0 π¬ 1 π 0
My student, Ethan Choi, will be at #ICLR2025 presenting Contextual Finetuning (CFT) and teaching LLMs how to learn (joint work with Muhammad Adil Asif, Ziwen Han, John Willes @vectorinstitute.ai)
πProject page: younwoochoi.github.io/cft-iclr/
#239, April 26 10-12:30(Hall3,2B)
23.04.2025 22:44 β π 2 π 0 π¬ 1 π 0
If it helps, I usually learn something new (either directly or from further digging) about the behavior of markets.
21.04.2025 21:23 β π 1 π 0 π¬ 2 π 0
YouTube video by Schwartz Reisman Institute
Rahul G. Krishnan | From associational to causal predictions with deep learning
π£T-CAIREM member @rahulgk.bsky.social's presentation is online! From Associational to Causal Predictions with #DeepLearning: An examination of recent advances in bridging the gap between associative #neuralnetworks and causal reasoning.
π₯ www.youtube.com/watch?v=yE6S...
24.02.2025 20:01 β π 1 π 2 π¬ 0 π 0
Rocking that @ Gmail address!
31.01.2025 15:48 β π 2 π 0 π¬ 1 π 0
Come by tomorrow to hear about what we have been up to!
28.01.2025 17:52 β π 2 π 0 π¬ 1 π 0
I thought about this a bit, I think helping PhD students close the translational gap from research to deployment (in industry or their own startups), particularly if they don't want to go into academia, is one way forward.
21.12.2024 21:07 β π 4 π 0 π¬ 0 π 1
o3 is incredible!
Since we've maxed out scale and $$$ on scaling inference-time compute I hope we now get back to thinking about the right combination of neural nets and algorithm to performant models cheaper, faster, and more reliably.
21.12.2024 21:03 β π 1 π 1 π¬ 0 π 0
1/6
Presenting "Unlearning Tabular Data without a 'Forget Set'"! We explore a new unlearning algorithm RELOAD in tabular learning. Drop by @neuripsconf.bsky.social Workshop on Table Representation Learning (@trl-research.bsky.social):
- SAT 14 Dec from 2:30pm-3:15pm!
- East Meeting Room 11-12
14.12.2024 22:00 β π 1 π 1 π¬ 5 π 0
Are you around at Neurips? Would love to say hi and catch up!
12.12.2024 18:10 β π 1 π 0 π¬ 1 π 0
Come by our poster today to learn about decision making under unobserved confounding!
12.12.2024 16:35 β π 1 π 0 π¬ 1 π 0
Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow
Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow
Finally, if you're interested in understanding how to leverage energy-based normalizing flows, check out Lance's work on Meow (chienfeng-hub.github.io/meow/)
He'll be presenting on Dec. 12, 11:00 AMβ2:00 PM at West Ballroom A-D #6403
π§΅(7/7)
11.12.2024 00:20 β π 0 π 0 π¬ 1 π 0
NATURAL
@nikitadhawan.bsky.social developed NATURAL (www.cs.toronto.edu/~nikita/natu...) with @cottascience.bsky.social , Karen & @cmaddis.bsky.social. Its an end-to-end pipeline that starts from raw-text data and ends with a causal (**) effect associated with an intervention.
(**) conditions apply
π§΅(6/7)
11.12.2024 00:20 β π 5 π 1 π¬ 1 π 3
b] ~Billions of dollars each year are spent on trials to assess interventions.
Can we use crowdsourced data to know which intervention is likely to work ahead of time?
Doing so requires answering a causal question!
But the data to answer this question is locked in unstructured text.
π§΅(5/7)
11.12.2024 00:20 β π 0 π 1 π¬ 1 π 0
Find Vahid to learn more about in-context causal inference and lots of other cool problems that he spends his time thinking about!
π§΅(4/7)
11.12.2024 00:20 β π 1 π 0 π¬ 1 π 0
a] Today, we learn from data and treat it as ground truth -- should we?
A doctor often knows more about their patient than is represented in electronic medical records.
A teacher knows more about their students than what their grades suggest.
π§΅(2/7)
11.12.2024 00:20 β π 1 π 0 π¬ 1 π 0
First post! Iβll be at @NeurIPSConf #NeurIPS2024 until Sunday. I'd love to chat about causality for medicine & science.
I'm also looking for a postdoc interested in experimental design for medicine, if that's you, send me a message.
I'll be presenting two papers at the main conference.
π§΅(1/7)
11.12.2024 00:20 β π 2 π 0 π¬ 1 π 1
AI Comms Director at CIFAR
Tech, books, write, create
Chief Models Officer @ Stealth Startup; Inria & MVA - Ex: Llama @AIatMeta & Gemini and BYOL @GoogleDeepMind
CS PhD Student at NYU, previously @MetaAI. Trying to make ML more reliable, predictable, and representative.
MIT Researcher, he/him, Senior Visiting Researcher @ Ritsumeikan, Co-Founder of Humanyze, former Senior Researcher @ HBS, author of People Analytics. AI, management, law, corporate governance, psychology, anthropology, ethics, and similar topics
Foundation Models for Generalizable Autonomy.
Assistant Professor in AI Robotics, Georgia Tech
prev Berkeley, Stanford, Toronto, Nvidia
Assistant Professor at Stanford Statistics and Stanford Data Science | Previously postdoc at UW Institute for Protein Design and Columbia. PhD from MIT.
Transforming health through Artificial Intelligence. (Based at the University of Toronto's Temerty Faculty of Medicine.) Reposts are not endorsements.
The Department of Computer Science at the University of Toronto.
AI innovator. Builder of social futures
@metaminds.bsky.social
. Pro-democracy. Resisting fascism. The Singularity is near.
Musician.
Novelist, essayist, AI optimist. Death of an Author, The Next Civil War, On Writing and Failure.
Math Assoc. Prof. (On leave, Aix-Marseille, France)
Teaching Project (non-profit): https://highcolle.com/
Research director | @McGillU @Mila_Quebec @IVADO_Qc | My team designs machine learning frameworks to understand biological systems from new angles of attack
Integrative research. Human-centred solutions. We're a U of T institute working to ensure that powerful technologies make the world betterβfor everyone.
Asst prof at Duke University. Co-founder at LayerHealth.
ML and NLP for healthcare.
PhD MIT, BS/MS Stanford. She/her.
Senior Research Scientist @MBZUAI. Focused on decision making under uncertainty, guided by practical problems in healthcare, reasoning, and biology.
Research & code: Research director @inria
βΊData, Health, & Computer science
βΊPython coder, (co)founder of scikit-learn, joblib, & @probabl.bsky.social
βΊSometimes does art photography
βΊPhysics PhD
Blog: https://argmin.substack.com/
Webpage: https://people.eecs.berkeley.edu/~brecht/
Entrepreneur
Costplusdrugs.com
Professor and Canada CIFAR AI Chair (Amii) at the University of Alberta, Dept. Medicine, BLINCLab. Corporate director. Previously: office co-lead at DeepMind Alberta. https://pilarski.github.io