Jonathan H Chen MD PhD's Avatar

Jonathan H Chen MD PhD

@jonc101.bsky.social

Physician Data Scientist - Stanford Center for Biomedical Informatics Research + Division of Hospital Medicine + Clinical Excellence Research Center + Biomedical Data Science

309 Followers  |  56 Following  |  166 Posts  |  Joined: 05.01.2025  |  2.2465

Latest posts by jonc101.bsky.social on Bluesky

statistically non-inferior to a single human expert (p < 0.001). Our benchmark provides evidence of LMs approaching expert-level ability in validating AI-generated medical text."

28.10.2025 00:05 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

(p < 0.001) alignment with physicians across seen and unseen tasks, increasing average F1 scores from 66% to 83%. Despite strong baseline performance, MedVAL improves the best-performing proprietary LM (GPT-4o) by 8% without training on physician-labeled data, demonstrating a performance

28.10.2025 00:05 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

outputs. To evaluate LM performance, we introduce MedVAL-Bench, a dataset of 840 physician-annotated outputs across 6 diverse medical tasks capturing real-world challenges. Across 10 state-of-the-art LMs spanning open-source and proprietary models, MedVAL distillation significantly improves

28.10.2025 00:05 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

To address these challenges, we propose MedVAL, a novel, self-supervised, data-efficient distillation method that leverages synthetic data to train evaluator LMs to assess whether LM-generated medical outputs are factually consistent with inputs, without requiring physician labels or reference

28.10.2025 00:05 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

is challenging because 1) manual review is costly and 2) expert-composed reference outputs are often unavailable in real-world settings. While the "LM-as-judge" paradigm (a LM evaluating another LM) offers scalable evaluation, even frontier LMs can miss subtle but clinically significant errors.

28.10.2025 00:05 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Abstract: "With the growing use of language models (LMs) in clinical environments, there is an immediate need to evaluate the accuracy and safety of LM-generated medical text. Currently, such evaluation relies solely on manual physician review. However, detecting errors in LM-generated text

28.10.2025 00:05 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

conditions using opportunistic imaging. Before joining Stanford, he completed a Master’s in Electrical and Computer Engineering at UT Austin, where he worked on improving medical image reconstruction by learning priors from corrupted data, advised by Jon Tamir and Alex Dimakis.

28.10.2025 00:05 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

AI and expert clinician-level performance. His recent projects focus on 1) improving LLMs as expert-level evaluators of AI-generated medical text, 2) improving robustness of language model benchmarks across diverse medical tasks using prompt optimization, and 3) detection of underdiagnosed medical

28.10.2025 00:05 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Bio: Asad is a research staff at Stanford, advised by Akshay Chaudhari. His research broadly focuses on developing machine learning methods for healthcare applications. More concretely, he is interested in building scalable, self-supervised methods to help bridge the gap between

28.10.2025 00:05 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

@stanforddeptmed.bsky.social Biomedical Informatics Research Colloquia

β€œMedVAL: Toward Expert-Level Medical Text Validation with Language Models”
Asad Aali, MS.

Thursday, October 30th, 2025
12:00 to 1:00 pm PST

stanford.zoom.us/j/9788759601...

Webinar ID: 978 8759 6012
Webinar Passcode: 420642

28.10.2025 00:00 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

This talk will describe how Comet is trained across diverse health systems, what scaling reveals about generalization and medical reasoning, and how these capabilities can be applied to improve prediction, discovery, and patient outcomes in real-world settings."

21.10.2025 02:24 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Abstract: "Generative models have the potential to transform how health systems learn from data. Comet, Epic’s large-scale generative medical model, is designed to represent patient histories as sequences of clinical events, enabling reasoning about disease trajectories and care outcomes.

21.10.2025 02:24 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Bio: Software developer and lead of Comet team at Epic Systems.

21.10.2025 02:23 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Abstract: "The talk outlines how integrating rich clinical data with AIβ€”especially large language modelsβ€”can power β€œprecision education” that delivers individualized, outcome-driven learning and assessment across medical training and practice."

11.10.2025 16:21 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Jesse lives with his wife and two children in the Lower East Side of New York City.

11.10.2025 16:21 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

and translational research, Jesse leads grant-funded studies exploring the intersection of medical education, informatics, and AI. His work aims to optimize trainee clinical performance and develop personalized educational interventions.

11.10.2025 16:20 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Bio: Jesse Burk-Rafel, an assistant professor of medicine at NYU Grossman School of Medicine, directs research at the NYU Institute for Innovations in Medical Education. He is also a hospitalist and inaugural Research Coach in the Division of Hospital Medicine. With a background in bioengineering

11.10.2025 16:20 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Jesse Burk-Rafel, MD, MRes.

Jesse Burk-Rafel, MD, MRes.

@stanforddeptmed.bsky.social Biomedical Informatics Research Colloquia
β€œPrecision Education in the AI Era”
Jesse Burk-Rafel, MD, MRes.

Thursday, October 16th, 2025
12:00 to 1:00 pm PST

Live Stream
stanford.zoom.us/j/9788759601...

Webinar ID: 978 8759 6012
Webinar Passcode: 420642

11.10.2025 16:19 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

determine when to trust AI autonomously, when human oversight is essential, and when to avoid AI entirely."

07.10.2025 00:02 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

medical education landscape. Participants will learn to navigate the "Alignment Paradox"β€”ensuring AI tools serve educational goals rather than undermine themβ€”through an evidence-based decision framework. This practical approach, grounded in principles of AI performance patterns, helps educators

07.10.2025 00:02 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Abstract: "Artificial intelligence promises to revolutionize medical education, yet most institutions struggle to move beyond pilot projects to meaningful implementation. This talk bridges the gap between AI's potential and current reality by showcasing real-world applications from across the

07.10.2025 00:02 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

keynotes, grand rounds, and workshops at leading institutions around the world. Her vision is to democratize access to individualized, mastery-based medical training by harnessing AI to scale feedback, foster equity, and capture the richness of clinical reasoning.

07.10.2025 00:02 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

holds several patents pending on AI-driven educational platforms, and has been invited to contribute to advisory committees for the AAMC, AMA, ABMS, and the International Advisory Committee on AI in Health Professions Education. Widely recognized as a thought leader, Dr. Turner has delivered invited

07.10.2025 00:02 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

initiatives to integrate AI responsibly into medical education. Her work focuses on leveraging multi-agent architectures, learning analytics, and adaptive assessment systems to advance precision medical education and reduce disparities in training. She has secured multiple competitive grants,

07.10.2025 00:02 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

an educational technology company developing AI-powered platforms for personalized learning in healthcare. An interdisciplinary scholar with expertise in artificial intelligence, natural language processing, fuzzy logic, and educational informatics, Dr. Turner leads national and international

07.10.2025 00:02 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Bio: Dr. Laurah Turner, PhD is the Associate Dean for Artificial Intelligence and Educational Informatics and Associate Professor of Biostatistics, Health Informatics and Data Sciences and Medical Education at the University of Cincinnati College of Medicine. She is also co-founder of 2-Sigma,

07.10.2025 00:02 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Laurah Turner, PhD

Laurah Turner, PhD

@stanforddeptmed.bsky.social Biomedical Informatics Research Colloquia
β€œApplied Intelligence: Integrating AI Technologies Into Medical Education”
Laurah Turner, PhD.

Thursday, October 9th, 2025
12:00 to 1:00 pm PST

stanford.zoom.us/j/9788759601...

Webinar ID: 978 8759 6012
Passcode: 420642

06.10.2025 23:59 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 1
Post image

Years ago, I led a National Academy of Medicine report chapter, where I called for the emphasis on what Computers and Humans are each especially good at. But..., what belongs in each column may need some rethinking. nam.edu/wp-content/u...

02.10.2025 23:29 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

having a combination of ALL of these attributes in a single person (or entity) remains essential and potent.

02.10.2025 23:28 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Struggling in past year to articulate what good a human clinician/professional is good for anymore, because many things that were true a few years ago are actively being challenged. A dynamic space, but

02.10.2025 23:28 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

@jonc101 is following 20 prominent accounts