This work builds on our recent study on Automated Structured Radiology Report Generation (x.com/IAMJBDEL/st...) which introduces the dataset and evaluation framework.
12.06.2025 14:17 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0@iamjbd.bsky.social
๐ค ML at Hugging Face ๐ฒ Academic Staff at Stanford University (AIMI Center) ๐ฆด Radiology AI is my stuff
This work builds on our recent study on Automated Structured Radiology Report Generation (x.com/IAMJBDEL/st...) which introduces the dataset and evaluation framework.
12.06.2025 14:17 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0Huge thanks to the amazing team at Stanford Center for Artificial Intelligence in Medicine and Imaging (AIMI): Johannes Moll, Louisa Fay, @asfandyar_azhar, @SophieOstmeier, Tim Lueth, Sergios Gatidis, @curtlanglotz
12.06.2025 14:17 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0๐ Paper: arxiv.org/abs/2506.00200
๐ Project Page: stanford-aimi.github.io/structuring...
๐ค Models & Data: huggingface.co/collections...
All models and datasets are fully open-source โ we hope this contributes to the broader medical AI community! ๐ค
We benchmark lightweight models (<300M params) against state-of-the-art LLMs (up to 70B params), using human-reviewed test data and clinically grounded evaluation metrics. Our results highlight the strong potential of specialized, efficient models in clinical NLP application.
12.06.2025 14:17 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0๐ฅ Excited to share our latest work: Structuring Radiology Reports: Challenging LLMs with Lightweight Models
In this study, we explore how small, task-specific encoder-decoder models can rival (and sometimes outperform) much larger LLMs; all while being faster, cheaper, and easier to deploy.
ons.
Paper, soon to appear at #ACL2025 main: arxiv.org/pdf/2505.24223
Project page, with all resources (datasets, models, ontology) and usage notes: stanford-aimi.github.io/srrg.html
All models and datasets are publicly available as open-source:
huggingface.co/collections...
4) We conduct a reader study to create a radiologist-validated test set for both the automated structured radiology report task, as well as utterances disease labels from our new ontology.
Finally, external evaluation is conducted using out-of-institution data by @hopprai.
3) We fine-tune popular RRG system on this restructured findings and impression, namely:
- Chexagent @StanfordAIMI
- MAIRA-2 @MSFTResearch
- RaDialog @TU_Muenchen
- Chexpert-plus @StanfordAIMI
As well as a BERT architecture for the disease classification system on our new ontology.
2) Since each reported observation, whether in the findings or impression sections, is expressed as a single utterance (1.5M unique utterances in total), we use a large language model to label each one according to a new ontology comprising 72 critical chest X-ray (CXR) observations.
09.06.2025 15:13 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 01) We leverage LLM to restructure MIMIC-CXR and Chexpert-plus (180K Findings sections and 400K Impression sections) into reports categorized by organ system, under strict rules.
09.06.2025 15:13 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0๐ฅ We unveil our paper accepted at the #ACL2025 Main Conference:
Automated Structured Report Generation
Let's revisit automated radiology report generation for CXR.
Free-form reports make it hard for AI systems to learn accurate generation, and even harder to evaluate. ๐งต๐
@StanfordAIMI @hopprai
Sociodemographic biases in medical decision making by large language models
www.nature.com/articles/s4...
Just noticed our lightweight RRG model has been downloaded over 92,000 times this months on ๐คHuggingFace. This model was included in the CheXpert-Plus release and contains just 67 million parameters:
huggingface.co/IAMJB/chexpe...
Its also a top ranking model on RexRank (rexrank.ai)
๐งต What if AI could learn from millions of unlabeled radiology images and reportsโand then flexibly adapt to new clinical tasks? In a new comprehensive review in
@radiology_rsna, colleagues at stanford dive into how foundation models (FMs) are set to revolutionize radiology!
"Second, we develop budget forcing to control test-time compute by forcefully terminating the model's thinking process or lengthening it by appending "Wait" multiple times to the model's generation when it tries to end."
What a trick...
Is this the last benchmark before AGI? Humanity's Last Exam (HLE)
๐คฏย 3,000 expert-level questionsย acrossย 100+ subjects, created by nearlyย 1,000 subject matter expertsย globally.
DeepSeek-R1: next level
25.01.2025 05:14 โ ๐ 14 ๐ 1 ๐ฌ 3 ๐ 0๐ฑ. Working Memory: Compiles long-term and task memory to create the final prompt for the LLM.
Typically, 1โ3 = Long-Term Memory; 5 = Short-Term Memory.
Thoughts on agent memory?๐
๐ฎ. Semantic Memory: External/grounding knowledge or self-knowledge, similar to RAG context.
๐ฏ. Procedural Memory: System setup details like prompts, tools, and guardrails (stored in Git/registries).
๐ฐ. Task Memory: Info retrieved from long-term storage for immediate tasks.
๐ ๐ฆ๐ถ๐บ๐ฝ๐น๐ฒ ๐๐๐ถ๐ฑ๐ฒ ๐๐ผ ๐๐ ๐๐ด๐ฒ๐ป๐ ๐ ๐ฒ๐บ๐ผ๐ฟ๐ ๐
An agent's memory helps it plan and react by leveraging past interactions or external data via prompt context. Hereโs a breakdown:
๐ญ. Episodic Memory: Logs past actions/interactions (e.g., stored in a vector database for semantic search).
๐งฉ The future of creativity is elemental. โจ
Kling AI just announced Elements
๐ First, world building:
Craft your characters, environments, props. Plan your motion and VFX.
๐๏ธ Then, remixing:
Bring it all together into a cohesive story.
Oops. Thanks!
16.01.2025 21:37 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Amazing. Agent Roles:
โณ PhD Agent: Conducts literature reviews, interprets results, writes reports.
โณ Postdoc Agent: Plans research, designs experiments.
โณ ML Engineer Agent: Prepares data, writes, optimizes code.
โณ Professor Agent: Oversees, refines reports.
MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era
>> Hybrid linear-softmax attention working very well at large scale and long-context
filecdn.minimax.chat/_Arxiv_MiniM...
first look into what the Qwen team used to develop QwQ
arxiv.org/pdf/2501.07301
Neat: Representing Long Volumetric Video with Temporal Gaussian Hierarchy
Contrib: Temporal Gaussian Hierarchy representation for long volumetric video.
Nice visualization of RAG vs. Agentic RAG
13.01.2025 17:37 โ ๐ 6 ๐ 2 ๐ฌ 2 ๐ 0Neat. Converts images, PDFs, and Office documents to Markdown or JSON using OCR and LLM models, with features for caching, distributed processing, and PII removal
12.01.2025 04:33 โ ๐ 23 ๐ 1 ๐ฌ 1 ๐ 0