Raman Dutt's Avatar

Raman Dutt

@ramandutt4.bsky.social

Generative AI @Noah's Ark Lab, Huawei & @TuringInstitute | PhD candidate in Biomedical AI @ University of Edinburgh | Efficient Fine-Tuning in Medical AI, Diffusion Models, Autoregressive Image Generation

1,468 Followers  |  736 Following  |  55 Posts  |  Joined: 18.11.2024
Posts Following

Posts by Raman Dutt (@ramandutt4.bsky.social)

Post image 17.03.2025 17:36 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

This de identification token is consistently present throughout the dataset and contributes nothing towards improving the image quality.

This points towards a major flaw in the dataset given MIMIC is one of the most significant medical datasets for T2I generation. πŸ’”

22.02.2025 15:26 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

It turns out that this de identification token (β€œ___”) holds the most significant contribution towards memorizing training images.

In other words, steps taken to protect patient information are in fact posing a threat to it.

22.02.2025 15:26 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

MIMIC dataset contains pairs of images and corresponding text reports. These are raw reports describing the images.

In the dataset, the sensitive patient information is hidden or de identified. This is done by replacing it with three underscores (β€œ___”).

22.02.2025 15:25 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0

Are you working on Text-to-Image generation of Chest X-Rays using the MIMIC dataset?

Here is something I found over the last weekend. 🧡

Observations documented in this preprint -

arxiv.org/abs/2502.07516

22.02.2025 15:21 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Hello all,
What’s the best tool to make nice figures for academic AI papers?

07.12.2024 15:53 β€” πŸ‘ 8    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Not an ML-related post but I am just as happy to share this

06.12.2024 13:48 β€” πŸ‘ 8    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

1000 followers on Bluesky already that’s crazy

05.12.2024 15:21 β€” πŸ‘ 9    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

A new starter pack for Medical AI researchers!

go.bsky.app/PddA2uy

27.11.2024 23:05 β€” πŸ‘ 5    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

Done!

27.11.2024 13:13 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Sharing with people who might find this relevant - go.bsky.app/r5eVvT, go.bsky.app/PJKJ8vK, bsky.app/profile/berk...

26.11.2024 18:34 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Hence, we devise MemControl - a framework that searches for the optimal parameters to be fine-tuned to:

(1) Improve image generation quality
(2) Reduce Memorization!

MemControl leads to optimal model capacity that should be used during fine-tuning: Not more, not less!

26.11.2024 18:31 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

We also found that fine-tuning different subsets of parameters in a diffusion model can affect generative quality and memorization differently!
Each marker in the figure is a diffusion model finetuned on the same data but with different parameter subset.

Full FT (green) leads to high memorization!

26.11.2024 18:30 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

We provide empirical proof that reducing the model capacity (by fine-tuning fewer parameters) can lead to reduced memorization!

Q. How to fine-tune with fewer parameters? πŸ€”
A. Parameter-Efficient Fine-Tuning (PEFT) ✨

26.11.2024 18:26 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Figure showing how conventional fine-tuning methods can lead to replication of artifacts (red boxes)

Figure showing how conventional fine-tuning methods can lead to replication of artifacts (red boxes)

The conventional way of fine-tuning models (full fine-tuning) can lead to replication of artifacts in X-Rays that can further lead to leakage of patient information, thus endangering patient privacy.

Artifact replication is shown in red boxes.

26.11.2024 18:24 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
GitHub - Raman1121/Diffusion_Memorization_HPO: A framework to reduce memorization in text-to-image diffusion models using HPO A framework to reduce memorization in text-to-image diffusion models using HPO - Raman1121/Diffusion_Memorization_HPO

Delighted to share our work "πŒπžπ¦π‚π¨π§π­π«π¨π₯" now accepted at 𝐖𝐀𝐂𝐕 'πŸπŸ“. We show strong results for medical image generation and also establish an initial benchmark for generative quality and memorization of synthetic chest x-rays!

Paper: arxiv.org/abs/2405.19458
Code: github.com/Raman1121/Di...

MoreπŸ‘‡

26.11.2024 18:21 β€” πŸ‘ 6    πŸ” 0    πŸ’¬ 1    πŸ“Œ 1

Would love to join!

26.11.2024 11:24 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

MIDL Conference has joined BlueSky!

bsky.app/profile/midl...

26.11.2024 11:24 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Please add me πŸ˜‚

25.11.2024 16:06 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I would love to be added πŸ˜‚

25.11.2024 16:05 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@smcgrath.phd

25.11.2024 16:05 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Again, very sorry to hear about what you are going through. Advertising here is a great idea. I personally got some good advice about a condition I was going through. Wish I could be more helpful though. Wishing you the best!

25.11.2024 15:34 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

So sorry to hear about this @ian-goodfellow.bsky.social . Do you think any of this might be related to prolonged headphone usage in addition to many other factors? Or if that exacerbates the condition?

25.11.2024 15:25 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

For my fellow medical AI researchers, here is a starter pack - go.bsky.app/r5eVvT

25.11.2024 15:23 β€” πŸ‘ 5    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0

I would consider myself slightly cracked haha. Would love to be added!

25.11.2024 12:53 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Would love to be added!

25.11.2024 08:40 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Would love to be added!!

25.11.2024 08:40 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

A starter pack I would highly recommend (not biased at all πŸ˜‰)

25.11.2024 08:39 β€” πŸ‘ 5    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Would love to be added! Currently doing a PhD in Biomedical AI

25.11.2024 08:37 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0