Somnath Basu Roy Chowdhury's Avatar

Somnath Basu Roy Chowdhury

@somnathbrc.bsky.social

Research Scientist at Google Research https://www.cs.unc.edu/~somnath/

55 Followers  |  111 Following  |  16 Posts  |  Joined: 16.11.2024  |  1.8365

Latest posts by somnathbrc.bsky.social on Bluesky

(9/n) Finally, I would like to thank all my amazing co-authors: Avinava, @abeirami.bsky.social , Rahul, Nicholas, Amr, Snigdha.

cc @unccs.bsky.social

02.04.2025 16:03 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
[Somnath Basu Roy Chowdhury]Blogs

(8/n) Here is a blog post with a simplified overview of our work: www.cs.unc.edu/~somnath/blo...

Code: github.com/brcsomnath/pef
Paper link: arxiv.org/abs/2503.20098

02.04.2025 16:03 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

(7/n) We would like to highlight previous great works, like LEACE, that perfectly erase concepts to protect against linear adversaries. In our work, we improve upon this method and present a technique that can protect against any adversary.

x.com/norabelrose/...

02.04.2025 16:03 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

(6/n) We also visualize the learned representations from different erasure methods. We observe that PEF perfectly erasure group (or concept) information without losing other information (collapsing the representation space).

02.04.2025 16:03 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

(5/n) Empirically, we observe that PEF reaches the theoretical limits of erasure even in challenging settings where other methods struggle, including both linear (INLP, LEACE) and non-linear techniques (FaRM, KRaM).

02.04.2025 16:03 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

(4/n) When the distributions are unequal, we still achieve perfect erasure but with a slightly reduced utility. The erasure function in this setting is shown below.

02.04.2025 16:03 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

(3/n) From the above limits, we show that optimally perfect concept erasure is only feasible when the underlying distributions are equal up to permutations. In such scenarios, the erasure function is shown in the diagram.

02.04.2025 16:03 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

(2/n) We study the fundamental limits of concept erasure. Borrowing from the work of @FlavioCalmon et al in information theory literature, we characterize the erasure capacity and maximum utility that can be retained during concept erasure.

02.04.2025 16:03 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

๐‡๐จ๐ฐ ๐œ๐š๐ง ๐ฐ๐ž ๐ฉ๐ž๐ซ๐Ÿ๐ž๐œ๐ญ๐ฅ๐ฒ ๐ž๐ซ๐š๐ฌ๐ž ๐œ๐จ๐ง๐œ๐ž๐ฉ๐ญ๐ฌ ๐Ÿ๐ซ๐จ๐ฆ ๐‹๐‹๐Œ๐ฌ?

Our method, Perfect Erasure Functions (PEF), erases concepts perfectly from LLM representations. We analytically derive PEF w/o parameter estimation. PEFs achieve pareto optimal erasure-utility tradeoff backed w/ theoretical guarantees. #AISTATS2025 ๐Ÿงต

02.04.2025 16:03 โ€” ๐Ÿ‘ 39    ๐Ÿ” 8    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 3

Please stop by our posters if youโ€™re interested. Feel free to reach out if you're interested in AI safety, efficiency, and just want to chat!

CC: @unccs.bsky.social

06.12.2024 19:24 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

(3/3) ๐“๐จ๐ฐ๐š๐ซ๐๐ฌ ๐’๐œ๐š๐ฅ๐š๐›๐ฅ๐ž ๐„๐ฑ๐š๐œ๐ญ ๐Œ๐š๐œ๐ก๐ข๐ง๐ž ๐”๐ง๐ฅ๐ž๐š๐ซ๐ง๐ข๐ง๐  ๐”๐ฌ๐ข๐ง๐  ๐๐„๐…๐“

Iโ€™m also presenting my ongoing unlearning work at SafeGenAI Workshop. This uses a novel PEFT training approach to improve exact unlearning efficiency

arxiv.org/abs/2406.16257

06.12.2024 19:24 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

(2/3) ๐…๐š๐ฌ๐ญ ๐“๐ซ๐ž๐ž-๐…๐ข๐ž๐ฅ๐ ๐ˆ๐ง๐ญ๐ž๐ ๐ซ๐š๐ญ๐จ๐ซ

An efficient method for graph field integration (a special case of matrix-vector mult.) using integrator trees. FTFI enables polylog-lin. time multiplication w/ performance boost in vision transformers

arxiv.org/abs/2406.15881

06.12.2024 19:24 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

๐ŸšจIโ€™m traveling to #NeurIPS2024 next week to present these papers.

(1/3) ๐’๐ญ๐ซ๐ฎ๐œ๐ญ๐ฎ๐ซ๐ž๐ ๐”๐ง๐ซ๐ž๐ฌ๐ญ๐ซ๐ข๐œ๐ญ๐ž๐-๐‘๐š๐ง๐ค ๐Œ๐š๐ญ๐ซ๐ข๐œ๐ž๐ฌ ๐Ÿ๐จ๐ซ ๐๐„๐…๐“

A new PEFT method replacing low-rank matrices (LoRA) with more expressive structured matrices

arxiv.org/abs/2406.17740

06.12.2024 19:24 โ€” ๐Ÿ‘ 6    ๐Ÿ” 1    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Please stop by our posters if youโ€™re interested. Feel free to reach out if you're interested in AI safety, efficiency, and just want to chat!

CC: @unccs.bsky.social

06.12.2024 19:15 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

(3/3) ๐“๐จ๐ฐ๐š๐ซ๐๐ฌ ๐’๐œ๐š๐ฅ๐š๐›๐ฅ๐ž ๐„๐ฑ๐š๐œ๐ญ ๐Œ๐š๐œ๐ก๐ข๐ง๐ž ๐”๐ง๐ฅ๐ž๐š๐ซ๐ง๐ข๐ง๐  ๐”๐ฌ๐ข๐ง๐  ๐๐„๐…๐“

Iโ€™m also presenting my ongoing unlearning work at SafeGenAI Workshop. This uses a novel PEFT training approach to improve exact unlearning efficiency

arxiv.org/abs/2406.16257

06.12.2024 19:15 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

(2/3) ๐…๐š๐ฌ๐ญ ๐“๐ซ๐ž๐ž-๐…๐ข๐ž๐ฅ๐ ๐ˆ๐ง๐ญ๐ž๐ ๐ซ๐š๐ญ๐จ๐ซ

An efficient method for graph field integration (a special case of matrix-vector mult.) using integrator trees. FTFI enables polylog-lin. time multiplication w/ performance boost in vision transformers

arxiv.org/abs/2406.15881

06.12.2024 19:15 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

@somnathbrc is following 20 prominent accounts