Jaydeep Borkar's Avatar

Jaydeep Borkar

@jaydeepborkar.bsky.social

Visiting Researcher at Meta NYCπŸ¦™ and PhD student at Northeastern. Organizer at the Trustworthy ML Initiative (trustworthyml.org). s&p in language models + mountain biking. jaydeepborkar.github.io

40 Followers  |  33 Following  |  34 Posts  |  Joined: 27.12.2024  |  2.1799

Latest posts by jaydeepborkar.bsky.social on Bluesky

Preview
CS PhD Statements of Purpose cs-sop.org is a platform intended to help CS PhD applicants. It hosts a database of example statements of purpose (SoP) shared by previous applicants to Computer Science PhD programs.

It is PhD application season again πŸ‚ For those looking to do a PhD in AI, these are some useful resources πŸ€–:

1. Examples of statements of purpose (SOPs) for computer science PhD programs: cs-sop.org [1/4]

01.10.2025 20:37 β€” πŸ‘ 9    πŸ” 4    πŸ’¬ 1    πŸ“Œ 0
Post image Post image

"AI slop" seems to be everywhere, but what exactly makes text feel like "slop"?

In our new work (w/ @tuhinchakr.bsky.social, Diego Garcia-Olano, @byron.bsky.social ) we provide a systematic attempt at measuring AI "slop" in text!

arxiv.org/abs/2509.19163

🧡 (1/7)

24.09.2025 13:21 β€” πŸ‘ 27    πŸ” 12    πŸ’¬ 1    πŸ“Œ 1
Preview
TALKIN' 'BOUT AI GENERATION: COPYRIGHT AND THE GENERATIVE-AI SUPPLY CHAIN | The Copyright Society We know copyright

After 2 years in press, it's published!

"Talkin' 'Bout AI Generation: Copyright and the Generative-AI Supply Chain," is out in the 72nd volume of the Journal of the Copyright Society

copyrightsociety.org/journal-entr...

written with @katherinelee.bsky.social & @jtlg.bsky.social (2023)

10.09.2025 19:08 β€” πŸ‘ 11    πŸ” 4    πŸ’¬ 1    πŸ“Œ 0

it was soo fun!

30.07.2025 04:03 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Excited to be attending ACL in Vienna next week! I’ll be co-presenting a poster with Niloofar Mireshghallah on our recent PII memorization work on July 29 16:00-17:30 Session 10 Hall 4/5 (& at LLM memorization workshop)!

If you would like to chat memorization/privacy/safety/, please reach out :)

22.07.2025 04:38 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Big congratulations!! 🎊

22.07.2025 04:35 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

🍿 the place to be to meet some v cool interpretability folks (including my phd friends) :)

02.07.2025 02:55 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

now a part of meta superintelligence labs! πŸ¦™ exciting times!

02.07.2025 02:17 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

congrats!! 🎊

16.05.2025 17:25 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

big thanks to my wonderful co-authors Matthew Jagielski @katherinelee.bsky.social Niloofar Mireshghallah @dasmiq.bsky.social Christopher A. Choquette-Choo!!

15.05.2025 18:01 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

Privacy Ripple Effects has been accepted to the Findings of ACL 2025! πŸŽ‰

See you in Vienna! #ACL2025

15.05.2025 17:24 β€” πŸ‘ 13    πŸ” 3    πŸ’¬ 1    πŸ“Œ 0
Post image

Very excited to be joining Meta GenAI as a Visiting Researcher starting this June in New York City!πŸ—½ I’ll be continuing my work on studying memorization and safety in language models.

If you’re in NYC and would like to hang out, please message me :)

15.05.2025 03:18 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 0    πŸ“Œ 2

πŸ˜‚πŸ˜‚

15.05.2025 02:20 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

I am at CHI this week to present my poster (Framing Health Information: The Impact of Search Methods and Source Types on User Trust and Satisfaction in the Age of LLMs) on Wednesday April 30

CHI Program Link: programs.sigchi.org/chi/2025/pro...

Looking forward to connecting with you all!

29.04.2025 00:50 β€” πŸ‘ 1    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image

4/26 at 3pm:

'Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon'
USVSN Sai Prashanth Β· @nsaphra.bsky.social et al

Submission: openreview.net/forum?id=3E8...

25.04.2025 17:28 β€” πŸ‘ 5    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0

Bummed to be missing ICLR, but if you’re interested in all things memorization, stop by poster #200 Hall 3 + Hall 2B on April 26 3-5:30 pm and chat with several of my awesome co-authors.

We propose a taxonomy for different types of memorization in LMs. Paper: openreview.net/pdf?id=3E8YN...

21.04.2025 19:22 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image

[πŸ“„] Are LLMs mindless token-shifters, or do they build meaningful representations of language? We study how LLMs copy text in-context, and physically separate out two types of induction heads: token heads, which copy literal tokens, and concept heads, which copy word meanings.

07.04.2025 13:54 β€” πŸ‘ 78    πŸ” 20    πŸ’¬ 1    πŸ“Œ 6

BU and Boston are incredibly lucky to have Naomi!!!

27.03.2025 04:39 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

wooohooo!!! Congratulations!!!

27.03.2025 04:38 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Really liked this slide by @afedercooper.bsky.social on categorizing extraction vs regurgitation vs memorization of training data at CS&Law today!

25.03.2025 21:11 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

This is some great work!! I personally feel that one of the bottlenecks with memorization evals is having access to the gigantic training data. Super cool to see we can still run reliable evals without having access to the training data!

23.03.2025 23:24 β€” πŸ‘ 6    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0

Excited to be in Munich for my first ACM CS&Law! If you are interested in chatting about memorization + privacy/law in language models, we should hang out :)

23.03.2025 23:09 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
New England NLP Meeting Series

If you're in the northeastern US and you're submitting a paper to COLM on March 27, you should absolutely be sending its abstract to New England NLP on March 28.

19.03.2025 19:59 β€” πŸ‘ 7    πŸ” 3    πŸ’¬ 0    πŸ“Œ 0

*Please repost* @sjgreenwood.bsky.social and I just launched a new personalized feed (*please pin*) that we hope will become a "must use" for #academicsky. The feed shows posts about papers filtered by *your* follower network. It's become my default Bluesky experience bsky.app/profile/pape...

10.03.2025 18:14 β€” πŸ‘ 509    πŸ” 292    πŸ’¬ 24    πŸ“Œ 79
Preview
Oxford Word of the Year 2024 - Oxford University Press The Oxford Word of the Year 2024 is 'brain rot'. Discover more about the winner, our shortlist, and 20 years of words that reflect the world.

I'm searching for some comp/ling experts to provide a precise definition of β€œslop” as it refers to text (see: corp.oup.com/word-of-the-...)

I put together a google form that should take no longer than 10 minutes to complete: forms.gle/oWxsCScW3dJU...
If you can help, I'd appreciate your input! πŸ™

10.03.2025 20:00 β€” πŸ‘ 10    πŸ” 8    πŸ’¬ 0    πŸ“Œ 0
Small robot smoking and waving with their right hand

Small robot smoking and waving with their right hand

We’ve been receiving a bunch of questions about a CFP for GenLaw 2025.

We wanted to let you know that we chose not to submit a workshop proposal this year (we need a break!!). We’ll be at ICML though and look forward to catching up there!

You can watch our prior videos!

09.03.2025 20:33 β€” πŸ‘ 5    πŸ” 2    πŸ’¬ 2    πŸ“Œ 0
Career Update: Google DeepMind -> Anthropic TODO

Nicholas is leaving GDM at the end of this week, and we're feeling big sad about it: nicholas.carlini.com/writing/2025...

05.03.2025 21:56 β€” πŸ‘ 5    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0
2025 - ACM Symposium on Computer Science & Law CS&Law 2025 4th ACM Symposium on Computer Science and Law March 25-27, 2025Munich, Germany Submission for lightning talks is open…

Last CFP at ACM CS&Law β€˜25! Please submit your two-minute lightning talks. It’s a great way to advertise work to the community and to find potential new collaborators!

More info (including about registration) on the website: computersciencelaw.org/2025

05.03.2025 19:16 β€” πŸ‘ 3    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0

This is a joint work with incredibly incredibly wonderful people: Matthew Jagielski, @katherinelee.bsky.social, Niloofar Mireshghallah, @dasmiq.bsky.social, Christopher A. Choquette-Choo!!

02.03.2025 19:20 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

*Takeaway*: these results underscore the need for more holistic memorization audits, where examples that aren’t extracted at a particular time point are also evaluated for any potential risks. E.g., we find that multiple models have equal or more assisted memorization.

02.03.2025 19:20 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

@jaydeepborkar is following 20 prominent accounts