Apurv Verma @apurv-verma - Bluesky Profile

Apurv Verma

@apurv-verma.bsky.social

Building safer, more aligned models 🧭 📐 PhD student, NJIT 🎓 | NLP at Bloomberg 🛠️ Website: vermaapurv.com/aboutme/

25 Followers | 144 Following | 3 Posts | Joined: 16.11.2024 | 1.5292

Latest posts by apurv-verma.bsky.social on Bluesky

Watermarking Degrades Alignment in Language Models: Analysis and Mitigation Watermarking techniques for large language models (LLMs) can significantly impact output quality, yet their effects on truthfulness, safety, and helpfulness remain critically underexamined. This paper...

Ever wondered about watermarking's effect on model alignment? 🤔
We found it shifts AI safety behavior. Our fix: generate 2-4 responses, pick the best one 🎯
"Watermarking Degrades Alignment in Language Models" 📄
arxiv.org/abs/2506.04462
#AIResearch #AISafety #Watermarking #LLMs

08.06.2025 01:57 — 👍 4 🔁 1 💬 0 📌 0

This is quite insightful

27.02.2025 01:02 — 👍 0 🔁 0 💬 0 📌 0

How has DeepSeek improved the Transformer architecture? This Gradient Updates issue goes over the major changes that went into DeepSeek’s most recent model.

Very good (technical) explainer answering "How has DeepSeek improved the Transformer architecture?". Aimed at readers already familiar with Transformers.

epoch.ai/gradient-upd...

30.01.2025 21:07 — 👍 282 🔁 64 💬 6 📌 5

Very interesting paper by Ananda Theertha Suresh et al.

For categorical/Gaussian distributions, they derive the rate at which a sample is forgotten to be 1/k after k rounds of recursive training (hence 𝐦𝐨𝐝𝐞𝐥 𝐜𝐨𝐥𝐥𝐚𝐩𝐬𝐞 happens more slowly than intuitively expected)

27.12.2024 23:35 — 👍 35 🔁 5 💬 1 📌 0

I am an AI researcher working on safe AI. My most recent work can be found at arxiv.org/abs/2407.14937. I am trying to connect with other AI researchers on 🦋; follow me here, and I will follow you back.

19.11.2024 02:15 — 👍 0 🔁 0 💬 0 📌 0

@apurv-verma is following 20 prominent accounts

@syhw

Stanford NLP Group
@stanfordnlp

Computational Linguists—Natural Language—Machine Learning

Schmidt Sciences
@schmidtsciences

Bold science, deep and continuous collaborations.

@lutzoettershagen

Assistant Professor at the Department of Computer Science, University of Liverpool. https://lutzoe.github.io/

Paper Skygest Team
@paper-feed

Building personalized Bluesky feeds for academics! Pin Paper Skygest, which serves posts about papers from accounts you're following: https://bsky.app/profile/paper-feed.bsky.social/feed/preprintdigest. By @sjgreenwood.bsky.social and @nkgarg.bsky.social

Mike Godwin
@mnemonic

Lawyer, author, EFF's first hire, Godwin's Law creator (he/him). Retweeting!=endorsing. I tell jokes here, mostly. My opinions here don't necessarily represent any employer or any client. You may have known me as @sfmnemonic on Twitter.

Grant Sanderson
@3blue1brown.com

Math videos

Simon Willison
@simonwillison.net

Independent AI researcher, creator of datasette.io and llm.datasette.io, building open source tools for data journalism, writing about a lot of stuff at https://simonwillison.net/

Jason Weston
@jasonweston

Senior Director, Research Scientist @ Meta FAIR + Visiting Prof @ NYU. Pretrain+SFT: NLP from Scratch (2011). Multilayer attention+position encode+LLM: MemNet (2015). Recent (2024): Self-Rewarding LLMs & more!

Jeff Dean
@jeffdean

Google Chief Scientist, Gemini Lead. Opinions stated here are my own, not those of Google. Gemini, TensorFlow, MapReduce, Bigtable, Spanner, ML things, ...

Noam Brown
@polynoamial

Researching reasoning at OpenAI | Co-created Libratus/Pluribus superhuman poker AIs, CICERO Diplomacy AI, and OpenAI o-series / 🍓

Guillaume
@guillaume-garrigos.com

Chaque soir: Tente de conquérir le monde. Le reste du temps: MCF (Paris). @GuillaumeG_ sur X 🇨🇵 🇬🇧 🇪🇸 🇮🇹

Sébastien Darses
@sebdarses

Math Assoc. Prof. (On leave, Aix-Marseille, France) Teaching Project (non-profit): https://highcolle.com/

Nikos Karalias
@stalence

postdoc at MIT CSAIL working on solving combinatorial problems with neural networks

Alex Warstadt
@alexwarstadt

Asst Prof. @ UCSD | PI of LeM🍋N Lab | Former Postdoc at ETH Zürich, PhD @ NYU | computational linguistics, NLProc, CogSci, pragmatics | he/him 🏳️‍🌈 alexwarstadt.github.io

Simons Institute for the Theory of Computing
@simonsinstitute

The world's leading venue for collaborative research in theoretical computer science. Follow us at http://YouTube.com/SimonsInstitute.

Albert Vilella, PhD.
@albertvilella

Bioinformatics Scientist / Next Generation Sequencing, Single Cell and Spatial Biology, Next Generation Proteomics, Liquid Biopsy, SynBio, Compute Acceleration in biotech // http://albertvilella.substack.com

Julien Cornebise
@jcornebise

Hon. Associate Professor UCL CS | Ex-Dir. Research AI for Good & Head of Element AI London Office | Ex-DeepMind. He/Him | https://cornebise.com

Tuhin Chakrabarty
@tuhinchakr

Incoming Assistant Prof @sbucompsc @stonybrooku. Researcher → @SFResearch Ph.D. → @ColumbiaCompSci Human Centered AI / Future of Work / AI & Creativity

Aidan Gomez
@aidangomez

Cohere