Edoardo Debenedetti @NeurIPS @edebenedetti

@edebenedetti.bsky.social

PhD student at ETH Zurich | Student Researcher at Google | Agent security and, more generally, ML security and privacy | edoardo.science | spylab.ai

234 Followers | 61 Following | 2 Posts | Joined: 21.11.2023

Latest posts by edebenedetti.bsky.social on Bluesky

I am at NeurIPS 🇨🇦, please reach out if you want to grab a coffee!

12.12.2024 22:36 — 👍 4 🔁 2 💬 0 📌 0

SPY Lab is in Vancouver for NeurIPS! Come say hi if you see us around 🕵️

10.12.2024 19:43 — 👍 10 🔁 2 💬 1 📌 1

I'm in Vancouver for NeurIPS! Feel free to reach out if you wanna meet to chat about security and privacy, especially in the context of LLM agents!

10.12.2024 14:59 — 👍 0 🔁 0 💬 0 📌 0

Come do open AI with us in Zurich!
We're hiring PhD students, postdocs (and faculty!)

04.12.2024 13:49 — 👍 11 🔁 3 💬 0 📌 1

Feel free to recommend @javirandor.com more researchers to add to the list!

04.12.2024 11:31 — 👍 3 🔁 0 💬 0 📌 0

Apropos of today's Overleaf downtime/slowness: remember to have your files backed up on Github or locally! What if this happened on the day of a conference deadline?

03.12.2024 16:14 — 👍 17 🔁 2 💬 1 📌 0

Anyone may be able to compromise LLMs with malicious content posted online. With just a small amount of data, adversaries can backdoor chatbots to become unusable for RAG, or bias their outputs towards specific beliefs. Check our latest work! 👇🧵

25.11.2024 12:27 — 👍 5 🔁 2 💬 1 📌 1

Gradient Masking All-at-Once: Ensemble Everything Everywhere Is Not Robust

Ensemble everything everywhere is a defense to adversarial examples that was recently proposed to make image classifiers robust. This defense works by ensembling a model's intermediate representations...

Ensemble Everything Everywhere is a defense against adversarial examples that people got quite excited about a few months ago (in particular, the defense causes "perceptually aligned" gradients, just like adversarial training).

Unfortunately, we show it's not robust...

arxiv.org/abs/2411.14834

25.11.2024 08:38 — 👍 28 🔁 9 💬 1 📌 0

@edebenedetti is following 20 prominent accounts

Catherine Regis
@catherineregis

Law professor at Université de Montréal, Canada CIFAR Chair in AI and Human Rights, Canada research chair in Health Law and Policy, Academic Member at Mila, Director of social innovation and international policy at IVADO

Ana-Maria Cretu
@ana-mariacretu

Tenure-track faculty at CISPA. Previously a post-doc at EPFL studying privacy and safety harms in data-driven systems and PhD in data privacy at Imperial College London. https://ana-mariacretu.github.io/

Mislav Balunovic
@mbai

AI for Math

Jakub Łucki
@jakublucki

Visiting Researcher at NASA JPL | Data Science MSc at ETH Zurich

@tongwu-princeton

Jie Zhang
@jiezhang-ethz

PhD student at ETH Zurich, working on ML privacy and security https://zj-jayzhang.github.io/

Ahmad Beirami
@abeirami

stealth // Gemini RL+inference @ Google DeepMind // Conversational AI @ Meta // RL Agents @ EA // ML+Information Theory @ MIT+Harvard+Duke // Georgia Tech PhD 📍{NYC, SFO, YYZ} 🔗 https://beirami.github.io/

Chawin Sitawarin
@chawins

Postdoc @Meta (Privacy-Preserving ML | Central Applied Science). PhD CS @UCBerkeley. ML security 👹 privacy 👀 robustness 🛡 Views are my own.

Tinghao Xie (✈️ Neurips)
@tinghaox

3rd year PhD candidate @ Princeton ECE

Kristina Nikolić
@nkristina

PhD student at ETH Zurich, working on AI safety. Cambridge MPhil in ML graduate | Alumnus of Mathematical Grammar School | from Serbia

Boyi Wei
@boyiwei

PhD Student @Princeton

garak, LLM Vulnerability Scanner
@garak-llm

https://garak.ai

Mario Seminerio
@phastidio.net

Opinions are my (cl)own https://linktr.ee/marioseminerio editor [@] phastidio [.] net

Hanna Yukhymenko
@ayukh

Statistics MSc @ ETH Zurich | Multilingual LLM training/eval/safety @ SRI lab | ayukh.com

Bogdan Kulynych
@bogdankulynych

researcher studying privacy, security, reliability, and broader social implications of algorithmic systems · fake doctor working at a real hospital · website: https://kulyny.ch

Jeremy Howard
@howard.fm

https://Answer.AI & https://fast.ai founding CEO; previous: hon professor @ UQ; leader of masks4all; founding CEO Enlitic; founding president Kaggle; various other stuff…

Nathan Lambert
@natolambert

A LLN - large language Nathan - (RL, RLHF, society, robotics), athlete, yogi, chef Writes http://interconnects.ai At Ai2 via HuggingFace, Berkeley, and normal places

Clem Delangue 🤗
@clem.hf.co

Co-founder and CEO at Hugging Face

Julien Chaumond
@julien-c.hf.co

I build tools that propel communities forward

Giada Pistilli
@giadapistilli.com

Philosopher in tech, currently at Mistral AI. Doctor of talking machines, now teaching them good behavior.