How and when should LLM guardrails be deployed to balance safety and user experience?
Our #EMNLP2025 paper reveals that crafting thoughtful refusals rather than detecting intent is the key to human-centered AI safety.
📄 arxiv.org/abs/2506.00195
🧵[1/9]
20.10.2025 20:04 — 👍 8 🔁 3 💬 1 📌 0
[9/9] Big THANKS to my amazing collaborators @jiajiah.bsky.social @pigeonzow.bsky.social Motahhare Eslami, Jena Hwang @faebrahman.bsky.social, @carolynrose.bsky.social @maartensap.bsky.social from @ltiatcmu.bsky.social
Pareto.ai @sfu.ca @ai2.bsky.social ♥️
📂 github.com/EEElisa/LLM-Guardrails
20.10.2025 20:04 — 👍 1 🔁 1 💬 0 📌 0
Me with my cat on the plane
Some life updates here: got a car, managed to escape grad school, working on expert data curation and the future of work these days, got a new cat, moved back to SF, and finally feel alive now.
02.06.2025 02:13 — 👍 4 🔁 0 💬 0 📌 0
I’m back!
02.06.2025 02:01 — 👍 1 🔁 0 💬 0 📌 0
Look at my kitty ❤️❤️
07.01.2024 06:24 — 👍 10 🔁 1 💬 1 📌 0
Ok, guys I cleared out the fridge and I… OH NO
19.05.2023 01:06 — 👍 236 🔁 42 💬 20 📌 2
Thank u!
04.06.2023 04:20 — 👍 4 🔁 0 💬 0 📌 0
Got Covid again 😢
31.05.2023 18:53 — 👍 3 🔁 0 💬 0 📌 0
Skeet!
06.05.2023 03:37 — 👍 0 🔁 0 💬 0 📌 0
Moss shower mat?
05.05.2023 22:04 — 👍 73 🔁 8 💬 8 📌 1
Vibe > existing follower count in getting new followers
05.05.2023 20:50 — 👍 7 🔁 1 💬 0 📌 0
People tweeting vs people skeeting
05.05.2023 20:37 — 👍 6 🔁 2 💬 0 📌 0
People skeeting
05.05.2023 20:37 — 👍 1 🔁 0 💬 0 📌 0
OMG Casey Neistat on bsky! Welcome Casey!
05.05.2023 17:42 — 👍 1 🔁 0 💬 0 📌 0
From what I’ve seen, most proposed LLM-based tools are really just proofs of concept. If all a “moderation tool” does is prompt GPT to produce a formatted JSON file, there’s no way to tune it and no barrier to entry.
05.05.2023 16:27 — 👍 2 🔁 2 💬 0 📌 0
01.05.2023 21:07 — 👍 18 🔁 5 💬 2 📌 0
✨
04.05.2023 05:07 — 👍 1 🔁 0 💬 0 📌 0
Repost as reminder 👀
04.05.2023 04:54 — 👍 2 🔁 0 💬 0 📌 0
The memes are too good
04.05.2023 04:51 — 👍 1 🔁 0 💬 0 📌 0
/honk Duck is getting out of control
04.05.2023 04:46 — 👍 1 🔁 0 💬 1 📌 0
following the protocols summer like 👀
04.05.2023 04:44 — 👍 1 🔁 0 💬 0 📌 0
The eternal September is pretty awesome after all.
04.05.2023 04:42 — 👍 1 🔁 0 💬 0 📌 0
/squeeee? 🤔
04.05.2023 04:36 — 👍 0 🔁 0 💬 2 📌 0
Bsky post giving out jazzing vibe ❤️
04.05.2023 04:35 — 👍 0 🔁 0 💬 0 📌 0
Can confirm.
04.05.2023 01:35 — 👍 72 🔁 6 💬 6 📌 0
Must! Share! Capybaras!
04.05.2023 04:29 — 👍 1 🔁 0 💬 1 📌 0
Arram ✨
04.05.2023 04:28 — 👍 1 🔁 0 💬 0 📌 0
This too, shall pass 🧘
04.05.2023 04:26 — 👍 5 🔁 0 💬 0 📌 0
Twitter right now: Apollo gets crazy after getting shot in the heart. 🙈
04.05.2023 04:24 — 👍 2 🔁 0 💬 0 📌 0
Oh this is the Commons!
04.05.2023 04:20 — 👍 1 🔁 0 💬 0 📌 0
Photographer, ocean adventurer, graduate architect, ocean athlete hunting the craziest waves on earth.
Researcher of online rumors & disinformation. Former basketball player. Prof at University of Washington, HCDE. Co-founder of the UW Center for an Informed Public. Personal account: Views may not reflect those of my employer. #RageAgainstTheBullshitMachine
Top news and commentary for technology's leaders, from all around the web.
This account shares top-level Techmeme headlines. Visit https://techmeme.com/ for full context.
Internet Archive is a non-profit research library preserving web pages, books, movies & audio for public access. Explore web history via the Wayback Machine.
HCI researcher, designer, theatre artist, publisher @ writlargeprojects.com
#hEDS, #MCAS, #POTS, #disability
Every few hours I show you a tiny aquarium with interesting fish. Please do not tap the glass.
By @JoeSondow.bsky.social
Every few hours I show you a tiny meadow full of grass and flowers.
By @JoeSondow.bsky.social
Professor of HCII and LTI at Carnegie Mellon School of Computer Science.
jeffreybigham.com
Doing open source-y stuff, probably full of bees. opensourcestories.org co-founder. Human to Luna Rae Muppet Show.
she/her 🏳️‍🌈
The haiku collector.
https://en.wikipedia.org/wiki/Haiku
I check whether the posts of everyone who follows me are haiku.
If you spot any abuse, please contact @mattn.bsky.social.
President of Signal, Chief Advisor to AI Now Institute
Workforce Economist in Residence at Guild; Senior Fellow at the Burning Glass Institute. I tweet a lot about labor markets, macro, and (sorry) music! Tweets represent my own views.
the once and future city planner // senior legislative director for california YIMBY // proud kentuckian // #BBN // buy my book ❤
Co-Founder shillr.xyz | Accidental Art Collector | Curator of Vibes ✨🍄
Banner by https://bsky.app/profile/efdot.bsky.social
Art Collector? Biscuit King. Sierra Nevada is my brew.
@clickcreate
«⠀𝗧𝗢𝗞𝗬𝗢⠀𝗦𝗧𝗥𝗘𝗘𝗧⠀𝗣𝗛𝗢𝗧𝗢𝗚𝗥𝗔𝗣𝗛𝗬⠀»
Photographer⠀×⠀Creator⠀×⠀Degen
▲●■ DƎFAULT TOKYO
https://foundation.app/collection/default3
🗼 ᵍᵐ ᴺᶠᵀ 🔗 https://linktr.ee/oneloudimage
I can think. I can wait. I can fast.