Mason Nakamura

@masonnaka.bsky.social

Thinking about value alignment, RL, multi-agents, and embodied agents https://www.masonnakamura.com/

4 Followers  |  5 Following  |  4 Posts  |  Joined: 11.01.2025

Latest posts by masonnaka.bsky.social on Bluesky


This work was done in collaboration with
@akumar2709, @saad-ai.bsky.social, Sahar Abdelnabi, Shlomo Zilberstein, and @ebagdasa.bsky.social.
📄Paper: arxiv.org/pdf/2510.14312
💻Code: github.com/umass-aisec/...
🌐Project Website: aisec.cs.umass.edu/projects/ter...

30.10.2025 16:12 — 👍 0    🔁 0    💬 0    📌 0
Our attack evaluation covers confidentiality (info leakage), integrity (adversarial agent & comm‑poisoning), and availability (context overflow). We also integrate 3 cooperative DCOP environments: 📅 MeetingScheduling, 🏡 SmartGrid, and 🧎 PersonalAssistant.

30.10.2025 16:12 — 👍 0    🔁 0    💬 1    📌 0
Why? MASs amplify both capabilities and risks; private data + cross‑agent comms create large attack surfaces. Terrarium provides a controllable, observable sandbox that uses MCP servers and agent-to-agent comm via blackboards for reproducible studies of a new agent paradigm.

30.10.2025 16:12 — 👍 0    🔁 0    💬 1    📌 0
🚨New preprint: Terrarium, an open-source, blackboard-based testbed for studying safety, privacy & security in LLM multi‑agent systems (MAS). We showcase the vulnerabilities and safety considerations of agentic MASs in this modular and configurable framework. 🧵
#AISafety #LLMAgents #Agents

30.10.2025 16:12 — 👍 5    🔁 0    💬 1    📌 1
