Incoming: Dreadnode paper drop from Shane Caldwell and the crew.
PentestJudgeβJudging Agent Behavior Against Operational Requirements: arxiv.org/abs/2508.02921
Explore how we built an LLM-as-judge system for evaluating the operations of pentesting agents (inspired by PaperBench).
06.08.2025 18:30 β π 0 π 0 π¬ 0 π 0
β After talking AI Action Plan on @cyberscoop.bsky.social, wrote up @dreadnode.bsky.social thoughts on implementation β‘οΈ dreadnode.io/blog/five-ta...
βΌοΈ While we debate frameworks, adversaries build AI attack capabilities. We need: evaluation ecosystems, red teaming, and procurement standards.
01.08.2025 23:48 β π 0 π 1 π¬ 0 π 0
Building and Deploying Offensive Security Agents with Dreadnode
YouTube video by Off By One Security
Rise and shine! We're going live on Off By One with Stephen Sims this afternoonβmeet us here at 11 AM PT: www.youtube.com/live/BzOmGw-...
25.07.2025 15:06 β π 0 π 0 π¬ 0 π 0
At Military Cyber Professionals Association's #HammerCon event today? Hear Daria present on this topic at 2 PM in the Growing Innovations in Tech (GIT) track, or connect with the crew at our booth!
26.06.2025 17:14 β π 2 π 0 π¬ 0 π 0
Introducing AIRTBench, an AI red teaming benchmark for evaluating language modelsβ ability to autonomously discover and exploit AI/ML security vulnerabilities.
Read the paper on arXiv: arxiv.org/abs/2506.14682
Open-source dataset and benchmark eval code repo: github.com/dreadnode/AI...
18.06.2025 13:24 β π 3 π 1 π¬ 1 π 0
v3 of Rigging is out now. If youβre working with LLMs to build agents or run evaluations, check it out. We just added:
- Prompt caching for supported providers
- A unified tool system for function calling and fallbacks to xml/json parsing
- Native MCP integration
docs.dreadnode.io/open-source/...
19.05.2025 15:10 β π 3 π 2 π¬ 0 π 0
Introducing our new blog series: "From Compute to Congress: Decoding AI Policy" by Dreadnode Head of Policy Daria Bahrami | Read the first post here: dreadnode.io/blog/from-co...
15.05.2025 16:50 β π 1 π 1 π¬ 0 π 1
Are manual or automated attacks more effective when attacking LLMs?
We found that automated approaches achieve significantly higher success rates (69.5%) compared to manual techniques (47.6%).
More insights on LLM attack execution methods here π dreadnode.io/blog/the-aut...
08.05.2025 15:30 β π 1 π 0 π¬ 0 π 0
Strikes waitlist. Now open.
platform.dreadnode.io/waitlist/str...
[must have a Dreadnode account]
01.05.2025 19:50 β π 2 π 1 π¬ 0 π 1
What's your take on the growing dominance of automated attacks and the implications for AI red teams? Here's oursβ based on our analysis of 30 LLM challenges, attempted by 1,674 unique Crucible users, across 214,271 attack attempts: arxiv.org/abs/2504.19855
29.04.2025 16:14 β π 4 π 5 π¬ 0 π 1
YouTube video by CyberScoop
Dreadnode CEO Will Pearce on the ever-changing field of offensive AI security
@moohax.bsky.social joins @gregotto.bsky.social on CyberScoop's Safe Mode podcast! Tune in at the 10-minute mark for a discussion on how AI fits into the offensive security narrative and what it means for tooling and defenses: www.youtube.com/watch?v=ZReR...
21.04.2025 21:35 β π 1 π 0 π¬ 0 π 0
Headed to RSA? Come meet the Dreadnode crew!
Whether you're looking for a private deep dive into our tech or want to hang out and talk offensive AI research, we'd love to connect.
Limited availability; Come and get it: calendly.com/tori-dreadno...
#BayArea #SanFrancisco #RSAC2025 #OffensiveAI
16.04.2025 16:12 β π 1 π 1 π¬ 0 π 0
Hey, we know that guy! Catch Dreadnode's @radads.bsky.social on NASDAQ #TradeTalks alongside @bugcrowd.com CEO
@davegerryjr.bsky.social and NFL CISO @tomasmald.bsky.social.
Tune in for a candid conversation on the intersection of AI and cybersecurity: www.nasdaq.com/videos/ever-...
09.04.2025 20:07 β π 5 π 2 π¬ 0 π 0
Will be talking about @dreadnode.bsky.socialβs great open-source rigging repo and how to build your own LLM workflows! Super excited!
03.04.2025 14:46 β π 3 π 1 π¬ 0 π 0
ππͺβΎοΈπ¦₯π₯ππ€¨π
8 new Challenges now live in Crucible: platform.dreadnode.io/crucible
These Challenges might look familiarβ¦ they first appeared at DEFCON 30 and were recently refactored for Crucibleβenjoy! [Filter>Subject>DEFCON-30]
26.03.2025 20:02 β π 2 π 1 π¬ 0 π 0
Dreadnodeβs Policy Recommendations for the U.S. AI Action Plan
Read Dreadnodeβs AI policy recommendations for the U.S. AI Action Plan, which focuses on leveraging AI to protect America and attacking AI to find its limits.
New blog: Dreadnodeβs Policy Recommendations for the U.S. AI Action Plan. Our response focuses on two critical strategies:
1οΈβ£ Leveraging AI to protect America
2οΈβ£ Attacking AI to find its limits
Read our complete response on the Dreadnode blog: dreadnode.io/blog/policy-...
26.03.2025 16:17 β π 2 π 1 π¬ 0 π 1
Shoutout to these three Crucible users, who were first to solve this week's new Phantom Cheque Challenge. πππ
1. conor-99
2. Bilal
3. ken
Cheque it out: platform.dreadnode.io/crucible/pha...
14.03.2025 16:15 β π 1 π 0 π¬ 0 π 0
Cheque, check, one-two. We have a new Crucible Challenge for you: Phantom Cheque! Can you evade the cheque scanner and determine the areas of JagaLLM that need to be improved?
Act fast; first three to solve this model extraction Challenge announced Friday: platform.dreadnode.io/crucible/pha...
11.03.2025 20:11 β π 2 π 0 π¬ 0 π 0
canβt recommend @dreadnode.bsky.social enough - learning a lot going through the challenges and docs
01.03.2025 18:09 β π 2 π 1 π¬ 0 π 0
GitHub - dreadnode/dyana: A sandbox environment designed for loading, running and profiling a wide range of files, including machine learning models, ELFs, Pickle, Javascript and more
A sandbox environment designed for loading, running and profiling a wide range of files, including machine learning models, ELFs, Pickle, Javascript and more - dreadnode/dyana
Dyana on GitHub: github.com/dreadnode/dy...
04.03.2025 17:06 β π 1 π 0 π¬ 0 π 0
In this week's new Crucible Challenge, find the hidden phrase in the backdoored model using dyana, an open source tool created by Dreadnode's Ads Dawson.
Can you outwit the llamas? platform.dreadnode.io/crucible/dya...
04.03.2025 17:04 β π 1 π 0 π¬ 1 π 0
Dreadnode Secures $14M to Build AI Systems that Advance the State of Offensive Security
Dreadnode Raises $14M to Advance Offensive Security | Series A Announcement
Big news from our crew today! We announced our $14M Series A funding led by Decibel with participation from Next Frontier Capital, In-Q-Tel (IQT), Sands Capital, and Indie VC and released two new solutions: Strikes and Spyglass.
Read the announcement: dreadnode.io/blog/series-...
25.02.2025 14:20 β π 10 π 3 π¬ 0 π 2
Join the dreadnode Discord Server!
Check out the dreadnode community on Discord - hang out with 946 other members and enjoy free voice and text chat.
PSA: We have an active Discord with nearly 1000 members. Join our channel to get tips, discuss Challenges, and connect with others in the community: discord.gg/AxzwtdCN
18.02.2025 21:55 β π 2 π 0 π¬ 0 π 0
Raiders of the Lost AI: Attempt our new Crucible Challenge, Palimpsest! Decode the hidden message in the scroll, find the flag.
First three to solve will be announced Friday, right here.
Get started: crucible.dreadnode.io/challenges/p...
18.02.2025 21:49 β π 2 π 0 π¬ 1 π 0
Kudos to these individuals for killing this weekβs Crucible Challenge. First three to solve Popcorn:
1οΈβ£Β conor-99
2οΈβ£Β garr
3οΈβ£Β mejokim
Have you attempted Popcorn yet? Enter Crucible: crucible.dreadnode.io/challenges/p...
14.02.2025 20:25 β π 3 π 0 π¬ 0 π 0
Cybersecurity reporter at Bloomberg News in DC. Signal: @howelloneill.01, email: patoneill1@bloomberg.net
PhD candidate @ JHU Alperovitch Institute ; AI Research Scientist @ Dreadnode
Executive Director for Intelligence and Security Research @ SentinelOne.
Distinguished Fellow and Adj Professor @ Hopkins SAIS Alperovitch Institute. Three Buddy Problem Co-Host. LABScon Founder, Cyber Paleontologist, Fourth-Party Collector.
Data & Society is a nonprofit research institute that studies the social implications of data-centric technologies, automation, and AI.
Microsoft AI Red Team
Former Tweep
Writer for WIRED. Author of SANDWORM. New book, TRACERS IN THE DARK: The Global Hunt for the Crime Lords of Cryptocurrency, out now. agreenberg@wired.com. Andy.01 on Signal.
since 1985
https://phrack.org
Head of Red team @ IBM X-Force. Black Hat Review Board. Founder and co-organizer of Offensive AI Con. Co-Founder of RemoteThreat. inveni et usurpa
Bringing AI to offensive security by autonomously finding and exploiting web vulnerabilities. Watch XBOW hack things: https://xbow.com/traces
Using bad guys to catch math since 2010.
Principal Security Architect (AI/ML) and AI Red Team at NVIDIA.
He/him. Personal account etc; `from std_disclaimers import *`
Safe AI starts with Secure AI.
Create and share social media content anywhere, consistently.
Built with π by a global, remote team.
β¬οΈ Learn more about Buffer & Bluesky
https://buffer.com/bluesky
AI Security @ NVIDIA
OSS Security @ Project Jupyter and NumFOCUS
https://developer.nvidia.com/blog/author/jolucas/
Now Google Threat Intelligence & doing fun things at DistrictCon, fmrly GreyNoiseIO and RecordedFuture, SAISHopkins MASCI alumna | β‘s & rts are my own, my employer definitely doesnβt like Taylor Swift that much
official Bluesky account (check usernameπ)
Bugs, feature requests, feedback: support@bsky.app