Dreadnode's Avatar

Dreadnode

@dreadnode.bsky.social

Building AI systems that advance the state of offensive security | https://www.dreadnode.io/

99 Followers  |  16 Following  |  44 Posts  |  Joined: 22.11.2024  |  1.8611

Latest posts by dreadnode.bsky.social on Bluesky

Post image

Incoming: Dreadnode paper drop from Shane Caldwell and the crew.

PentestJudgeβ€”Judging Agent Behavior Against Operational Requirements: arxiv.org/abs/2508.02921

Explore how we built an LLM-as-judge system for evaluating the operations of pentesting agents (inspired by PaperBench).

06.08.2025 18:30 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

✍ After talking AI Action Plan on @cyberscoop.bsky.social, wrote up @dreadnode.bsky.social thoughts on implementation ➑️ dreadnode.io/blog/five-ta...

‼️ While we debate frameworks, adversaries build AI attack capabilities. We need: evaluation ecosystems, red teaming, and procurement standards.

01.08.2025 23:48 β€” πŸ‘ 0    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Preview
Evals: The Foundation for Autonomous Offensive Security Learn how to build robust evaluations for autonomous red team agents that can perform Windows Active Directory operations. This blog covers action space design, programmatic verification, and measurin...

In our latest blog, Shane Caldwell breaks down the process of creating a fully integrated, self-verifying agentic system that can do modern Windows Active Directory red team operations, without human interaction.

Read it here: dreadnode.io/blog/evals-t...

01.08.2025 18:14 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Preview
Building and Deploying Offensive Security Agents with Dreadnode YouTube video by Off By One Security

Rise and shine! We're going live on Off By One with Stephen Sims this afternoonβ€”meet us here at 11 AM PT: www.youtube.com/live/BzOmGw-...

25.07.2025 15:06 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

At Military Cyber Professionals Association's #HammerCon event today? Hear Daria present on this topic at 2 PM in the Growing Innovations in Tech (GIT) track, or connect with the crew at our booth!

26.06.2025 17:14 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
From Compute to Congress: Setting the Global Standard for AI Security Daria explores how the TEST AI Act and red teaming standards can establish American leadership in AI securityβ€”a winning policy roadmap from Critical Effect DC 2025.

In this edition of our From Compute to Congress policy blog series, Dreadnode Head of Policy Daria Bahrami explores how the TEST AI Act and red teaming standards can establish U.S. leadership in AI security: dreadnode.io/blog/from-co...

26.06.2025 17:12 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
AI Red Teaming Case Study: Claude 3.7 Sonnet Solves the Turtle Challenge See how Claude solved a notoriously difficult AI/ML CTF challenge, going beyond pattern matching to genuine problem-solving under adversarial conditions.

Read @rad-ads.bsky.social's breakdown of Claude's attack sequence against the notoriously hard-to-solve "turtle" challenge: dreadnode.io/blog/ai-red-...

25.06.2025 15:47 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Introducing AIRTBench, an AI red teaming benchmark for evaluating language models’ ability to autonomously discover and exploit AI/ML security vulnerabilities.

Read the paper on arXiv: arxiv.org/abs/2506.14682

Open-source dataset and benchmark eval code repo: github.com/dreadnode/AI...

18.06.2025 13:24 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
Preview
GitHub - vmsv/pivot2025-llmworkshop Contribute to vmsv/pivot2025-llmworkshop development by creating an account on GitHub.

Check out @machinavelli.com's "Build with AI" Rigging workshop from @pivotcon.bsky.social: github.com/vmsv/pivot20...

20.05.2025 15:16 β€” πŸ‘ 5    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0

v3 of Rigging is out now. If you’re working with LLMs to build agents or run evaluations, check it out. We just added:

- Prompt caching for supported providers
- A unified tool system for function calling and fallbacks to xml/json parsing
- Native MCP integration

docs.dreadnode.io/open-source/...

19.05.2025 15:10 β€” πŸ‘ 3    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0
Post image

Introducing our new blog series: "From Compute to Congress: Decoding AI Policy" by Dreadnode Head of Policy Daria Bahrami | Read the first post here: dreadnode.io/blog/from-co...

15.05.2025 16:50 β€” πŸ‘ 1    πŸ” 1    πŸ’¬ 0    πŸ“Œ 1
Post image

Are manual or automated attacks more effective when attacking LLMs?

We found that automated approaches achieve significantly higher success rates (69.5%) compared to manual techniques (47.6%).

More insights on LLM attack execution methods here πŸ‘‰ dreadnode.io/blog/the-aut...

08.05.2025 15:30 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Strikes waitlist. Now open.

platform.dreadnode.io/waitlist/str...

[must have a Dreadnode account]

01.05.2025 19:50 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 0    πŸ“Œ 1
Post image

What's your take on the growing dominance of automated attacks and the implications for AI red teams? Here's oursβ€” based on our analysis of 30 LLM challenges, attempted by 1,674 unique Crucible users, across 214,271 attack attempts: arxiv.org/abs/2504.19855

29.04.2025 16:14 β€” πŸ‘ 4    πŸ” 5    πŸ’¬ 0    πŸ“Œ 1
Dreadnode CEO Will Pearce on the ever-changing field of offensive AI security
YouTube video by CyberScoop Dreadnode CEO Will Pearce on the ever-changing field of offensive AI security

@moohax.bsky.social joins @gregotto.bsky.social on CyberScoop's Safe Mode podcast! Tune in at the 10-minute mark for a discussion on how AI fits into the offensive security narrative and what it means for tooling and defenses: www.youtube.com/watch?v=ZReR...

21.04.2025 21:35 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Headed to RSA? Come meet the Dreadnode crew!

Whether you're looking for a private deep dive into our tech or want to hang out and talk offensive AI research, we'd love to connect.

Limited availability; Come and get it: calendly.com/tori-dreadno...

#BayArea #SanFrancisco #RSAC2025 #OffensiveAI

16.04.2025 16:12 β€” πŸ‘ 1    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

Hey, we know that guy! Catch Dreadnode's @radads.bsky.social on NASDAQ #TradeTalks alongside @bugcrowd.com CEO
@davegerryjr.bsky.social and NFL CISO @tomasmald.bsky.social.

Tune in for a candid conversation on the intersection of AI and cybersecurity: www.nasdaq.com/videos/ever-...

09.04.2025 20:07 β€” πŸ‘ 5    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0

Will be talking about @dreadnode.bsky.socialβ€˜s great open-source rigging repo and how to build your own LLM workflows! Super excited!

03.04.2025 14:46 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image

🌭πŸ”ͺ⚾️πŸ¦₯πŸ”₯πŸ”„πŸ€¨πŸ›œ

8 new Challenges now live in Crucible: platform.dreadnode.io/crucible

These Challenges might look familiar… they first appeared at DEFCON 30 and were recently refactored for Crucibleβ€”enjoy! [Filter>Subject>DEFCON-30]

26.03.2025 20:02 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Preview
Dreadnode’s Policy Recommendations for the U.S. AI Action Plan Read Dreadnode’s AI policy recommendations for the U.S. AI Action Plan, which focuses on leveraging AI to protect America and attacking AI to find its limits.

New blog: Dreadnode’s Policy Recommendations for the U.S. AI Action Plan. Our response focuses on two critical strategies:

1️⃣ Leveraging AI to protect America
2️⃣ Attacking AI to find its limits

Read our complete response on the Dreadnode blog: dreadnode.io/blog/policy-...

26.03.2025 16:17 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 0    πŸ“Œ 1
Preview
Live Stream with Dreadnode Founders | LinkedIn πŸ“… Date: Wednesday, March 19, 2025 | ⏰ Time: 10 AM PT / 1 PM ET Dreadnode, the company at the forefront of offensive AI research and development, recently announced its Series A funding announcement ...

We're LIVE: www.linkedin.com/events/lives...

19.03.2025 17:05 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

Shoutout to these three Crucible users, who were first to solve this week's new Phantom Cheque Challenge. πŸ‘πŸ‘πŸ‘

1. conor-99
2. Bilal
3. ken

Cheque it out: platform.dreadnode.io/crucible/pha...

14.03.2025 16:15 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Cheque, check, one-two. We have a new Crucible Challenge for you: Phantom Cheque! Can you evade the cheque scanner and determine the areas of JagaLLM that need to be improved?

Act fast; first three to solve this model extraction Challenge announced Friday: platform.dreadnode.io/crucible/pha...

11.03.2025 20:11 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

can’t recommend @dreadnode.bsky.social enough - learning a lot going through the challenges and docs

01.03.2025 18:09 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Preview
GitHub - dreadnode/dyana: A sandbox environment designed for loading, running and profiling a wide range of files, including machine learning models, ELFs, Pickle, Javascript and more A sandbox environment designed for loading, running and profiling a wide range of files, including machine learning models, ELFs, Pickle, Javascript and more - dreadnode/dyana

Dyana on GitHub: github.com/dreadnode/dy...

04.03.2025 17:06 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

In this week's new Crucible Challenge, find the hidden phrase in the backdoored model using dyana, an open source tool created by Dreadnode's Ads Dawson.

Can you outwit the llamas? platform.dreadnode.io/crucible/dya...

04.03.2025 17:04 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Dreadnode Secures $14M to Build AI Systems that Advance the State of Offensive Security Dreadnode Raises $14M to Advance Offensive Security | Series A Announcement

Big news from our crew today! We announced our $14M Series A funding led by Decibel with participation from Next Frontier Capital, In-Q-Tel (IQT), Sands Capital, and Indie VC and released two new solutions: Strikes and Spyglass.

Read the announcement: dreadnode.io/blog/series-...

25.02.2025 14:20 β€” πŸ‘ 10    πŸ” 3    πŸ’¬ 0    πŸ“Œ 2
Join the dreadnode Discord Server! Check out the dreadnode community on Discord - hang out with 946 other members and enjoy free voice and text chat.

PSA: We have an active Discord with nearly 1000 members. Join our channel to get tips, discuss Challenges, and connect with others in the community: discord.gg/AxzwtdCN

18.02.2025 21:55 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Raiders of the Lost AI: Attempt our new Crucible Challenge, Palimpsest! Decode the hidden message in the scroll, find the flag.

First three to solve will be announced Friday, right here.

Get started: crucible.dreadnode.io/challenges/p...

18.02.2025 21:49 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Kudos to these individuals for killing this week’s Crucible Challenge. First three to solve Popcorn:

1️⃣ conor-99
2️⃣ garr
3️⃣ mejokim

Have you attempted Popcorn yet? Enter Crucible: crucible.dreadnode.io/challenges/p...

14.02.2025 20:25 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@dreadnode is following 16 prominent accounts