Earlier this month I had a great conversation about #AI and #security over lunch with Apolline Rolland. You can read about it here virtual-routes.org/ai-over-lunc...
Hopefully you can find some useful insights in my (very well edited) ramblings!
@dr-bly.bsky.social
AI Safety and Security. Fellow @ CSET | Georgetown. CS/AI PhD. Nerd.
Earlier this month I had a great conversation about #AI and #security over lunch with Apolline Rolland. You can read about it here virtual-routes.org/ai-over-lunc...
Hopefully you can find some useful insights in my (very well edited) ramblings!
This is an awesome opportunity.
Come work with awesome researchers (and me, too), tackle the thorniest debates in AI, and make real impact!
Yesterday I taught my first class. I'm officially a teacher (a professor, even)! I'm very excited to be teaching AI policy to undergrads at Georgetown this semester.
03.09.2025 20:29 β π 1 π 0 π¬ 0 π 0CSET is hiring! Be sure to apply by Tuesday!
Join our data team as a Data Research Analyst to contribute to data-driven research products and policy analysis at the forefront of national security and tech policy.
cset.georgetown.edu/job/data-res...
Banning state-level AI regulation is a bad idea!
One crucial reason is that states play a critical role in building AI governance infrastructure.
Check out this new op-ed by @jessicaji.bsky.social, myself, and @minanrn.bsky.social on this topic!
thehill.com/opinion/tech...
2 weeks left on this open funding call on risks from internal deployments of frontier AI modelsβsubmissions are due June 30.
Expressions of interest only need to be 1-2 pages, so still time to write one up!
Full details: cset.georgetown.edu/wp-content/u...
π‘Funding opportunityβshare with your AI research networksπ‘
Internal deployments of frontier AI models are an underexplored source of risk. My program at @csetgeorgetown.bsky.social just opened a call for research ideasβEOIs due Jun 30.
Full details β‘οΈ cset.georgetown.edu/wp-content/u...
Summary β¬οΈ
A bar chart of zodiac signs among popes. Aries: 4, Taurus: 7, Gemini: 5, Cancer: 4, Leo: 4, Virgo: 4, Libra: 4, Scorpio: 4, Sagittarius: 5, Capricorn: 5, Aquarius: 4, Pisces: 8
Pope Leo XIV is a rare Virgo β
08.05.2025 18:45 β π 1 π 0 π¬ 0 π 0I've been keeping a list of organizations that do AI red-teaming. Sources are blog and job postings. Definitely an incomplete list. Potentially out of date. I might do a write-up about it. docs.google.com/document/d/1...
30.04.2025 19:06 β π 1 π 0 π¬ 0 π 0"We all rely on science [...] Businesses and farmers rely on science and engineering for product innovation, technological advances, and weather forecasting. Science helps humanity protect the planet and keeps pollutants and toxins out of our air, water, and food."
docs.google.com/document/d/1...
The audience asked a ton of great questions. We couldn't get to all of them, but I'll be reading through them all and will try to answer some in upcoming research!
26.03.2025 18:22 β π 0 π 0 π¬ 0 π 0ICYMI: The CSET webinar on AI red-teaming has been recorded!
www.youtube.com/watch?v=gDnN...
Watch this for a great discussion on what AI red-teaming is, how different organizations do it, and how it can be improved!
Huge thanks to the panelists, moderator, and audience!
Almost everything we care about in AI comes down to evaluations, including #redteaming.
Jessica Ji's post lays out a path for making AI red-teaming better. cset.georgetown.edu/article/how-...
Starting in 5 π
25.03.2025 15:55 β π 1 π 1 π¬ 0 π 0Here's the report: chatgpt.com/share/67db85...
It shows the prompt I used, some follow-up questions ChatGPT asked, and my attempt to get some decent pun-based names for the bracket.
A filled march madness bracket showing the first seed teams (Duke, Houston, Auburn, and Florida) going to the final four, and showing Duke going all the way.
π I know nothing about college basketball, so I decided to be the avatar for ChatGPT in the @csetgeorgetown.bsky.social #MarchMadness tournament. I used Deep research on o3-mini-high to generate a detailed report and analysis of the tournament. The resulting bracket is very conservative. Go Humans!
20.03.2025 16:08 β π 0 π 0 π¬ 1 π 0β¨ NEW: How can the U.S. stay ahead in AI?
OSTP called for input on developing an βAI Action Plan.β Our response outlines the steps the U.S. should take to:
1οΈβ£Secure and advance its AI leadership
2οΈβ£Navigate competition with China
3οΈβ£Realize AIβs benefits and avoid its risks
Why does #AI red-teaming suck? How can we make it suck less?
All this and more at the next CSET webinar: cset.georgetown.edu/event/whats-...
Join me, the Director of Microsoft's AI Red Team, the MITRE ATLAS Lead, and the Director of Apollo Research. This will be an awesome conversation.
What: CSET Webinar πΊ
When: Tuesday, 3/25 at 12PM ET π
Whatβs next for AI red-teaming? And how do we make it more useful?
Join Tori Westerhoff, Christina Liaghati, Marius Hobbhahn, and CSET's @dr-bly.bsky.social * @jessicaji.bsky.social for a great discussion: cset.georgetown.edu/event/whats-...
I gave my takes from the Paris AI Action Summit to @thecipherbrief.bsky.social last week. I heard three Nobel Laureates warn about the risks of AI, while VP Vance was "not here ... to talk about AI safety" and focused on opportunity. I think we can innovate on AI without building unsafe products.
18.02.2025 19:18 β π 1 π 0 π¬ 0 π 0@miahoffmann.bsky.social , @ojdaniels.bsky.social, and I wrote a piece on key AI governance areas to watch in 2025 with the upcoming AI Action Summit in mind. Check it out here! thebulletin.org/2025/02/will...
07.02.2025 03:00 β π 5 π 3 π¬ 0 π 0We're hiring π’
CSET is looking for a Research Fellow to analyze topics related to the development, deployment, and operations of AI & ML tools in the national security space.
Interested or know someone who would be? Learn more and apply π cset.georgetown.edu/job/research...
CSET is hiring π’
Weβre looking for our next Director of Analysis to lead our research agenda & manage CSET's talented team of researchers.
Interested or know someone who would be? Learn more and apply π cset.georgetown.edu/job/director...
I'm packing my bags for an exciting week in Paris! I'll be contributing to the Paris Peace Forum's AI-Cyber Nexus discussions, and I'll be attending @iaseai.bsky.social '25 and the Paris AI Security Forum. I'm registered for a few other events, as well (if I can find the time and energy to attend).
03.02.2025 20:25 β π 1 π 0 π¬ 0 π 0The fact sheet [1] on Pres. Trump's AI Executive Order calls out the administration's previous work on regulating AI with Kratsios's Op-Ed on "AI that Reflects American Values". I think there's an RLHF-to-CoT narrative that highlights the value of safety in AI.
1: www.whitehouse.gov/fact-sheets/...
Just as "we don't need to choose between freedom and technology", we don't need to choose between safety and innovation in AI. The development of voluntary standards, best practices, and safeguards for AI makes AI better and easier to adopt.
29.01.2025 18:33 β π 2 π 0 π¬ 1 π 0If you're interested in AI + Policy you need to follow my colleagues at @csetgeorgetown.bsky.social
We have a starter pack: bsky.app/starter-pack...
β¨ποΈNew interactive toolποΈβ¨
Use the latest @emergingtechobs.bsky.social resource to track AI governance from D.C. to Beijing.
ETO AGORA is a living collection of AI-relevant laws, regulations, standards, and other governance documents from the U.S. and around the world π
Workshop Announcement π―
On Dec 4 (1 day after AISIC Plenary) @csetgeorgetown.bsky.social is hosting a workshop on the future of AI testing.
Workshop will run from 9am-1pm on Georgetown's Capitol Campus.
If you have experience testing AI systems please reach out to me for more details!