Adrian Thinnyun's Avatar

Adrian Thinnyun

@adrianthinnyun.com.bsky.social

Horizon Junior Fellow, Center for Security and Emerging Technology (CSET) https://cset.georgetown.edu/staff/adrian-thinnyun/

78 Followers  |  61 Following  |  12 Posts  |  Joined: 26.11.2024  |  1.5684

Latest posts by adrianthinnyun.com on Bluesky

My last recommendation is to support the development of evaluations for AI capabilities and risks. The AI Action Plan already includes this, but it should go one step further and consider restricting models that fail to meet industry-standard performance levels of safety. (7/7)

28.07.2025 15:13 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

My second recommendation is to push AI companies to share safety-relevant knowledge with each other and other relevant stakeholders. This would involve mandating reporting requirements, disclosure of unexpected capabilities in new models, and sharing threat intelligence. (6/7)

28.07.2025 15:13 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

My first recommendation is to require AI companies to adhere to their own risk management plans. Companies like OpenAI and Anthropic have already published frameworks describing their planned risk mitigations, but these need to be made legally binding to have any effect. (5/7)

28.07.2025 15:13 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

At the same time, AI is advancing too rapidly for government to keep up with traditional regulation. The solution is to promote industry self-regulation – make AI companies figure out the best way to keep their products safe and then make sure they actually follow through. (4/7)

28.07.2025 15:13 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

It's true that we need to increase AI adoption, but quite simply – people don't want to use things that aren't guaranteed to work! There's still too many hallucinations, security concerns, and other liabilities for companies to feel confident relying on AI for important tasks. (3/7)

28.07.2025 15:13 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Why Donald Trump’s AI Strategy Needs More Safeguards Like nuclear energy, AI is a transformative technology that could face a severe backlash if the right precautions are not taken.

Check it out here: (2/7)

28.07.2025 15:13 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Trump's AI Action Plan promises to put AI innovation first and safety second, but this is a false dichotomy. In my new piece for The National Interest, I explain why innovation can't happen without safety, and how government can help industry regulate itself. 🧡 (1/7)

28.07.2025 15:13 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
Preview
CSET's Recommendations for an AI Action Plan | Center for Security and Emerging Technology In response to the Office of Science and Technology Policy's request for input on an AI Action Plan, CSET provides key recommendations for advancing AI research, ensuring U.S. competitiveness, and max...

Check out the full response to the RFI here: [4/4]
cset.georgetown.edu/publication/...

17.03.2025 20:34 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

These standards, if implemented, would go a long way towards mitigating the potential risks of AI and increasing public trust and confidence in using it, allowing us to realize its benefits sooner than we could otherwise. [3/4]

17.03.2025 20:34 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Specifically, AISI should develop standards on topics such as model training, pre-release internal & external security testing, cybersecurity practices, if-then commitments, AI risk assessments, and processes for testing and re-testing systems as they change over time. [2/4]

17.03.2025 20:34 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Earlier today, @csetgeorgetown.bsky.social published our recommendations for the U.S. AI Action Plan. One recommendation that I personally contributed was that the U.S. government should develop and adopt standards to mitigate risks from AI. What kind of standards? Read below: 🧡 [1/4]

17.03.2025 20:34 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

AI & emerging tech policy is going to experience rapid development over the next four years. The best way you can keep up-to-date on the latest changes is by following my colleagues at @csetgeorgetown.bsky.social and checking out the CSET Starter Pack: bsky.app/starter-pack...

22.01.2025 23:04 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@adrianthinnyun.com is following 20 prominent accounts