Phillip Misner's Avatar

Phillip Misner

@phillipmisner.bsky.social

Head of AI Incident Detection & Response @ MSFT, ecosystem & customer advocate, incident responder, PSIRT enthusiast, safety & security.

62 Followers  |  56 Following  |  4 Posts  |  Joined: 20.11.2024  |  1.26

Latest posts by phillipmisner.bsky.social on Bluesky


Preview
Safeguarding AI against โ€˜jailbreaksโ€™ and other prompt attacks How Microsoft is helping developers mitigate the risk of prompt attacks on generative AI applications.

AI jailbreaks are a common concern where attackers can influence the outcome of generative AI models. This week we released more guidance on how developers can protect against these threats: news.microsoft.com/source/featu...

05.12.2024 23:50 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

more information at aka.ms/zerodayquest

05.12.2024 23:48 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

At Microsoft Ignite in late November, Satya announced the Zero Day Quest. Based on the bounty programs, this new 2-stage event focuses on cloud & AI research. Targets are scoped to the bounty program & AI safety research is out-of-bounds, but this is an important step in the maturity of the tech.

05.12.2024 23:48 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

hello world!

20.11.2024 23:29 โ€” ๐Ÿ‘ 3    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

@phillipmisner is following 20 prominent accounts