Feel free to reach out if you wanna chat AI/ML security. Contact info on my website: sshussain.me
12.09.2025 00:10 β π 0 π 0 π¬ 0 π 0
@suhacker.bsky.social
AI/ML security https://suhacker.ai
Feel free to reach out if you wanna chat AI/ML security. Contact info on my website: sshussain.me
12.09.2025 00:10 β π 0 π 0 π¬ 0 π 0It was wonderful to help AI/ML security at the company evolve from a summer internship project to an established practice. Not only did I get to work on impactful and interesting audits, research, and engineering projects, but I also got to learn from some truly brilliant people.
12.09.2025 00:10 β π 1 π 0 π¬ 1 π 0After a little over 5 years at Trail of Bits, I have decided to move on. Iβm exceptionally excited about this new chapter. Thereβs so much more work to be done in securing AI/ML systems and Iβm looking forward to what's ahead.
12.09.2025 00:10 β π 0 π 0 π¬ 1 π 0What if you sent a seemingly harmless image to an LLM and it suddenly exfiltrated your data? Check out our new blog post where we break AI systems by crafting images that reveal prompt injections when downscaled. Weβre also releasing a tool to try this attack. blog.trailofbits.com/2025/08/21/w...
21.08.2025 17:36 β π 0 π 0 π¬ 0 π 0it delegates to the code execution agent via the orchestrator! delegation is done by the web surfing agent to the orchestrator then to the code execution agent. we should make that sentence less confusing! earlier in the post, mas hijacking is defined as prompt injection targeting MAS control flow.
18.08.2025 04:34 β π 0 π 0 π¬ 0 π 0So, we wrote a neural net library entirely in LaTeX...
01.04.2025 12:29 β π 81 π 15 π¬ 3 π 4KNN + topic detection getting a big glow-up www.anthropic.com/research/clio
13.12.2024 12:06 β π 51 π 9 π¬ 3 π 1Rather than trying to do advent of code, I'm doing advent of papers!
jimmyhmiller.github.io/advent-of-pa...
Hopefully I can read and share some of weirder computer related papers.
First paper is Elephant 2000 by John McCarthy. Did you know he didn't just make lisp? Wonderful paper, worth a read.
trying to explain the OSI model to an american: imagine if a burger had 7 patties
08.12.2024 13:35 β π 3143 π 232 π¬ 180 π 30(someone used a carefully crafted branch name to inject a crypto miner into a popular Python package: github.com/ultralytics/...)
06.12.2024 03:28 β π 247 π 55 π¬ 5 π 8Someone tried to reply to my blog post about avoiding PGP with anti-furry hate, so now I have to edit it to include more furry stickers.
soatok.blog/2024/11/15/w...
Women in AI: Heidy Khlaaf, safety engineering director at Trail of Bits
10.03.2024 12:34 β π 13 π 5 π¬ 0 π 0My team at Trail of Bits added modules for modular analysis, polyglots, and PyTorch to Fickling, a pickle security tool tailored for ML use cases.
Fun Fact: Fickling can now differentiate and identify the various PyTorch file formats out there.
blog.trailofbits.com/2024/03/04/r...
Thinking about Dan Kaminsky's quote this morning about the necessary lies we tell ourselves about computers. Specifically, the myth of boundaries between users. Great write-up by @lhn.bsky.social on the "LeftoverLocals" GPU vuln. Nice work by the Trail of Bits team.
16.01.2024 17:40 β π 27 π 9 π¬ 0 π 0Specifically, int.to_bytes and int.from_bytes default to big-endian, since py3.11. Previously, you had to explicitly specify which you wanted.
I wanted LE but forgot to specify, and my code failed in really non-obvious ways...
I got to work on a security review of the YOLOv7 vision model. The blog post and report are out now!
Fun fact: There are TorchScript model differentials!
blog.trailofbits.com/2023/11/15/a...
I presented at HackLu about oddities of existing file formats and lessons learned along the way.
Consider it a teaser, as I presented 1/3 of the slide deck (to be released soon).
www.youtube.com/watch?v=6OJ9...
Neopets taught so many kids how to code, but it taught me how to hack the system by creating multiple accounts and transferring items just up to the limit where you wouldnβt get caught. And anyway, today Iβm a cyber lawyer.
17.10.2023 01:47 β π 20 π 4 π¬ 2 π 0Hi, Iβd like to return these turtles. They donβt do karate
13.10.2023 00:49 β π 283 π 78 π¬ 9 π 1Tweet from Mike Conover with a slide listing Top information sources for Al Engineers, courtesy of @barrmanas & @AmplifyPartners. | NEWSLETTERS 1. Import AI 2. arXiv roundup 3. The Batch PODCASTS 1. Latent Space 2. Gradient Descent 3. The Cognitive Revolution 4. The Gradient COMMUNITIES 1. Hacker News 2. OpenAI Discord 3. LangChain Discord 4. HuggingFace discussions
These lists may be useful for those of us trying to develop an alternative to ML Twitter, now that it's 40% influencer spam and 20% a war between sci-fi subcultures. I'm on some of these discords and reading some of these newsletters, but I think I'll add 2 or 3 more. #MLsky #cssky
10.10.2023 20:55 β π 14 π 2 π¬ 2 π 0Enormous thank you to PyData Amsterdam for inviting me to keynote at a beautiful venue! Slides and notes from my talk, "Build and keep your context window" are all here: vickiboykis.com/2023/09/13/b...
14.09.2023 12:10 β π 35 π 7 π¬ 0 π 0See https://www.explainxkcd.com/wiki/index.php/2044:_Sandboxing_Cycle#Transcript
I think about this a lot xkcd.com/2044/
13.09.2023 21:51 β π 139 π 22 π¬ 5 π 0ICYMI: This is **critical** work for AI ethics / safety / security / regulation right now: Verifying that a model is fitted on a given dataset.
https://arxiv.org/abs/2307.00682
Iβve conjectured this for years, but seeing Papernot and Shumailov on the paper makes me feel really confident in the findings: https://arxiv.org/abs/2305.17493
Existential risk ππππ
Screenshot of a tweet from @ huggingface on twitter; text reads: "We are looking into an incident where a malicious user took control over the Hub organizations of Meta/Facebook & Intel via reused employee passwords that were compromised in a data breach on another site. We will keep you updated π€"
So remember the "mango pudding" LLM backdooring attack? How safe do you feel using these models now?
03.07.2023 13:40 β π 2 π 1 π¬ 1 π 0