It seems like berries really are the LLM model’s weakness.
08.08.2025 00:29 — 👍 1 🔁 1 💬 1 📌 0@morecoffeeplz.bsky.social
AI Research scientist. Former OpenAI, Apple infosec. “Professor” at John’s Hopkins SAIS Alperovitch Institute. Great deceiver of hike length and difficulty.
It seems like berries really are the LLM model’s weakness.
08.08.2025 00:29 — 👍 1 🔁 1 💬 1 📌 0Human just outside the loop.
30.07.2025 19:57 — 👍 0 🔁 0 💬 0 📌 0i unironically think that the richard scarry pig who is a butcher who is also selling bacon and other pork products is an excellent metaphor for a lot of what is going on right now tbh
22.07.2025 15:42 — 👍 698 🔁 182 💬 1 📌 0A challenge with AI adoption is that organizations are not built to a Grand Plan where AI can just be slotted in by leaders, but rather socially constructed, semi-random & in flux
Here's an anecdote from a paper on how a CEO realized he didn't understand how things really worked (& that nobody did)
The thing that stands out to me about the latest code IDEs (windsurf, cursor, etc) isn’t that they get themselves into trouble - it’s that they can (and often do) get themselves OUT of trouble.
30.06.2025 21:42 — 👍 0 🔁 0 💬 0 📌 0I have literally never seen a graph like this once in all my years of being an election sicko
18-24 always lags, always, it’s just a natural law of politics
and this is the guy they want to throw out of the party?!
🤣🤣🤣
30.06.2025 04:17 — 👍 1 🔁 0 💬 0 📌 0🧵 THREAD: Why the Cato Institute’s new paper on “misinformation panic” is dangerously wrong, and why it completely misunderstands the real crisis we face.
www.cato.org/policy-analy...
I’m logging on from vacation in Tokyo to share this banger of a report comparing the cyber offensive acquisition strategies of China and the US.
www.atlanticcouncil.org/in-depth-res...
Introducing AIRTBench, an AI red teaming benchmark for evaluating language models’ ability to autonomously discover and exploit AI/ML security vulnerabilities.
Read the paper on arXiv: arxiv.org/abs/2506.14682
Open-source dataset and benchmark eval code repo: github.com/dreadnode/AI...
Infosec - but especially threat intel is a team sport
14.06.2025 17:13 — 👍 5 🔁 1 💬 0 📌 0A singular act of defiance that will never go out of style.
11.06.2025 00:22 — 👍 84 🔁 9 💬 2 📌 0This week's show is live on all platforms (Mikko Hypponen fills in for Juanito) @craiu.bsky.social @jags.bsky.social @mikko.bsky.social
securityconversations.com/episode/mikk...
Bellingcat has followed up its 2023 article on testing LLMs on geolocation with a new review of LLM's geolocation skills, with dramatically improved results www.bellingcat.com/resources/ho...
06.06.2025 07:51 — 👍 203 🔁 57 💬 5 📌 4If it cures the anxiety… 🤔
03.06.2025 16:55 — 👍 1 🔁 0 💬 0 📌 0One positive outcome of "vibe coding" becoming widespread across devs is that prototyping becomes widespread.
Until now, it was accepted that for most engineers, prototyping is kind of hard? So I saw relatively few do it.
Now: SO easy with these AI tools! Fewer excuses left?
Incredible tool to collect data for a finetune.
02.06.2025 21:12 — 👍 2 🔁 0 💬 0 📌 0👀…
02.06.2025 16:20 — 👍 0 🔁 0 💬 1 📌 0Whoever you are.
I see you LYING in your metadata and messing up my data.
I'm not mad, but I want you to know that I am writing a parser just. for. YOU.
How can we make informed choices based on performance AND energy when using AI in real-life tasks like question answering? By evaluating them and picking the models that optimize both factors!
Check out my new blog post on the subject:
huggingface.co/blog/sasha/e...
God cursed me with a lot of stuff but he did bless me with thinking oatmeal tastes absolutely decadent
21.05.2025 04:47 — 👍 78 🔁 5 💬 1 📌 5Tweet by Sam Bowman @sleepinyourhat If it thinks you're doing something egregiously immoral, for example, like faking data in a pharmaceutical trial, it will use command-line tools to contact the press, contact regulators, try to lock you out of the relevant systems, or all of the above.
welcome to the future, now your error-prone software can call the cops
(this is an Anthropic employee talking about Claude Opus 4)
"You read books, eh?" A 1949 red scare Herblock cartoon about pressures on teachers that hits hard once again.
21.05.2025 12:54 — 👍 17167 🔁 4984 💬 218 📌 182wife: how was guarding the two paths today, honey?
guard: [looking away] fine
wife: did something happen?
guard: [tearing up] no
wife: would the other guard tell me something happened?
Imagine starting a car that hadn't run in 21 years, that's 15 billion miles away in interstellar space. That's what the NASA team just did with Voyager's thrusters. People are amazing. jpl.nasa.gov/news/nasas-v...
17.05.2025 12:59 — 👍 9453 🔁 1748 💬 327 📌 165Will my skills pay the bills?!?
And by bills, I mean mortgage.
And by skills, I mean vibe coding.
A key element of the administration’s repurposing of wartime and counterterrorism framings to other ends is aesthetic.
Here ICE agents are kitted out like wannabe GWOT operators while conducting a raid in the badlands of the Berkshires.
www.berkshireeagle.com/breaking/ice...
There's a 2021 stealer log for a telemessage employee's hootsuite login. Other websites in the same stealer log (looks like websites the same machine visited) include readcomiconline[.]to and kissanime[.]ru. For national security reasons, Anime was a mistake
06.05.2025 01:40 — 👍 46 🔁 5 💬 5 📌 0This is incredible news! Every API should offer this type of endpoint. If someone finds a leaked API key, let them report it back through the API safely.
"Credential revocation API to revoke exposed PATs is now generally available"
github.blog/changelog/20...