@pamelafox.bsky.social

1,482 Followers  |  143 Following  |  482 Posts  |  Joined: 29.10.2024  |  2.1028

Latest posts by pamelafox.bsky.social on Bluesky

Comparison of outputs from two models

I am testing out the GPT-5 models for our RAG apps, and I'm impressed so far: gpt-5-mini avoided a hallucination that every other model generated. Much better for a RAG app to say it doesn't know than to hallucinate!

07.08.2025 18:18 - 👍 12    🔁 1    💬 0    📌 1

Update: I was using gpt-5-chat, which is NOT a reasoning model. Trying out the other ones now.

07.08.2025 17:48 - 👍 2    🔁 0    💬 0    📌 0

I'm trying out gpt-5-chat now, and am confused as to whether it's a reasoning model.
It does *not* take in a reasoning effort parameter, and I haven't seen reasoning tokens in any of the outputs so far. But I thought they said it sometimes reasons?

07.08.2025 17:39 - 👍 2    🔁 0    💬 1    📌 0

GPT-5 is rolling out to VS Code today!

Starting today, GPT-5 is rolling out to all paid GitHub Copilot plans. GPT-5 is OpenAI's most capable model yet, bringing new advances in reasoning, coding, and chat.

Learn more about the GPT-5 model availability: github.blog/changelog/20...

07.08.2025 17:06 - 👍 16    🔁 7    💬 1    📌 1
Introducing GPT-5
YouTube video by OpenAI

GPT-5 announcement happening now:

07.08.2025 17:06 - 👍 0    🔁 0    💬 0    📌 0

how slow did it run? on my M1 with 16GB RAM, it took ~10s per token. another person with M2 and 16GB saw ~4s per token. my colleague with M4 and 64GB RAM got all the way to 40 tokens/s.
(i got so bored i turned it off after 6 tokens but clearly I have patience problems)

05.08.2025 22:38 - 👍 1    🔁 0    💬 3    📌 0
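
The speeds in the post above mix two units (seconds per token and tokens per second). A back-of-the-envelope sketch that puts them on one scale, using only the rough figures reported in the thread; the labels and the function name are made up for illustration:

```python
# Rough figures quoted in the thread; labels and names are illustrative only.
SECONDS_PER_TOKEN = {"M1 16GB": 10.0, "M2 16GB": 4.0}  # reported as latency
TOKENS_PER_SECOND = {"M4 64GB": 40.0}                  # reported as throughput


def to_tokens_per_second(seconds_per_token: float) -> float:
    """Convert a latency (seconds per token) into a throughput (tokens per second)."""
    return 1.0 / seconds_per_token


# Put every machine on the same tokens-per-second scale.
throughput = {name: to_tokens_per_second(s) for name, s in SECONDS_PER_TOKEN.items()}
throughput.update(TOKENS_PER_SECOND)

# Compare each machine against the slowest one reported (the M1).
baseline = throughput["M1 16GB"]
for name, tps in throughput.items():
    print(f"{name}: {tps:.2f} tokens/s ({tps / baseline:.1f}x the M1)")
```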

Macbook Air M2 (16GB RAM). No GPU. About 4 seconds per token.

05.08.2025 20:53 - 👍 1    🔁 1    💬 1    📌 0

sounds like M2 is 2x speed of my M1. I cut it off after 6 tokens as I couldn't deal with it. Also btop looks even cooler than asitop! thanks for sharing.

05.08.2025 21:26 - 👍 0    🔁 0    💬 0    📌 0
Screenshot of ollama running while asitop displayed in other tab

I tried out the new OpenAI gpt-oss:20b using Ollama on my Mac M1 (16GB RAM).
It was a 16 GB download, and required ~10 seconds per token, once it finally started thinking.
Not practical for use on my machine. Anyone else tried it?

05.08.2025 17:54 - 👍 4    🔁 0    💬 2    📌 0

Thank you so much for replying! They board at the El Cerrito Plaza, I've seen them twice now at that station. I'll text police if I see them again.

05.08.2025 17:16 - 👍 0    🔁 0    💬 0    📌 0

I haven't talked to my older brother since learning who he voted for, and his reasons for his vote. I feel slightly bad since he's since lost his job and is harassing our mum for money, but I just can't motivate myself to put effort into helping him.

05.08.2025 17:15 - 👍 2    🔁 0    💬 0    📌 0
Have you ever pushed a hard-coded key to a repo?
YouTube video by Microsoft Developer

"Have you ever pushed a hard-coded key to a repo?"

www.youtube.com/shorts/WffwL...

05.08.2025 00:52 - 👍 4    🔁 2    💬 0    📌 0

sent to my brother in mission, but agreed on nextdoor posting. good luck, hope she's found soon.

05.08.2025 00:08 - 👍 27    🔁 0    💬 1    📌 0

Prepping a talk demo based on my struggles to find a good garden hose.
SERIOUSLY THEY ALL BREAK. MY HEART.
Does anyone have a good hose rec? I will pay money if it actually won't break.

05.08.2025 00:03 - 👍 1    🔁 1    💬 2    📌 0

so cute! and dang that's cool that your library has a laser cutter. i miss my days as a techshop member.

04.08.2025 22:33 - 👍 1    🔁 0    💬 1    📌 0

yeah luckily we have both a fall blockparty on our street and a halloween "blockparty" (i.e. street closure so folks can walk it)
I feel a lil guilty closing down the street for halloween since not everyone participates in it, but I do email our neighbors list about it, so hopefully okay..

04.08.2025 20:34 - 👍 1    🔁 0    💬 1    📌 0

There's someone in our town who rides the BART train with a sheathed sword - is that allowed in California?
I was trying to be okay with it, but then they un-sheathed the sword (to check it was still there??) while the train was moving and they were standing up. That seems pretty dangerous, no?

04.08.2025 20:20 - 👍 0    🔁 0    💬 2    📌 0

AUSTIN, Texas (AP) - Texas House falls short of quorum after Democrats leave state to prevent Trump-backed redraw of US House maps.

04.08.2025 20:07 - 👍 1482    🔁 267    💬 32    📌 41

love it! I wish every night was National Night Out, block parties are my fav.

04.08.2025 20:10 - 👍 1    🔁 0    💬 1    📌 0

Purchased the markers! (I'm curious if Oakland teachers also use DonorsChoose, as that's how I typically support local teachers, but I know Amazon wishlists are popular for hyper-local supplies-fund-raising)

04.08.2025 20:09 - 👍 0    🔁 0    💬 0    📌 0

Oakland high schools need supplies, either donated physically at City Hall or via an Amazon wishlist (www.amazon.com/hz/wishlist/...)

04.08.2025 18:14 - 👍 13    🔁 9    💬 2    📌 1

wait what about the fantasy where I retire and set up an entire model railroad in the garage, complete with exploding volcanoes?

04.08.2025 19:05 - 👍 2    🔁 0    💬 0    📌 0
Table of attack success rate, 16 percent for hermes

I wrote up the results of red-teaming a RAG app, comparing gpt-4o-mini vs llama3.1:8b vs hermes3:3b and sharing examples of the successful attacks.
I def recommend red-teaming any LLM-powered apps you're putting into production - malicious users can get LLMs to say some pretty horrid things!

04.08.2025 17:10 - 👍 3    🔁 0    💬 0    📌 0

New horror movie about to drop:
"I KNOW WHAT YOU VIBE CODED LAST SUMMER"

04.08.2025 06:05 - 👍 4    🔁 0    💬 0    📌 0
My fish amongst other fish

"Draw a fish" is my fav website today.

https://news.ycombinator.com/item?id=44719222

01.08.2025 23:49 - 👍 2    🔁 0    💬 1    📌 0

is that pink oyster? i just did a pink oyster kit from Far West Fungi, pretty fun! mushrooms are cool

01.08.2025 20:20 - 👍 0    🔁 0    💬 0    📌 0
Persona Vectors: Monitoring and Controlling Character Traits in Language Models
Large language models interact with users through a simulated 'Assistant' persona. While the Assistant is typically trained to be helpful, harmless, and honest, it sometimes deviates from these ideals...

Persona Vectors

brb 👀👀👀👀👀👀

Anthropic just dropped this paper. They can steer models quite effectively, and even detect training data that elicits a certain (e.g. evil) persona

arxiv.org/abs/2507.21509

01.08.2025 17:30 - 👍 114    🔁 20    💬 5    📌 8

Running Python + AI Office Hours right now in the Azure AI Foundry Discord. Come on down!
http://aka.ms/aipython/oh

31.07.2025 20:04 - 👍 0    🔁 0    💬 0    📌 0
Residents concerned about trash at Shoppingtown mall

i just found out that my old fav mall from childhood has been abandoned and is accumulating trash. oh how the mighty have fallen.
www.yahoo.com/news/residen...

31.07.2025 19:13 - 👍 0    🔁 0    💬 0    📌 0

am only at 65%, time to refactor a codebase

31.07.2025 19:10 - 👍 0    🔁 0    💬 0    📌 0

@pamelafox is following 20 prominent accounts