broke my toe this week but we still shipped ep 2 ๐
DGX Spark: โinference boxโ vs dev rig
โถ๏ธ youtu.be/0CI19dXmOws
reply w/ your stack: GPU/CPU + RAM + runner + model/quant
broke my toe this week but we still shipped ep 2 ๐
DGX Spark: โinference boxโ vs dev rig
โถ๏ธ youtu.be/0CI19dXmOws
reply w/ your stack: GPU/CPU + RAM + runner + model/quant
Last week this post was for paid subscribers. Today itโs live for free subscribers.
Have You Heard of Logits? Tool calling vs grammars, and the โone character offโ failure mode that turns into agent loops.
open.substack.com/pub/soypetet...
I keep hearing about tool calling. I rarely hear about logits.
If youโre building agents (not chatbots), โalmost validโ outputs donโt fail gracefully. They turn into retries, loops, and wasted cycles.
open.substack.com/pub/soypetet...
I wrote up the workflow I actually use with AI as a software engineer: scope by ticket, keep debugging in the same session, use tests as forcing functions, and keep guardrails in Claude.md.
open.substack.com/pub/soypetet...
2025 was a wild rideโAI deep-dives, honest data modeling talks, and more community time than ever. My take: AI doesnโt replace engineering, it enables it. Check out my year-in-review thoughts and whatโs coming in 2026. open.substack.com/pub/soypetet...
31.12.2025 22:40 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
#GoWestConf
Sharing a fantastic blog by @soypetetech.bsky.social!! ๐
Thank you so much for mentioning my session!
Even as a first-timer, Iโm really happy and proud of being part of such an amazing moment with everyone. ๐ฅฐ
Five Years of Go West
soypetetech.substack.com/p/five-years...
Go West Conf 2025 is LIVE ๐
Sixth year since 2020 โ 85% Utah-based, 100% community-powered.
Couldnโt do it without Derrick, Annalisa, and Boston โค๏ธ
๐ฅ Watch live: twitch.tv/soypetetech
๐ And join us next year for the best Go vibes around.
#GoWest #Golang #UtahTech
Been thinking about how most GenAI tools feel designed from the front-end in. Langfuse, LangChain, even MCP all assume the client leads.
What happens when we shift toward server-side agents?
I think Go has a role to play.
open.substack.com/pub/soypetet...
#genai #golang #aiinfra #llmops
Just published a new post
Go was built for concurrencyโbut should you be using it? Most modern Go services run better without it.
Hereโs when to lean in, and when to step back.
๐ open.substack.com/pub/soypetet...
#golang #microservices #cloudnative #devtools
AI just leveled up for devs.
I turned a 2-day Go job into a 4-hour sprint using Claude Code.
Prompt-driven AI > autocomplete.
Youโre still the engineerโAI just ships faster.
๐ Read:
open.substack.com/pub/soypetet...
#ClaudeCode #Golang #AIforDevs #LLMOps #DevTools
I gave a talk recently about self-hosting AI models, and Iโve turned that into a new post.
The post covers how I started with local-first tools like Ollama and Llama.cpp, and why I still run them
Check it out here:
open.substack.com/pub/soypetet...
#LLMDev #SelfHost #LocalAI #Ollama #LlamaCpp
Finally setting up my Mac Studio on stream today. I've got a script that boots up my whole dev environment in one shotโCLI tools, AI stuff, everything. Come hang if you like clean setups, terminal life, or software tools that just work.
๐ด [twitch.tv/soypetech]
#MacStudio #DevSetup #HomeLab
Small data team? Big goals? I wrote about how Iโve built practical, high-impact data platforms as a team of oneโbalancing AI dreams, BI demands, and real-world constraints.
Read & comment: substack.com/@soypetetech...
#DataEngineering #Startups #ModernDataStack #AI #BI
Everyoneโs building agents. But the truth? Theyโre just software wrapped around an LLM.
If we want reliability, scale, and ROI, we need fewer frameworks and more engineeringโand we canโt ignore NLP.
open.substack.com/pub/soypetet...
#LLM #AI #NLP #DataEngineering #Agents #LangChain #Substack
Can an M3 Ultra Mac really outperform an RTX 5090 PC in LLM benchmarks?
This Memorial Day, Iโm running a full LLaMA.cpp showdown to test it live.
Will be Live on Twitch & YouTube.
#LLAMAcpp #RTX5090 #M3Ultra #MacStudio #AIInfrastructure #Benchmarking #LocalLLM #OpenSourceAI
New post: Prompt Engineering Without the Bloat
What I learned building and why most AI features come down to 2 questions:
How do you talk to the model?
How do you talk to the user?
No overengineering. Just clean design.
open.substack.com/pub/soypetet...
#GenAI #PromptEngineering #SoftwareDesign
gosh I love me some IAC. I dont know why, but it just is sooo cool to run
terraform apply
and watch a world get built
Just streamed compiling llama.cpp for GPU on my RTX 1590.
Ran into:
CUDA arch 12.0 not supported yet
Missing curl + SSL dev libs
WSL docs buried deep
With Twitch chatโs help, we got it running!
๐ open.substack.com/pub/soypetet...
#llamacpp #AIinfra #opensource #gpu #WSL2
I just posted a new video breaking down Pedroโs full 5090 upgrade:
Setup, WSL, LLaMA.cpp install, and a 9x performance boost.
Iโm trying to hit 500 subscribers to unlock YouTube monetizationโthis is now my full-time gig.
Drop a comment if you like the video!
youtu.be/hm4_VJP4GnE
Wild analytics anomaly this week on Twitch:
Jumped from 9 to 835 viewers.
No raid. No follow bump. Justโฆ poof. 800 viewers?
But only 57 unique viewers.
68 live views.
No other signals.
Gut check matters.
Donโt trust dashboards at first glance.
Sanity-check your dataโalways.
#dataengineering
great! just make sure you have a few rounds of human revision before production ๐
30.04.2025 22:28 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
Got a technical deep dive on Go?
GoWest Conf 2025 wants:
โข Production stories
โข Tools & internals
โข Compiler tricks
โข Infra, performance, scale
๐ค Submit your in-person talk:
๐ sessionize.com/gowest-conf-...
๐ Lehi, UT โ Oct 24
๐ง Live now! Thought I could just download a binaryโฆ turns out I need to recompile llama.cpp for GPU ๐
Todayโs stream includes:
- CMake & CUDA debugging
- Compiling from source
- Running a bigger model from Hugging Face
๐ twitch.tv/soypetetech
#llamacpp #AI #CMake #CUDA
I made a supercut of my RTX 5090 unboxing streamโ
featuring the MSI Infinite RS Tower w/ the Ultra 9 285K and all the fixings.
If you were there live, drop a like!
If not, check it out and see whatโs powering Pedro now:
youtu.be/ixtMcmEZtGo
๐ฅ + โ๏ธ Live now!
Todayโs stream is half content creation, half DevOps adventure:
Recording the talking head segment for the PedroGPT unboxing vid
Deploying the Discord bot to Kubernetes
Come for the AI bot, stay for the cluster chaos.
www.twitch.tv/soypetetech
#Kubernetes #DevOps #LiveCoding
Hey everyone!
Todayโs stream is all about refactoring Pedroโs Connector APIโwe're improving how the LLM connects to Twitch + Discord and pulling out the database logic to make the AI layer clean, reusable, and self-contained.
๐ twitch.tv/soypetetech
I left my last job as part of the lay-offs.
While I job hunt, Iโm going all-in on building content, learning new tech, and investing in myself while I look for the next thing.
Leads welcome!
All links: linktr.ee/soypete_tech
๐ง Live now! Weโre refactoring the Pedro Connector API to better integrate our LLM with Twitch + Discordโand removing database logic from the AI layer to make it self-contained.
Cleaner code, smarter bots.
๐ twitch.tv/soypetetech
#AI #LLM #Golang #LiveCoding #TwitchDev
๐จ Todayโs Stream: PedroGPT Goes K3s! ๐จ
๐ twitch.tv/soypetetech
๐ youtube.com/c/miriahpete...
Join me and let's make Pedro cloud-native!
#K3s #Kubernetes #PedroGPT #LLMs #DevOps #HomeLab #LiveCoding #SelfHosted #Tailscale #Prometheus #InfraAsCode
Going live with Pedroโs Upgrade Party ๐
๐ RTX 5090 unboxing
๐ค Migrating my LLM bot Pedro
๐ LLaMA.cpp vs Ollama showdown
๐ณ๏ธ Community poll results live on stream!
๐บ Watch on Twitch + YouTube
twitch.tv/soypetetech