Karl Weinmeister's Avatar

Karl Weinmeister

@kweinmeister.bsky.social

Cloud Developer Advocacy @ Google. AI/ML/Data, Blue Devil & Longhorn, wanna-be at home improvement. Opinions are my own.

512 Followers  |  1,936 Following  |  118 Posts  |  Joined: 24.10.2024  |  1.7711

Latest posts by kweinmeister.bsky.social on Bluesky

Preview
Python and Rust interoperability: A walkthrough for building a high performance MCP server You’ll learn step-by-step instructions for including Rust code with your Python code. We’ll build a tool for AI agents compliant with MCP.

Have you seen Python libraries “powered by Rust” and wondered how you could do it, too?

My new article walks you through every step of the way. It shows you how you can build a Rust-based MCP tool served by a Python FastMCP server.

medium.com/google-cloud...

@thisweekinrust.bsky.social

20.10.2025 13:55 — 👍 5    🔁 0    💬 0    📌 0
Post image

n8n AI automation docs for Cloud Run are now here!

docs.n8n.io/hosting/inst...

17.10.2025 21:33 — 👍 3    🔁 0    💬 0    📌 0

You can also use Docker!

16.10.2025 22:03 — 👍 1    🔁 0    💬 1    📌 0
Post image

vLLM TPU is now powered by tpu-inference!
* Broader model coverage and feature support
* 5x faster performance than Feb 2025 version

Learn more: blog.vllm.ai/2025/10/16/v...

16.10.2025 20:41 — 👍 3    🔁 0    💬 1    📌 0
How will AI influence language ecosystems?
Rust ⬆️ Python ⬇️ That's my prediction 2 years out. Why? 1. Library support is no longer a moat. Existing libraries will proliferate in more languages or be bypassed with vibe-coding. 2. A… How will AI influence language ecosystems?

Watch for more: www.youtube.com/shorts/M215l...

14.10.2025 13:02 — 👍 0    🔁 0    💬 0    📌 0

Rust ⬆️
Python ⬇️
That's my prediction 2 years out. Why?

1. Library support is no longer a moat. Existing libraries will proliferate in more languages or be bypassed with vibe-coding.
2. A language's perceived difficulty will not be the barrier to adoption it once was, if it offers unique value.

14.10.2025 13:02 — 👍 3    🔁 0    💬 1    📌 0
Preview
Deploy Faster with Terraform: Your Guide to vLLM on GKE with Infrastructure-as-Code This article is a practical guide on how to use Terraform for agile ML engineering with IaC.

💥 Stop manually spinning up GPU clusters. Deploy vLLM on GKE more reliably with Terraform.

Learn:
✅ AI inference with IaC
✅ Spot + on-demand GPU node pools
✅ Persistent model caching with PVCs
✅ GitOps CI/CD pipelines via GitHub Actions

Full working code + architecture:
medium.com/google-cloud...

13.10.2025 14:02 — 👍 1    🔁 1    💬 0    📌 0

Yes! Especially if you work with Cloud tech, Terraform is a great skillset to learn.

10.10.2025 20:37 — 👍 1    🔁 0    💬 0    📌 0
Should you learn Terraform?
Is Terraform worth learning? Is it useful even for small projects? Here’s my take. #DevOps #MLOps #AI Should you learn Terraform?

Is Terraform worth learning? Is it useful even for small projects? Here’s my take.

youtube.com/shorts/qXsAJ...

#DevOps #MLOps #AI

10.10.2025 19:46 — 👍 0    🔁 0    💬 1    📌 0
The Agent Factory - Episode 10: Agent Security
YouTube video by Google Cloud Tech The Agent Factory - Episode 10: Agent Security

Are you up to speed yet on agent security? Catch up with Ayo Adedeji and Aron Eidelman on:
- Real-world attack vectors
- Practical implementations
- Multi-agent considerations

youtu.be/nxezufaezHw

09.10.2025 13:22 — 👍 0    🔁 0    💬 0    📌 0
Post image

What API call are you making 10x a day? 🤔 Turn it into a simple /command.

With Gemini CLI extensions, you can build your own shortcuts to speed up your work:
/fetch_jira ABC-123
/deploy_staging

Learn how easy it is to get started:
geminicli.com/docs/extensi...

08.10.2025 15:56 — 👍 3    🔁 0    💬 0    📌 0
Video thumbnail

Has Gemini ever felt like it's losing focus as your conversation goes on? Naturally, more context means more topics to cover.

Use the /compress command to keep the Gemini CLI on track. It prunes the history without a full reboot.

Get started today with the Gemini CLI: npx @google/gemini-cli

07.10.2025 17:03 — 👍 1    🔁 1    💬 0    📌 0
Preview
Gemini 2.5 Flash Image now ready for production with new aspect ratios- Google Developers Blog Our state-of-the-art image generation and editing model which has captured the imagination of the wo...

Not only is Nano Banana 🍌 production ready,
it now supports 10 aspect ratios!

Landscape: 21:9, 16:9, 4:3, 3:2
Square: 1:1
Portrait: 9:16, 3:4, 2:3
Flexible: 5:4, 4:5

03.10.2025 14:30 — 👍 1    🔁 0    💬 0    📌 0
Preview
AI-Generated “Workslop” Is Destroying Productivity Despite a surge in generative AI use across workplaces, most companies are seeing little measurable ROI. One possible reason is because AI tools are being used to produce “workslop”—content that appea...

Who’s your audience, and how are you offering value in your message?

AI tooling is immensely useful as a partner, but your engagement and insights remain essential.

hbr.org/2025/09/ai-g...

03.10.2025 11:55 — 👍 0    🔁 0    💬 0    📌 0
Video thumbnail

Dublin 🇮🇪 we're coming!
Learn to build AI agents and deploy your MCP servers to production scale.

Register: goo.gle/accelerate-ai-dublin
Seats are limited!

Shir Meir Lador @kweinmeister.bsky.social @caseywest.bsky.social
#AI #AIagents #MCPServers #CloudRun #Workshop @GoogleCloudTech #DublinEvents

30.09.2025 12:51 — 👍 1    🔁 2    💬 0    📌 0
DeepSeek Sparse Attention Explained
YouTube video by Cloud with Karl DeepSeek Sparse Attention Explained

Learn about Sparse Attention in DeepSeek-V3.2-Exp:
* O(L²) → O(L·k) with similar performance to V3.1 Terminus
* Lightning indexer scores previous tokens
* Top-k selector picks top 2k sparse tokens from 128k window

📄 Paper: github.com/deepseek-ai/...

🎬Video:
youtube.com/shorts/CLsju...

30.09.2025 01:23 — 👍 1    🔁 1    💬 0    📌 0
Preview
The Agency Spectrum: An AI Risk Management Framework We can now build autonomous systems that pursue meaningful, high-level goals. Yet, for every inspiring success story, there is a…

You have four autonomy dials you can tune in your agentic AI system. Are you using them?

medium.com/google-cloud...

26.09.2025 16:25 — 👍 0    🔁 1    💬 0    📌 1

All those pesky brackets making tokenization messy 😂

24.09.2025 23:05 — 👍 1    🔁 0    💬 0    📌 0

When I have a choice, I’ve been picking it over JSON. The readability and comments are nice!

24.09.2025 20:17 — 👍 0    🔁 0    💬 2    📌 0

Got it! Will pass on the feedback.

24.09.2025 18:08 — 👍 1    🔁 0    💬 1    📌 0
Preview
Regression due to #4739 Provide a better diff view on light-mode environments · Issue #5927 · google-gemini/gemini-cli What happened? #4739 was fixed with #4747 and released in July. On light-mode environments the theme used is very much not readable: I am opening a new issue as I can not reopen the original one. W...

@prietschka.bsky.social I appreciate your sense of humor 😀

I'm aware of this resolved issue about the diff view with light mode: github.com/google-gemin... Anything else specific about light mode support that stands out to you?

24.09.2025 17:27 — 👍 1    🔁 0    💬 1    📌 0
Preview
Google AI Pro and Ultra subscribers now get Gemini CLI and Gemini Code Assist with higher limits. Google AI Pro and Ultra subscribers now get higher limits to Gemini CLI and Gemini Code Assist IDE extensions.

If you're a Google AI Pro or Ultra subscriber, your daily limits for Gemini CLI and Code Assist just got a nice bump. Spend less time worrying about quotas, and more on building! 💻

blog.google/technology/d...

24.09.2025 16:05 — 👍 4    🔁 1    💬 0    📌 1
Agentic AI: what makes it unique?
YouTube video by Cloud with Karl Agentic AI: what makes it unique?

What really makes agentic software unique from traditional software? And is it all-or-nothing?

The essential concept is already there in the name: agency. You can control the degree of agency with tool design and human-in-the-loop patterns.

youtube.com/shorts/JIHfn...

24.09.2025 15:57 — 👍 0    🔁 0    💬 0    📌 0
Post image

The Accelerate AI with Cloud Run tour is headed to Europe!
👉 Register at goo.gle/accelerate-ai

Join me and the Google team in a hands-on workshop near you:
📍 Dublin: Oct 29 goo.gle/accelerate-ai-dublin
📍 Munich: Oct 31 goo.gle/accelerate-ai-munich
📍 Paris: Nov 4 goo.gle/accelerate-ai-paris

19.09.2025 15:08 — 👍 3    🔁 2    💬 0    📌 0
Ruff for Python Linting and Formatting
Speed up your Python development workflow with Ruff, the extremely fast Python linter and code formatter, written in Rust! In this video, I explore how Ruff can help you write cleaner, more… Ruff for Python Linting and Formatting

For a quick overview, check out the short video: www.youtube.com/shorts/1N5ub...

16.09.2025 11:40 — 👍 0    🔁 0    💬 0    📌 0
Preview
How to Write Better Python with Ruff on Google Cloud Ruff can unify your code quality toolchain, accelerate CI/CD, and integrate with AI tooling for seamless Python development.

Yes, you still need code quality checks in the era of AI. Different prompts, context, and models lead to different outcomes.

Ruff unifies a litany of Python tools to do the job. Pro tip: send any issues it can't fix automatically to the Gemini CLI.

Read more: medium.com/google-cloud...

16.09.2025 11:40 — 👍 1    🔁 0    💬 1    📌 0
Preview
Build with your Favorite Models from the Vertex AI Model Garden with LiteLLM See Qwen 3 Coder in action as a Model-as-a-Service

The blog post has all the code and step-by-step instructions: medium.com/@kweinmeiste...

08.09.2025 14:31 — 👍 0    🔁 0    💬 0    📌 0
How to Connect to Vertex AI Models with a LiteLLM Proxy
Ready to explore the vast ecosystem of AI models in Google's Vertex AI Model Garden? This video shows you how to use LiteLLM as a unified bridge to access and experiment with a diverse range of… How to Connect to Vertex AI Models with a LiteLLM Proxy

Want to use models from Google's Vertex AI Model Garden with an OpenAI-compatible API?

My new video shows how to set up LiteLLM as a local proxy to do just that. Simplify your workflow and call models like Qwen, DeepSeek, and more through a unified interface.

www.youtube.com/shorts/ntI5A...

08.09.2025 14:30 — 👍 1    🔁 0    💬 1    📌 0
Preview
In-browser semantic search with EmbeddingGemma A few days ago, Google DeepMind released a new embedding model based on the Gemma open weight model: EmbeddingGemma. With 308 million parameters, such a model is tiny enough to be able to run on edge ...

In-browser #AI semantic search 🧠 with Google's new #EmbeddingGemma embedding model and @hf.co's #Transformersjs

Gain enhanced privacy, zero server costs, low-latency results directly on your device.👇

glaforge.dev/posts/2025/0...

08.09.2025 10:14 — 👍 5    🔁 1    💬 1    📌 0
Agentic Development with Zed and Gemini CLI
Gemini CLI + Zed feels wonderfully natural. I got into the flow within minutes: 1. brew install --cask zed (on macOS) 2. Select "New Gemini CLI Thread" 3. Install Gemini CLI (if needed) Learn more… Agentic Development with Zed and Gemini CLI

Gemini CLI + Zed feels wonderfully natural. I got into the flow within minutes:

1. brew install --cask zed (on macOS)
2. Select "New Gemini CLI Thread"
3. Install Gemini CLI (if needed)

Learn more in this walkthrough: www.youtube.com/shorts/fAl--...

28.08.2025 13:56 — 👍 3    🔁 1    💬 0    📌 0

@kweinmeister is following 20 prominent accounts