vLLM for beginners: Deployment Options (PartIII) - Cloudthrill
In this final part, weโll shift from theory to practice, covering how to deploy vLLM across different environments, from source builds to docker containers. In this series, we aim to provide a solid foundation of vLLM core concepts to help you understand how it works and why itโs emerging as a de facto choice for LLM deployment.
๐#BlogSeries #vllm๐ฅ
๐ฏ๐๐๐ ๐๐จ๐ซ ๐๐๐ ๐ข๐ง๐ง๐๐ซ๐ฌ ๐๐๐ซ๐ญ ๐:๐ ๐๐๐ฉ๐ฅ๐จ๐ฒ๐ฆ๐๐ง๐ญ ๐๐ฉ๐ญ๐ข๐จ๐ง๐ฌ
Learn to deploy #vLLM everywhere! Even on CPU๐คซ
โ
Platform & model Support Matrix
โ
Install on GPU & CPU
โ
Build Wheel from scratch | Python vLLM package
โ
Docker/Kubernetes Deployment
โ
Running vLLM server (Offline + Online inference)
05.08.2025 13:49 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 1
YouTube video by CloudDude
๐ดTechBeats live : LLM Quantization "vLLM vs. Ollama"
๐๏ธ Join our ๐ด๐๐๐๐ก ๐๐๐๐ญ๐ฌ ๐๐ข๐ฏ๐ Show!
๐๏ธ Thursday 17th 11:30 AM EDT
๐ฏ A chill livestream unpacking LLM #Quantization: #vllm vs #ollama. Learn about the What & How.
๐ฅDope guest stars:
#bartowski from arcee.ai & Eldar Kurtic from #RedHat
๐Stream on YouTube & Linkedin:
www.youtube.com/watch?v=XTE0...
10.07.2025 14:33 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 1
vLLM for beginners: Key Features & Performance Optimization(PartII) - Cloudthrill
In this series, we aim to provide a solid foundation of vLLM core concepts to help you understand how it works and why itโs emerging as a defacto choice for LLM deployment.
๐#NewBlog #vllm๐ฅ
๐ฏ๐๐๐ ๐๐จ๐ซ ๐๐๐ ๐ข๐ง๐ง๐๐ซ๐ฌ ๐๐๐ซ๐ญ ๐:๐๐๐๐ฒ ๐
๐๐๐ญ๐ฎ๐ซ๐๐ฌ & ๐๐ฉ๐ญ๐ข๐ฆ๐ข๐ณ๐๐ญ๐ข๐จ๐งs
๐ What makes #vLLM the Rolls Royce of inference?
๐check it out: cloudthrill.ca/what-is-vllm...
โ
#PagedAttention #PrefixCaching #ChunkedPrefill
โ
#SpeculativeDecoding #FlashAttention #lmcache
โ
Tensor & #PipelineParallelismโก
02.07.2025 15:19 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 1
vLLM for beginners: The Fundamentals - Cloudthrill
In this series, we aim to provide a solid foundation of vLLM core concepts to help you understand how it works and why itโs emerging as a defacto choice for LLM deployment.
๐#NewBlogAlert "What is #vLLM ?"
Weโre kicking off our ๐ฏ๐๐๐ ๐๐จ๐ซ ๐๐๐ ๐ข๐ง๐ง๐๐ซ๐ฌ ๐ฌ๐๐ซ๐ข๐๐ฌ with
๐๐๐ซ๐ญ ๐:๐ ๐๐ก๐ ๐
๐ฎ๐ง๐๐๐ฆ๐๐ง๐ญ๐๐ฅ๐ฌ๐ซ
New to vLLM ? This one's for you๐๐ป: cloudthrill.ca/what-is-vllm
โ
What is vLLM ( vLLM vs Ollama)
โ
Core Architecture (Engine, Sched, Execution, Memory)
โ
Offline and Online inference
17.06.2025 16:53 โ ๐ 1 ๐ 1 ๐ฌ 0 ๐ 1
Ep06: "GitHub Security horror stories " (withย Steveย Giguere)
Tech Beats Unplugged ยท Episode
๐๐๐๐๐ก ๐๐๐๐ญ๐ฌ ๐๐ง๐ฉ๐ฅ๐ฎ๐ ๐ ๐๐ is BACK #Episode 06 !!!๐๐ป
๐ง๐ฅ"๐๐ข๐ญ๐๐ฎ๐ ๐๐๐๐ฎ๐ซ๐ข๐ญ๐ฒ ๐ก๐จ๐ซ๐ซ๐จ๐ซ ๐ฌ๐ญ๐จ๐ซ๐ข๐๐ฌ with
#SteveGiguere "โข๏ธ...tons of ๐๐ญ๐ญ๐๐๐ค ๐ฏ๐๐๐ญ๐จ๐ซ๐ฌ, best practices, and a lot of laughs๐
. You don't wanna miss this !
Thank you Steve๐๐ป
๐๐ป spoti.fi/4dYicES ๐๐ป
10.06.2025 14:18 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 1
๐๏ธ๐๐๐๐ก ๐๐๐๐ญ๐ฌ ๐๐ง๐ฉ๐ฅ๐ฎ๐ ๐ ๐๐ is back! new episode drops tomorrow!๐ฅ๐Mystery guest. Bold truths. Shocking discoveries youโre not ready to hear๐ฑ ๐งMics are hot, truths are hotter. Stay tunned !!๐
#TechPodcast #GitSecStory
09.06.2025 18:45 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
#NewBlog ๐๐ ๐๐ฎ๐ฐ๐ต๐ฒ ๐๐
๐ฝ๐น๐ฎ๐ถ๐ป๐ฒ๐ฑ: like I'm 5๐
๐ง Ever wondered what #KVCache really is in LLM inference? Forget the math-heavy blablaโthis one's made to click !
๐check it out: cloudthrill.ca/kv_cache-exp...
@Cloud_Thrill
#vLLM #AIInfra #lmcache
27.05.2025 20:17 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
๐ง First thing to do when using Jules from google and connecting your GitHub..! #GoogleJules
21.05.2025 16:32 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
HashiCorp Vault for Dummies: Setup your 1st Vault with TLS (WSL) - Cloudthrill
n this guide, you'll learn how to set up a local Vault server using Raft storage and TLS in a WSL (Windows Subsystem for Linux) environment. Whether you're just starting with secrets management, prepp...
๐ #NewBlogAlert ๐ก๏ธ #HashiCorpVault
I'm kicking off a ๐๐๐ฎ๐ฅ๐ญ ๐๐จ๐ซ ๐๐ฎ๐ฆ๐ฆ๐ข๐๐ฌ ๐ฌ๐๐ซ๐ข๐๐ฌ with
๐๐๐ซ๐ญ ๐:๐ ๐๐จ๐ฐ ๐ญ๐จ ๐๐๐ญ ๐๐ฉ ๐๐๐ฌ๐ก๐ข๐๐จ๐ซ๐ฉ ๐๐๐ฎ๐ฅ๐ญ with ๐๐๐๐ญ & ๐๐๐
๐check it out: tinyurl.com/HashiVault-f...
@cloudthrill.bsky.social
20.05.2025 17:02 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 1
Explore OCI DevOps: Pipeline your Terraform Stack from GitHub to OCI
Event posted by about Explore OCI DevOps: Pipeline your Terraform Stack from GitHub to OCI on AIOUG.
๐จCatch me live at @AIOUG tomorrow!๐
Pumped to join India Oracle Users Group for a live session #OCIDevOps #Terraform
๐๏ธ May 16 | ๐ 8:30 AM EST / 6:00 PM IST
Save your spot: aioug.org/events/explo...
15.05.2025 17:09 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
History Doesn't Repeat Itself, but It Often Rhymes...
14.05.2025 19:33 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Exciting news for my team at @cloudthrill.bsky.social !โฅ๏ธwe're officially accepted into the ๐๐๐๐๐๐๐ ๐๐ง๐๐๐ฉ๐ญ๐ข๐จ๐ง ๐๐ซ๐จ๐ ๐ซ๐๐ฆ
12.05.2025 21:35 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
YouTube video by CloudDude
Kelsey Hightower: Why Black Founders still struggle in Silicon valley? (ep03)
๐๏ธIn ๐๐ฉ๐๐, I sat down with @kelseyhightower.com to unpack a tough truth and he kept it ๐ฏ
โก๏ธWith only ๐.๐% of VC funding, ๐๐ก๐ฒ ๐๐ฅ๐๐๐ค ๐
๐จ๐ฎ๐ง๐๐๐ซ๐ฌ ๐ฌ๐ญ๐ข๐ฅ๐ฅ ๐ฌ๐ญ๐ซ๐ฎ๐ ๐ ๐ฅ๐ ๐ข๐ง ๐๐ข๐ฅ๐ข๐๐จ๐ง ๐๐๐ฅ๐ฅ๐๐ฒ?
๐บlisten to the full take here: youtu.be/gsMjYZZOBe0 #TechBeatsUnplugged #NumbersDonLieโ๐พ
06.05.2025 13:38 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
TAICO May 2025 meetup!, Wed, May 7, 2025, 5:30 PM | Meetup
The TAICO team is proud to announce our next meetup at the Adaptavist office in Toronto. Much thanks to [Adaptavist](https://www.adaptavist.com/ "https://www.adaptavist.com
๐จ#AI & #CyberSec heads in #Toronto!
Join us on Wednesday, ๐๐๐ฒ ๐๐ญ๐ก from 5:30pm-8pm EST for another exciting #TAICO Meetup (Toronto AI and Cybersecurity Organization).
#Cloudthrill #ProudSponsor๐ฅ
www.meetup.com/taico-toront...
03.05.2025 01:16 โ ๐ 0 ๐ 1 ๐ฌ 0 ๐ 0
YouTube video by CloudDude
Meta announced ๐๐๐ญ๐ ๐๐๐, SDKs with inference platform (race for cost trimming)
๐ตMeta announced ๐๐๐ญ๐ ๐๐๐, SDKs with inference, eval/ tuning platform.. officially Joining AI API business ... But Size always matters according to @databricksinc.bsky.social CEO's Ali ghodsi
youtu.be/mGtQLBPw4iU?...
#AI #Inference #TokenCost #llamaCon #llama๐ฆ
30.04.2025 00:39 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
Reality Check
I'm sick and god-damn tired of this! I have written tens of thousands of words about this and still, to this day, people are babbling about the "AI revolution" as the sky rains blood and crevices open...
Newsletter: I am sick and god damn tired of everybody pretending that generative AI is the next big thing. The media is complicit in accepting fantastical nonsense - both in the numbers put out by OpenAI and the silly jobs created by Anthropic - and it has to stop.
www.wheresyoured.at/reality-check/
28.04.2025 17:11 โ ๐ 4245 ๐ 1078 ๐ฌ 99 ๐ 129
How to pass the CKA certification - Cloudthrill
CKA is 100% hands-onโno multiple-choice, just real-world challenges. In this post, Iโll break down:โ
Exam structure & key domainsโ
How I prepared (resources, labs, and time investment)โ
Tips to ace it...
#NewBlog ๐๐จ๐ฐ ๐ญ๐จ ๐๐๐ฌ๐ฌ ๐ญ๐ก๐ ๐๐๐ ๐๐๐ซ๐ญ๐ข๐๐ข๐๐๐ญ๐ข๐จ๐ง โ ๐๐จ ๐๐๐ญ๐๐ค๐๐ฌ!๐
Time to refocus on your goalsโlike finally crushing that elusive ๐๐๐ ๐๐ฑ๐๐ฆ with my curated guide on:
โ
Best resources, hands-on labs, time investment tips
โ
D-day strategies, CLI tricks that save you time
๐ฅJust Do it๐ช
๐๐ป buff.ly/S60cXwN #CNCF
28.04.2025 20:51 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 1
21.04.2025 20:58 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
I know. Europe will also apply the same for north americans (ETIAS). It just didn't sound logical to see labor laws applied on travelers working remotely for foreign companies on a short stay (couple weeks).
It's like when ICE ruled to deport someone but then denied bail because of a flight risk;)
21.04.2025 20:56 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
I can read the word ESTA, still can't help but ask if this applies to non-citizens (rhetorical).
20.04.2025 15:11 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0
20.04.2025 15:01 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
YouTube video by Wiz
CISO MUSICAL | Official Broadway Trailer
$32B for this!... youtu.be/4W17F9Ho_38?...
17.04.2025 17:57 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
๐ง ๐๐๐ ๐๐จ๐๐๐ฅ๐ฌ ๐๐จ๐ซ ๐๐ฎ๐ฆ๐ฆ๐ข๐๐ฌ #cheatsheet
๐ค If youโve opened #ChatGPT lately and thought:
โ๐๐๐ข๐ญโฆ ๐ฐ๐ก๐๐ญโ๐ฌ ๐จ๐? ๐๐ง๐ ๐ฐ๐ก๐ฒ ๐๐ซ๐ ๐ญ๐ก๐๐ซ๐ ๐ฌ๐จ ๐ฆ๐๐ง๐ฒ ๐ฆ๐จ๐๐๐ฅ๐ฌ ๐ง๐จ๐ฐ?โ Youโre not alone. Today #openAI finally answered๐๐ปโโ๏ธ
๐๐ปhttps://platform.openai.com/docs/models/compare
16.04.2025 18:23 โ ๐ 0 ๐ 1 ๐ฌ 0 ๐ 0
Turn Your Localhost into a FREE Public URL with Ngrok & Zrok -part 1 - Cloudthrill
Ngrok VS Zrok allow you to securely expose your local services to the internet. Plus, they provide free static domainsโno extra configuration needed. Letโs dive in!
๐#NewBlogpost ๐๐จ ๐๐ฎ๐๐ฅ๐ข๐ ๐๐? ๐๐จ ๐๐ซ๐จ๐๐ฅ๐๐ฆ!
Turn Your ๐๐จ๐๐๐ฅ๐ก๐จ๐ฌ๐ญt into a FREE public URL
with ๐๐ซ๐จ๐ค from @openziti!โ This ๐๐๐ซ๐จ๐๐ซ๐ฎ๐ฌ๐ญ tool allows you to securely expose your local apps to the internet for FREE! ๐ ๐ cloudthrill.ca/ngrok-vs-zrok-part1
15.04.2025 18:38 โ ๐ 1 ๐ 1 ๐ฌ 0 ๐ 1
This is not the revolution I expected from AI ...;) #AIEntitlement
04.04.2025 17:04 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
๐ฅSuper pumped to speak about AI inference in OKE, See you next Wednesday (April 2th)! RSVP below๐๐ป
27.03.2025 15:55 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0
258,341 students... Go be a consultant they said ๐ฅน๐ฅฒ๐ฅน
21.03.2025 16:17 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
British, But In Las Vegas and NYC
ezitron.76 Sig
Newsletter - wheresyoured.at
https://linktr.ee/betteroffline - podcast w/ iheartradio
Chosen by god, perfected by science
CEO at EZPR.com - Award-Winning Tech PR
CEO @civo.com - disruptive challenger to the hyperscale providers...building the cloud how it should have always been. I love to innovate and inspire people.
Welcome to CloudThrill, a dynamic consulting firm specializing in cloud automation and DevOps services. Founded and led by our passionate
@clouddude.bsky.social
Immigrant || Aspiring Taco Truck Taquero || Product Manager - Oracle || My opinions || Pura Vida ๐จ๐ท he/him
Semgrep is a code scanning platform for finding first and third-party security vulnerabilities in your code base.
Working @42talents. Spring Trainer @VMwareTanzu. Co-Organizer @VoxxedZurich, Board member of @jugch, Software Crafters Zurich & SoCraTes Conf Switzerland
Danish geek ๐ค Oracle SQL & PL/SQL Developer โ ACE Director 4๏ธโฃ2๏ธโฃ #SYM42 ๐จโ๐ณ Likes to cook ๐ Reads sci-fi ๐บ Beer Enthusiast ๐ข Cegal Danmark A/S ๐งโ๐ป https://kibeha.dk
Dad. CTO of @tremolo.io . Star Wars, Kubernetes, identity and access management and the Yankees. Co-author of Kubernetes: An Enterprise Guide, 3rd ed https://www.amazon.com/dp/B0CT8M958T
โญ Lift Implementation Technical Lead - Database at @Oracle
๐ฉโ๐ป Geek girl, promoting #womenintech
โ ๏ธ Oracle ACE Alumni
๐๏ธ Views my own
๐งโโ๏ธ Open Sourceress
๐ Ms. (f)Rizzle @Cisco
๐ณ๏ธโโง๏ธ Blame for Oops & Ops my own
๐ค Seize the means of computing
๐ฉโ๐ป K8s/AWS/Azure Platform Engineer
๐ https://blog.usrbinkat.io/en/page/about
๐ฟWatching Dystopia IRL
๐ง Neuro Spicy Autist
๐ Sacramento โ
Co-leader OWASP Cornucopia. If you like what we do for open source, visit our code repository https://github.com/OWASP/cornucopia and give us a star โญ
๐ ยซDifference is of the essence of humanityยป โ John Hume
#appsec #owasp #cornucopia #threatmodeling
Best-selling author of Alice and Bob Learn Secure Coding & Alice and Bob Learn Application Security. Secure Code Trainer - Nerd @Semgrep #AppSec she/her
https://shehackspurple.ca ๐ป
Host of Lex Fridman Podcast.
Interested in robots and humans.
Founder Tech For Palestine, Darklang, and CircleCI. Currently running Tech for Palestine https://techforpalestine.org
Writing The Pragmatic Engineer (@pragmaticengineer.com), the #1 technology newsletter on Substack. Author of The Software Engineer's Guidebook (engguidebook.com). Formerly at Uber, Skype, Skyscanner. More at pragmaticengineer.com
Host, โVelshiโ, Sat/Sun 10a-12pET, "The Last Word" Fri 10pET on MSNBC
Co-founder of HashiCorp. Passionate about technology and startups. I love to build things.
๐ซ Data Loss - ๐ฏ Uptime - ๐ก๏ธ Data Guard Product Manager at โญ #OracleMAA - ๐กOpinions are my own - ๐๐๏ธ๐ต๐๐ถ๐จโ๐ป๐น๏ธ
Your database is down because you didn't listen to me.
I put databases in containers so I can cook, buy knives, run & give dogs the life they deserve. He/him.
Backupโ Recovery. Dogs>People. Trans Rights=Human Rights.