*based on the dimensions "human vs. machine" and "realistic vs. comic"
14.01.2026 14:08 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0*based on the dimensions "human vs. machine" and "realistic vs. comic"
14.01.2026 14:08 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
I've seen a lot of explanations on similarity measures in vector search but this one by my colleague
@dadoonet is by far the most fun!
How similar* is Han Solo to:
โข Princess Leia: very similar
โข Obi-Wan: meh
โข Darth Vader: complete opposites
Talk slides: david.pilato.fr/talks/2025/2...
What's the most underrated embedding technique you've used?
Static embeddings -> speed-improvements
Binary quantization -> storage-reduction
Late interaction -> added granularity
I'm curious about lesser-known approaches that worked surprisingly well.
Roses are red,
violets are blue,
A good baseline embedding model
is all-MiniLM-L6-v2.
ETA: uses Anthropicโs citations API
11.02.2025 18:33 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
Make RAG results more trustworthy with citations.
In his latest recipe, @danman966.bsky.social shows you how you can build a RAG pipeline with citations, using:
- a @weaviate.bsky.social vector database and
- @anthropic.com's Claude 3.5 Sonnet
๐ Code: github.com/weaviate/rec...
Haha, what specialized topics are you planning to catch up on in the field of AI agents?
31.01.2025 09:51 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
Normalize not knowing everything in the AI space.
It's evolving fast.
Iโm sure your to-do list is growing as fast as mine.
Here are 3 topics, I want to catch up on this quarter:
โข AI agents
โข Fine-tuning embedding models
โข Multimodality
โข (If time permits: reinforcement learning)
What about you?
Iโm trying to wrap my head around multi-agent system architectures.
Here are some patterns Iโm seeing so far:
1. Type of collaboration:
Network vs. hierarchical
2. Type of information flow:
Sequential vs. parallel vs. loop
3. Type of functionality:
Routing vs. aggregating
What else?
Some considerations for choosing a vector dimension:
1. Data complexity
2. Task complexity
3. Dataset size
4. Computational constraints
5. Performance requirements
6. Scalability requirements
7. Latency requirements
What else?
#1 Rule of RAG Club: Look at your data.
With the new explorer tool, looking at your data got a lot easier in Weaviate Cloud.
The explorer tool provides a graphical interface to easily:
โข Browse collections
โข Inspect objects, metadata, and vectors
Check it out now: https://buff.ly/3KWivSF
You can be GPU poor like me and still fine-tune an LLM.
Hereโs how you can fine-tune Gemma 2 in a Kaggle notebook on a single T4 GPU:
โข @kaggle.com offers 30 hours/week of GPUs for free
โข @unsloth.bsky.social uses 60% less memory to fit it on a T4 GPU
๐Code: https://buff.ly/4apUUG2
Although I know that
Vertical scaling: scaling up (to a more powerful machine)
Horizontal scaling: scaling out (to multiple smaller machines)
I still always have to take a second to think about it.
Itโs like the left-right-weakness of system design.
I talk about RAG so much, I could fill a book.
So, we did - and you can download it for free.
Together with my colleagues Mary & Prajjwal, we curated an e-book of the most effective advanced RAG techniques.
Which ones did we miss?
Get it now: weaviate.io/ebooks/advan...
Over the holidays, I learned how to fine-tune an LLM.
Hereโs my entry for the latest @kaggle.com comp.
This tutorial shows you:
โข Fine-tune Gemma 2
โข LoRA fine-tuning with @unsloth.bsky.social on T4 GPU
โข Experiment tracking with @weightsbiases.bsky.social
๐Code: www.kaggle.com/code/iamleon...
Thanks! Merry Christmas to you, too, Tomaz!
20.12.2024 08:09 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
Got myself a little early Christmas present.
Although this book is from 2017, I heard so many good things about it this year.
Can't wait to dig into this over the holidays.
And with that being said, I hope you have some nice and relaxing holidays yourself!
See you in the new year!
Last yearโs predictions: towardsdatascience.com/2023-in-revi...
17.12.2024 18:30 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
To make it a little bit more fun, Iโm making some bolder predictions for 2025 this time:
โข Video will be an important modality
โข Moving from one-shot to agentic to human-in-the-loop
โข Fusion of AI and crypto
โข Latency and cost per token will drop
What other trends are you observing in the AI space?
Itโs time to review the AI space in 2024!
Hereโs what I got right (and what I missed) in my 2024 predictions:
โ
ย Evaluation
โย Multimodal foundation models
โย Fine-tuning open-weight models and quantization
โย AI agents
โ
ย RAG lives on
โย Knowledge graphs
medium.com/towards-data...
ๆฅๆฌ่ชใใญในใๅใใฎใใคใใชใใๆค็ดขใซใฏๆฅๆฌ่ชใใญใน็จใฎใใผใฏใใคใถใผใๅฟ
่ฆใงใใ
@weaviate.bsky.socialใงใฏ๏ผใคใฎใใผใฏใใคใถใผใไฝฟ็จใใใใจใใงใใพใใ
ไธใคใใคใฎใกใชใใใจใใกใชใใใฏใใกใ
weaviate.io/blog/hybrid-...
Struggling to keep up with new RAG variants?
Hereโs a cheat sheet of 7 of the most popular RAG architectures.
Which variants did we miss?
ใใคใใชใใๆค็ดขใจใฏไฝ๏ผ
ใใคใใชใใๆค็ดขใฏใใใณในใใฏใใซใจในใใผในใใฏใใซใ็ตฑๅใใฆใใใใใใฎๆค็ดขๆๆณใฎๅฉ็นใๆดปใใใพใใ
ใใฎ่จไบใงใฏใWeaviateใฎๆฅๆฌ่ชใใญในใๅใใฎใใคใใชใใๆค็ดขใฎ่ชฌๆใใใพใใ
- ๆฅๆฌ่ชใใญใน็จใฎใใผใฏใใคใถใผใไฝฟ็จใใใญใผใฏใผใๆค็ดข
- ใใฏใใซๆค็ดข
- ่ๅใขใซใดใชใบใ
่ฉณใใใฏใใกใ
https://buff.ly/49yMR9K
Yaaaay!
04.12.2024 20:23 โ ๐ 0 ๐ 0 ๐ฌ 0 ๐ 0
By the way: The star fish on the cover makes a special appearance in the book. Did you spot it?
๐ย Link to the book: www.oreilly.com/library/view...
Look what came in the mail today!
This is already the 2nd edition of โDeveloping apps with GPT-4โ by Olivier and Marie-Alice I had the pleasure to review.
This edition covers the latest advancements in GPT-4, especially regarding its visual capabilities to build multimodal applications.
Oh, this is so neat. Thanks for sharing. Canโt wait to dig in.
03.12.2024 18:59 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0
It's been two years since the release of ChatGPT.
What cool use cases using Generative AI have you seen in the wild so far?
Hereโs a recipe notebook by Mary on RAG over PDF files using Docling and @weaviate.bsky.social.
github.com/weaviate/rec...
Struggling with RAG over PDF files?
You might want to give Docling a try.
๐ช๐ต๐ฎ๐'๐ ๐๐ผ๐ฐ๐น๐ถ๐ป๐ด?
โข Python package by IBM
โข OS (MIT license)
โข PDF, DOCX, PPTX โ Markdown, JSON
๐ช๐ต๐ ๐๐๐ฒ ๐๐ผ๐ฐ๐น๐ถ๐ป๐ด?
โข Doesnโt require fancy gear, lots of memory, or cloud services
โข Works on regular computers or Google Colab Pro