leonie's Avatar

leonie

@iamleonie.bsky.social

I do Machine Learning at Weaviate and write about it on the internet.

206 Followers  |  83 Following  |  36 Posts  |  Joined: 19.11.2024
Posts Following

Posts by leonie (@iamleonie.bsky.social)

*based on the dimensions "human vs. machine" and "realistic vs. comic"

14.01.2026 14:08 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

I've seen a lot of explanations on similarity measures in vector search but this one by my colleague
@dadoonet is by far the most fun!

How similar* is Han Solo to:
โ€ข Princess Leia: very similar
โ€ข Obi-Wan: meh
โ€ข Darth Vader: complete opposites

Talk slides: david.pilato.fr/talks/2025/2...

14.01.2026 14:08 โ€” ๐Ÿ‘ 1    ๐Ÿ” 1    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 0

What's the most underrated embedding technique you've used?

Static embeddings -> speed-improvements
Binary quantization -> storage-reduction
Late interaction -> added granularity

I'm curious about lesser-known approaches that worked surprisingly well.

14.05.2025 12:31 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Roses are red,
violets are blue,
A good baseline embedding model
is all-MiniLM-L6-v2.

14.02.2025 08:41 โ€” ๐Ÿ‘ 5    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

ETA: uses Anthropicโ€˜s citations API

11.02.2025 18:33 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Make RAG results more trustworthy with citations.

In his latest recipe, @danman966.bsky.social shows you how you can build a RAG pipeline with citations, using:
- a @weaviate.bsky.social vector database and
- @anthropic.com's Claude 3.5 Sonnet

๐Ÿ“Œ Code: github.com/weaviate/rec...

11.02.2025 15:48 โ€” ๐Ÿ‘ 3    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Haha, what specialized topics are you planning to catch up on in the field of AI agents?

31.01.2025 09:51 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Normalize not knowing everything in the AI space.

It's evolving fast.
Iโ€™m sure your to-do list is growing as fast as mine.

Here are 3 topics, I want to catch up on this quarter:

โ€ข AI agents
โ€ข Fine-tuning embedding models
โ€ข Multimodality
โ€ข (If time permits: reinforcement learning)

What about you?

31.01.2025 09:15 โ€” ๐Ÿ‘ 6    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

Iโ€™m trying to wrap my head around multi-agent system architectures.

Here are some patterns Iโ€™m seeing so far:

1. Type of collaboration:
Network vs. hierarchical

2. Type of information flow:
Sequential vs. parallel vs. loop

3. Type of functionality:
Routing vs. aggregating

What else?

28.01.2025 17:00 โ€” ๐Ÿ‘ 6    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Some considerations for choosing a vector dimension:

1. Data complexity
2. Task complexity
3. Dataset size
4. Computational constraints
5. Performance requirements
6. Scalability requirements
7. Latency requirements

What else?

26.01.2025 13:03 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Video thumbnail

#1 Rule of RAG Club: Look at your data.

With the new explorer tool, looking at your data got a lot easier in Weaviate Cloud.

The explorer tool provides a graphical interface to easily:
โ€ข Browse collections
โ€ข Inspect objects, metadata, and vectors

Check it out now: https://buff.ly/3KWivSF

22.01.2025 16:00 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

You can be GPU poor like me and still fine-tune an LLM.

Hereโ€™s how you can fine-tune Gemma 2 in a Kaggle notebook on a single T4 GPU:
โ€ข @kaggle.com offers 30 hours/week of GPUs for free
โ€ข @unsloth.bsky.social uses 60% less memory to fit it on a T4 GPU

๐Ÿ”—Code: https://buff.ly/4apUUG2

21.01.2025 16:00 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Although I know that

Vertical scaling: scaling up (to a more powerful machine)

Horizontal scaling: scaling out (to multiple smaller machines)

I still always have to take a second to think about it.

Itโ€™s like the left-right-weakness of system design.

18.01.2025 10:03 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

I talk about RAG so much, I could fill a book.

So, we did - and you can download it for free.

Together with my colleagues Mary & Prajjwal, we curated an e-book of the most effective advanced RAG techniques.

Which ones did we miss?

Get it now: weaviate.io/ebooks/advan...

16.01.2025 12:46 โ€” ๐Ÿ‘ 7    ๐Ÿ” 3    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Over the holidays, I learned how to fine-tune an LLM.

Hereโ€™s my entry for the latest @kaggle.com comp.

This tutorial shows you:
โ€ข Fine-tune Gemma 2
โ€ข LoRA fine-tuning with @unsloth.bsky.social on T4 GPU
โ€ข Experiment tracking with @weightsbiases.bsky.social

๐Ÿ”—Code: www.kaggle.com/code/iamleon...

15.01.2025 12:50 โ€” ๐Ÿ‘ 3    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Thanks! Merry Christmas to you, too, Tomaz!

20.12.2024 08:09 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Got myself a little early Christmas present.

Although this book is from 2017, I heard so many good things about it this year.

Can't wait to dig into this over the holidays.

And with that being said, I hope you have some nice and relaxing holidays yourself!

See you in the new year!

19.12.2024 15:55 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
2023 in Review: Recapping the Post-ChatGPT Era and What to Expect for 2024 How the LLMOps landscape has evolved and why we havenโ€™t seen many Generative AI applications in the wild yetโ€Šโ€”โ€Šbut maybe in 2024.

Last yearโ€™s predictions: towardsdatascience.com/2023-in-revi...

17.12.2024 18:30 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

To make it a little bit more fun, Iโ€™m making some bolder predictions for 2025 this time:
โ€ข Video will be an important modality
โ€ข Moving from one-shot to agentic to human-in-the-loop
โ€ข Fusion of AI and crypto
โ€ข Latency and cost per token will drop

What other trends are you observing in the AI space?

17.12.2024 18:30 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

Itโ€™s time to review the AI space in 2024!

Hereโ€™s what I got right (and what I missed) in my 2024 predictions:

โœ…ย Evaluation
โŒย Multimodal foundation models
โŒย Fine-tuning open-weight models and quantization
โŒย AI agents
โœ…ย RAG lives on
โŒย Knowledge graphs

medium.com/towards-data...

17.12.2024 18:30 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

ๆ—ฅๆœฌ่ชžใƒ†ใ‚ญใ‚นใƒˆๅ‘ใ‘ใฎใƒใ‚คใƒ–ใƒชใƒƒใƒ‰ๆคœ็ดขใซใฏๆ—ฅๆœฌ่ชžใƒ†ใ‚ญใ‚น็”จใฎใƒˆใƒผใ‚ฏใƒŠใ‚คใ‚ถใƒผใŒๅฟ…่ฆใงใ™ใ€‚

@weaviate.bsky.socialใงใฏ๏ผ“ใคใฎใƒˆใƒผใ‚ฏใƒŠใ‚คใ‚ถใƒผใ‚’ไฝฟ็”จใ™ใ‚‹ใ“ใจใŒใงใใพใ™ใ€‚

ไธ€ใคใšใคใฎใƒกใƒชใƒƒใƒˆใจใƒ‡ใƒกใƒชใƒƒใƒˆใฏใ“ใกใ‚‰
weaviate.io/blog/hybrid-...

17.12.2024 14:41 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Struggling to keep up with new RAG variants?

Hereโ€™s a cheat sheet of 7 of the most popular RAG architectures.

Which variants did we miss?

10.12.2024 17:00 โ€” ๐Ÿ‘ 14    ๐Ÿ” 3    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

ใƒใ‚คใƒ–ใƒชใƒƒใƒ‰ๆคœ็ดขใจใฏไฝ•๏ผŸ

ใƒใ‚คใƒ–ใƒชใƒƒใƒ‰ๆคœ็ดขใฏใ€ใƒ‡ใƒณใ‚นใƒ™ใ‚ฏใƒˆใƒซใจใ‚นใƒ‘ใƒผใ‚นใƒ™ใ‚ฏใƒˆใƒซใ‚’็ตฑๅˆใ—ใฆใ€ใใ‚Œใžใ‚Œใฎๆคœ็ดขๆ‰‹ๆณ•ใฎๅˆฉ็‚นใ‚’ๆดปใ‹ใ—ใพใ™ใ€‚

ใ“ใฎ่จ˜ไบ‹ใงใฏใ€Weaviateใฎๆ—ฅๆœฌ่ชžใƒ†ใ‚ญใ‚นใƒˆๅ‘ใ‘ใฎใƒใ‚คใƒ–ใƒชใƒƒใƒ‰ๆคœ็ดขใฎ่ชฌๆ˜Žใ‚’ใ—ใพใ™ใ€‚

- ๆ—ฅๆœฌ่ชžใƒ†ใ‚ญใ‚น็”จใฎใƒˆใƒผใ‚ฏใƒŠใ‚คใ‚ถใƒผใ‚’ไฝฟ็”จใ™ใ‚‹ใ‚ญใƒผใƒฏใƒผใƒ‰ๆคœ็ดข
- ใƒ™ใ‚ฏใƒˆใƒซๆคœ็ดข
- ่žๅˆใ‚ขใƒซใ‚ดใƒชใ‚บใƒ 

่ฉณใ—ใใฏใ“ใกใ‚‰
https://buff.ly/49yMR9K

10.12.2024 23:02 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Yaaaay!

04.12.2024 20:23 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Developing Apps with GPT-4 and ChatGPT, 2nd Edition This book provides an ideal guide for Python developers who want to learn how to build applications with large language models. Authors Olivier Caelen and Marie-Alice Blete cover the main โ€ฆ - Selecti...

By the way: The star fish on the cover makes a special appearance in the book. Did you spot it?

๐Ÿ“Œย Link to the book: www.oreilly.com/library/view...

04.12.2024 17:03 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Look what came in the mail today!

This is already the 2nd edition of โ€œDeveloping apps with GPT-4โ€ by Olivier and Marie-Alice I had the pleasure to review.

This edition covers the latest advancements in GPT-4, especially regarding its visual capabilities to build multimodal applications.

04.12.2024 17:03 โ€” ๐Ÿ‘ 3    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Oh, this is so neat. Thanks for sharing. Canโ€™t wait to dig in.

03.12.2024 18:59 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

It's been two years since the release of ChatGPT.

What cool use cases using Generative AI have you seen in the wild so far?

30.11.2024 16:00 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
recipes/integrations/data-platforms/ibm/docling/rag_over_pdfs_docling_weaviate.ipynb at main ยท weaviate/recipes This repository shares end-to-end notebooks on how to use various Weaviate features and integrations! - weaviate/recipes

Hereโ€™s a recipe notebook by Mary on RAG over PDF files using Docling and @weaviate.bsky.social.

github.com/weaviate/rec...

28.11.2024 13:34 โ€” ๐Ÿ‘ 4    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image

Struggling with RAG over PDF files?

You might want to give Docling a try.

๐—ช๐—ต๐—ฎ๐˜'๐˜€ ๐——๐—ผ๐—ฐ๐—น๐—ถ๐—ป๐—ด?
โ€ข Python package by IBM
โ€ข OS (MIT license)
โ€ข PDF, DOCX, PPTX โ†’ Markdown, JSON

๐—ช๐—ต๐˜† ๐˜‚๐˜€๐—ฒ ๐——๐—ผ๐—ฐ๐—น๐—ถ๐—ป๐—ด?
โ€ข Doesnโ€™t require fancy gear, lots of memory, or cloud services
โ€ข Works on regular computers or Google Colab Pro

28.11.2024 13:34 โ€” ๐Ÿ‘ 13    ๐Ÿ” 2    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0