Thomas Capelle's Avatar

Thomas Capelle

@capetorch.bsky.social

Chilean πŸ‡¨πŸ‡± living in France. I build DL models and pipelines. ML Engineer at W&B cargobike β™₯🚴 https://tcapelle.github.io/

710 Followers  |  483 Following  |  185 Posts  |  Joined: 05.09.2024
Posts Following

Posts by Thomas Capelle (@capetorch.bsky.social)

This is the way, glad you liked it!

19.03.2025 22:17 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

it's a really nice place, I agree!

28.02.2025 09:01 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

They basically jailbreak gpt-4o

25.02.2025 22:10 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Same vibes git commit -m "pbar"

25.02.2025 22:08 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

happy to take a look on a call =)

24.02.2025 08:46 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Can you share a workspace?

22.02.2025 10:39 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

This was a team effort from @morgymcg.bsky.social , Soumik, @parambharat.bsky.social , Agata Mlynarczyk, @ayshthkr.bsky.social and many others!

21.02.2025 13:48 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Local Weave Scorers | W&B Weave Weave's local scorers are a suite of small language models that run locally on your machine with minimal latency. These models evaluate the safety and quality of your AI system’s inputs, context, and ...

I'm excited to see how the community uses these tools, and I'm looking forward to more innovations in safe and reproducible AI!

Check the scorers and Weave here:

πŸ‘‰ wandb.me/weave_scorers

πŸ“š A colab: wandb.me/scorers_colab

21.02.2025 13:48 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

A personal highlight was working on the Fluency Scorer powered by AnswerDotAI ModernBERT-base; we hope to move all DeBerta-powered scorers to ModernBert in the next release so we can benefit from the longer context length and training speed!

21.02.2025 13:48 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

As part of this initiative, we also created comprehensive evaluation datasets, drawing on invaluable contributions from the open-source community. Being a reproducibility-first company, we’ve made the full recipe public, including the scorers, model weights, and the training and evaluation datasets

21.02.2025 13:48 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

We designed these non-LLM powered scorers to leverage state-of-the-art open source models – from the PleIAI/Celadon toxicity detector to the Vectara hallucination scorer – ensuring that our AI systems are evaluated across multiple dimensions.

21.02.2025 13:48 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Local Weave Scorers | W&B Weave Weave's local scorers are a suite of small language models that run locally on your machine with minimal latency. These models evaluate the safety and quality of your AI system’s inputs, context, and ...

Over the past few months, my team at Weights & Biases has been hard at work launching Weave Scorers and guardrails.

wandb.me/weave_scorers

πŸ‘‡

21.02.2025 13:48 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Disappointed Cat GIF ALT: Disappointed Cat GIF
21.02.2025 06:15 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

We are cooking here...

15.02.2025 10:54 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Same vibes, PR submitted, PR merged.

14.02.2025 22:05 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

- new MacBook pro 😍
- french keyboard layout 😭

14.02.2025 07:10 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Many people have asked me about the France Action Summit.

I think a summit is typically most valuable as a catalyst, not as a solution in itself.

But, will share some observations.

13.02.2025 09:08 β€” πŸ‘ 42    πŸ” 10    πŸ’¬ 2    πŸ“Œ 2

It could have been called Gulf of North America

13.02.2025 08:35 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

This is my favorite kind of Yoga

12.02.2025 18:31 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I just built a CI to run an Eval of some custom LLM scorers on top of @modal-labs.bsky.social
- Great to test against different GPUs
- No custom runner neded on github
- Fast and nice console outputs =)

12.02.2025 10:53 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Butternut Soup and kimchi side

10.02.2025 18:41 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Échalotes, tomates sèches et moules.

09.02.2025 16:42 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Samedi tu fais moules et frites, dimanche tu finis les moules dans un rissoto aux moules.

09.02.2025 16:42 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Why is the USA the only country in the world with bird flu H5N1 ripping through cattle herds?

Why is the USA the only country in the world with bird flu H5N1 ripping through cattle herds?

Because in the United States, it’s legal to feed chicken shit to cattle.

That’s why. That’s literally the reason

www.telegraph.co.uk/global-healt...

08.02.2025 18:31 β€” πŸ‘ 14937    πŸ” 4254    πŸ’¬ 262    πŸ“Œ 426
Post image Post image

Pancakes morning with the arrival of the @vendeeglobe.bsky.social

09.02.2025 10:17 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

C'est super bon Γ§a !

08.02.2025 16:50 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Don't miss Stacey in Paris!

08.02.2025 14:00 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

We raised this internally! thanks for the info.

06.02.2025 19:57 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

This budget forcing is really smart. We could do that we prefill on API models no?

05.02.2025 22:39 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

This is getting out of hands...

05.02.2025 17:04 β€” πŸ‘ 6    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0