Diego de las Casas 's Avatar

Diego de las Casas

@dlsq.bsky.social

AI Scientist at Mistral AI. Past: Google DeepMind. ๐Ÿ‡ง๐Ÿ‡ท in ๐Ÿ‡ฌ๐Ÿ‡ง

2,336 Followers  |  335 Following  |  22 Posts  |  Joined: 09.03.2024  |  2.2856

Latest posts by dlsq.bsky.social on Bluesky

We're probably at peak research science. Low level stuff is getting automated, coding is basically reviewing, plots are beautiful, everyone gets a brainstorming buddy. Work is mostly ideation/planning. Soon it might become mostly meetings/auditing. Hope I'm wrong, work is pretty fun right now!

05.02.2026 12:51 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
Exposed Moltbook Database Let Anyone Take Control of Any AI Agent on the Site 'It exploded before anyone thought to check whether the database was properly secured.'

The AI community is re-learning 20 years of cybersecurity. The hard way www.404media.co/exposed-molt...

01.02.2026 01:10 โ€” ๐Ÿ‘ 58    ๐Ÿ” 11    ๐Ÿ’ฌ 4    ๐Ÿ“Œ 2

Humming in denial as Material 3 takes over all my screens

04.11.2025 00:34 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

Atrapanubes is such a good Chilean beer. Great taste, great art, great name.

16.03.2025 23:25 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

Gotta wait until he double-crosses Indiana Jones to steal the Holy Grail, I'm afraid.

09.03.2025 14:02 โ€” ๐Ÿ‘ 107    ๐Ÿ” 2    ๐Ÿ’ฌ 3    ๐Ÿ“Œ 0
Preview
Mistral OCR New closed-source specialist OCR model by Mistral - you can feed it images or a PDF and it produces Markdown with optional embedded images. It's available [via their API](https://docs.mistral.ai/api/#tag/ocr), or โ€ฆ

I wrote a CLI script to run PDFs through the new Mistral OCR API model (with some help from Claude) - details on that and notes on the new model here: https://simonwillison.net/2025/Mar/7/mistral-ocr/

07.03.2025 01:40 โ€” ๐Ÿ‘ 12    ๐Ÿ” 4    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
โ€ŽLe Chat by Mistral AI โ€ŽLe Chat combines powerful AI with extensive information on the web to help you rediscover the world. Enjoy natural conversations, real-time internet search, comprehensive document analysis, and much ...

App store: apps.apple.com/us/app/le-ch...

Play store: play.google.com/store/apps/d...

06.02.2025 18:08 โ€” ๐Ÿ‘ 7    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
The all new le Chat: Your AI assistant for life and work | Mistral AI Brand new features, iOS and Android apps, Pro, Team, and Enterprise tiers.

check out more in our post:
mistral.ai/en/news/all-...

06.02.2025 18:08 โ€” ๐Ÿ‘ 5    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Video thumbnail

We've upgraded Le Chat and it's blazing fast right now!
Also available for Android and iOS as of today
mistral.ai/en/news/all-...

06.02.2025 18:07 โ€” ๐Ÿ‘ 11    ๐Ÿ” 2    ๐Ÿ’ฌ 3    ๐Ÿ“Œ 0
Preview
mistralai/Ministral-8B-Instruct-2410 ยท Hugging Face Weโ€™re on a journey to advance and democratize artificial intelligence through open source and open science.

Have you tried our 8B model?
huggingface.co/mistralai/Mi...

31.01.2025 13:45 โ€” ๐Ÿ‘ 1    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
mistral-small Mistral Small 3 sets a new benchmark in the โ€œsmallโ€ Large Language Models category below 70B.

Mistral Small 3 is also available on many partner platforms:
- Ollama: ollama.com/library/mist...
- Kaggle: kaggle.com/models/mistr...
- Fireworks: fireworks.ai/models/firew...
- Together: together.ai/blog/mistral...

And many more soon!

30.01.2025 21:17 โ€” ๐Ÿ‘ 6    ๐Ÿ” 1    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Post image Post image Post image Post image

Performance of Mistral Small 3 Instruct model
huggingface.co/mistralai/Mi...

30.01.2025 21:17 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

Mistral Small 3 Base model
huggingface.co/mistralai/Mi...

30.01.2025 21:17 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Post image

Mistral Small 3 architecture is optimised for latency while preserving high quality

30.01.2025 21:17 โ€” ๐Ÿ‘ 0    ๐Ÿ” 0    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0
Preview
Mistral Small 3 Apache 2.0, 81% MMLU, 150 tokens/s

We're releasing Mistral Small 3!
- 24B params, 81% MMLU
- Latency optimized: 150 tokens/s
- Competitive with Llama-3.3 70B, Qwen-2.5 32B, GPT4o-mini
- Apache 2.0
mistral.ai/news/mistral...

30.01.2025 21:17 โ€” ๐Ÿ‘ 49    ๐Ÿ” 7    ๐Ÿ’ฌ 7    ๐Ÿ“Œ 1
Post image

What people are going to do with AGI

26.01.2025 16:30 โ€” ๐Ÿ‘ 95    ๐Ÿ” 9    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 1
Screen cap from one of the Thor movies featuring a dark haired pale skinned woman as Thor's sister Hela.  She has her hand out stopping Thor's hammer (Mjรถlnir) in mid air.  The hammer is labeled "It's basic biology".  Hela is labeled "Advanced Biology"

Screen cap from one of the Thor movies featuring a dark haired pale skinned woman as Thor's sister Hela. She has her hand out stopping Thor's hammer (Mjรถlnir) in mid air. The hammer is labeled "It's basic biology". Hela is labeled "Advanced Biology"

I know, but it's just an application of one of my favorite memes:

21.01.2025 19:07 โ€” ๐Ÿ‘ 499    ๐Ÿ” 65    ๐Ÿ’ฌ 7    ๐Ÿ“Œ 7
Video thumbnail

agent swarm framework aces spatial reasoning test

25.12.2024 16:59 โ€” ๐Ÿ‘ 130    ๐Ÿ” 32    ๐Ÿ’ฌ 5    ๐Ÿ“Œ 2
Post image

Inventors of flow matching have released a comprehensive guide going over the math & code of flow matching!

Also covers variants like non-Euclidean & discrete flow matching.

A PyTorch library is also released with this guide!

This looks like a very good read! ๐Ÿ”ฅ

arxiv: arxiv.org/abs/2412.06264

10.12.2024 08:35 โ€” ๐Ÿ‘ 109    ๐Ÿ” 26    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1
Building Machine Learning Systems for a Trillion Trillion Floating Point Operations
YouTube video by Jane Street Building Machine Learning Systems for a Trillion Trillion Floating Point Operations

Jane Street, a quant trading firm has a very good YouTube channel. For comparison, DeepSeek is also a quant trading firm.

They recently published a video on "Building Machine Learning Systems for a Trillion Trillion Floating Point Operations".

Link: www.youtube.com/watch?v=139U...

09.12.2024 17:26 โ€” ๐Ÿ‘ 36    ๐Ÿ” 7    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 0

AI Scientists: here is a technology that will automate your grunt work so you can spend more time with your kids

AI Ads: here is a technology that will automate spending time with your kids

03.12.2024 22:35 โ€” ๐Ÿ‘ 5    ๐Ÿ” 2    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

A dataset of 1 million or 2 million Bluesky posts is completely irrelevant to training large language models.

The primary usecase for the datasets that people are losing their shit over isn't ChatGPT, it's social science research and developing systems that improve Bluesky.

28.11.2024 18:57 โ€” ๐Ÿ‘ 252    ๐Ÿ” 39    ๐Ÿ’ฌ 8    ๐Ÿ“Œ 5
Post image

Arxiv sharing reminder

pdf โŒ
abs โœ…

26.11.2024 08:42 โ€” ๐Ÿ‘ 249    ๐Ÿ” 41    ๐Ÿ’ฌ 9    ๐Ÿ“Œ 2

In fact, statistical malpractice is the main driver of progress in machine learning. At some point, we need to come to terms with this.

22.11.2024 14:40 โ€” ๐Ÿ‘ 52    ๐Ÿ” 5    ๐Ÿ’ฌ 3    ๐Ÿ“Œ 6
Preview
[RFC] Per-Parameter-Sharding FSDP ยท Issue #114299 ยท pytorch/pytorch Per-Parameter-Sharding FSDP Motivation As we looked toward next-generation training, we found limitations in our existing FSDP, mainly from the flat parameter construct. To address these, we propos...

Fsdp2 has a different policy for handling streams that is also worth a read
github.com/pytorch/pyto...

23.11.2024 10:49 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Preview
French Revolution: Cyclists Now Outnumber Motorists In Paris Official measurements have found that Paris is rapidly becoming a city of cyclists.

READ: โ€œ3,337 Parisians were equipped with GPS trackers to record their journeysโ€ฆfor journeys from the outskirts of Paris to the center, the number of cyclists now far exceeds the number of motorists, a huge change from just 5 years ago.โ€

Evidence of leadership.
www.forbes.com/sites/carlto...

19.11.2024 19:11 โ€” ๐Ÿ‘ 1124    ๐Ÿ” 323    ๐Ÿ’ฌ 15    ๐Ÿ“Œ 71
Comparison table of various AI models across different benchmarks: Mathvista, MMMU, ChartQA, DocVQA, VQAv2, AI2D, and MM MT-Bench. Models are categorized into Open Weights, Closed, and Unreleased. Key models include Pixtral Large, Llama-3.2 90B, Gemini-1.5 Pro, GPT-4o, Claude-3.5 Sonnet, Llama-3.1 505B, and Grok-2. The table shows measured and reported performance scores, highlighting differences in model capabilities across various tasks. Pixtral Large excels in Mathvista, DocVQA, AI2D and MM MT-Bench benchmarks.

Comparison table of various AI models across different benchmarks: Mathvista, MMMU, ChartQA, DocVQA, VQAv2, AI2D, and MM MT-Bench. Models are categorized into Open Weights, Closed, and Unreleased. Key models include Pixtral Large, Llama-3.2 90B, Gemini-1.5 Pro, GPT-4o, Claude-3.5 Sonnet, Llama-3.1 505B, and Grok-2. The table shows measured and reported performance scores, highlighting differences in model capabilities across various tasks. Pixtral Large excels in Mathvista, DocVQA, AI2D and MM MT-Bench benchmarks.

Pixtral Large:
- 123B decoder, 1B vision encoder, 128K sequence length
- Frontier multimodal model
- Maintains text performance of Mistral Large 2

HF weights: huggingface.co/mistralai/Pi...
Try it: chat.mistral.ai
Blog post: mistral.ai/news/pixtral...

18.11.2024 17:56 โ€” ๐Ÿ‘ 4    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0
Two announcement cards from the Mistral AI team, dated November 18, 2024. The first card announces 'Mistral has entered the chat' with a brief description: 'Search, vision, ideation, coding... all yours for free.' The second card announces 'Pixtral Large' with the description: 'Pixtral grows up.' Both cards feature an orange 'Read More' button.

Two announcement cards from the Mistral AI team, dated November 18, 2024. The first card announces 'Mistral has entered the chat' with a brief description: 'Search, vision, ideation, coding... all yours for free.' The second card announces 'Pixtral Large' with the description: 'Pixtral grows up.' Both cards feature an orange 'Read More' button.

We have 2 new big updates today at Mistral:
- New Le Chat: With canvas, web search, image understanding and generation & more - and free!
- Pixtral Large, our Frontier 124B open weight multimodal model that powers it.

Try it: chat.mistral.ai
Blog post: mistral.ai/news/mistral...

18.11.2024 17:56 โ€” ๐Ÿ‘ 15    ๐Ÿ” 1    ๐Ÿ’ฌ 2    ๐Ÿ“Œ 1
Preview
Diffusion is spectral autoregression A deep dive into spectral analysis of diffusion models of images, revealing how they implicitly perform a form of autoregression in the frequency domain.

There seems to be some renewed interest in making this work in the ML/AI space, so I'm here as well ๐Ÿ‘‹

Here's my latest blog post for good measure, about how diffusion models of images perform autoregression in frequency space: sander.ai/2024/09/02/s...

When I write more, I'll share here as well!

15.11.2024 18:57 โ€” ๐Ÿ‘ 28    ๐Ÿ” 4    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 3

Quick thread in response to a question on token packing practices when pretraining LLMs!

07.11.2024 18:21 โ€” ๐Ÿ‘ 9    ๐Ÿ” 3    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

@dlsq is following 20 prominent accounts