Thomas Ahle's Avatar

Thomas Ahle

@thomasahle.bsky.social

Head of AI @ NormalComputing. Tweets on Math, AI, Chess, Probability, ML, Algorithms and Randomness. Author of tensorcookbook.com

268 Followers  |  379 Following  |  29 Posts  |  Joined: 01.09.2024  |  1.9938

Latest posts by thomasahle.bsky.social on Bluesky

The "kombucha girl" meme, where the left panel (with her frowning) stating "According to the proposition" and the right one (showing her interested face) stating "Lemma tell you"

The "kombucha girl" meme, where the left panel (with her frowning) stating "According to the proposition" and the right one (showing her interested face) stating "Lemma tell you"

Mathematical writing is my passion.

11.01.2026 07:13 β€” πŸ‘ 53    πŸ” 6    πŸ’¬ 0    πŸ“Œ 0
Post image

like this?

12.11.2025 10:09 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Added a new symbols menu - let me know if I missed any of your favourite LaTeX commands!

11.11.2025 00:02 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0

I can't tell how much interest there is. But messages like this definitely encourage me to continue it!

24.04.2025 06:29 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
LaTeX to Image Effortlessly convert LaTeX math equations into high-quality images (PNG, JPEG, SVG).

I needed an easy way to make high resolution equations to post on Bluesky, so I made this: thomasahle.com/latex2png

16.03.2025 10:15 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 0    πŸ“Œ 1

> If NATO hadn't been trying to expand there, there would have been no war.

There would.

> If NATO stops trying to expand into Ukraine, the war ends.

It wouldn't.

> If the US stops sending weapons and fomenting anti-Russian sentiment, the war ends.

This war is about territory not sentiment.

19.02.2025 21:19 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Tensorgrad Playground The Tensor Cookbook is a comprehensive guide to tensors, using the visual language of tensor diagrams. It closely follows the legendary 'Matrix Cookbook' while pr...

You can play around with expectations of higher order Gaussians using the new
tensorcookbook.com/playground

19.02.2025 21:15 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Isserlis' (or Wick's) theorem is one of the strongest tools to handle High Dimensional Gaussians.

Turns out it generalizes to _every distribution_ using cumulant tensors!

That's higher order variance, skewness, kurtosis, etc.

19.02.2025 21:14 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

I added a Playground to tensorcookbook.com for when you need that Matrix or Tensor Derivative in a hurry.

Hopefully it can also be a way to help people become familiar with tensor diagrams.

18.02.2025 08:01 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Now we're just waiting for a ZkiT model

12.02.2025 08:57 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Now live in a new Functions chapter in tensorcookbook.com

09.02.2025 11:24 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Some sketches for the next chapter

06.02.2025 10:28 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 1
Post image

I added code execution to tensorcookbook.com so you can try tensorgrad's automatic tensor algebra without installing anything.

04.02.2025 15:58 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Professor Rasmus Pagh Receives International Recognition as ACM Fellow Professor Rasmus Pagh has been named a 2024 Fellow of the Association for Computing Machinery (ACM), a prestigious honor awarded to leading researchers in computing. This recognition highlights his si...

πŸŽ‰ Congratulations to Rasmus Pagh @rasmuspagh.net, the inventor of Cuckoo Hashing, and my PhD advisor, for becoming an ACM fellow! πŸŽ‰
di.ku.dk/english/news...

28.01.2025 16:35 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Tensor Product Attention illustrated with Tensor Diagrams

18.01.2025 14:00 β€” πŸ‘ 5    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Neat one-page proof of "Stirling's bound"

(n/e)ⁿ√{2Ο€ n} ≀ n! ≀ (n/e)ⁿ(√{2Ο€ n}+1)

Inspired by the discussion on mathoverflow.net/a/458011/5429. Just had to keep hitting it with logarithmic inequalities...

18.12.2024 12:37 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Yes please!

13.12.2024 07:43 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Poisson Probability Puzzle:

Let X ~ Poisson(πœ‡); Z = (X - πœ‡)/βˆšπœ‡; Y ~ Normal(0, 1).
How close is E[|X|^k] is to E[|Y|^k]?

Say we connect πœ‡ and k by πœ‡ = c kΒ³, what is now the limit E[|X|^k]/E[|Y|^k] as k β†’ ∞?

This was harder to solve than expected, but the answer was surprisingly pretty 🌻

12.12.2024 23:41 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

"Central Limit Theorem" for the Poisson Distribution

11.12.2024 10:45 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 1
Thomas Dybdahl Ahle Thomas Dybdahl Ahle is a researcher in the theoretical foundations of machine learning and massive data, including similarity search, high dimensional geometry, kernel methods, sketching and derandomi...

A while ago Twitter removed the option of embedding your timeline on your website. Luckily, with Bluesky, I'm now able to put it back on thomasahle.com. Good to be back.

03.12.2024 08:54 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Can you refer me to the openai forum?

03.12.2024 06:28 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
History Heuristic - Chessprogramming wiki

For more information on history heuristics in chess, see www.chessprogramming.org/History_Heur...

03.12.2024 05:42 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
History Heuristic - Chessprogramming wiki

near future.

Time will tell if they'll update the entire network, or a smaller LoRA or side network.

Even chatbots like o1 could use TTT as an alternative to in context learning.

5/5

03.12.2024 05:42 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

while searching. If two subtrees are conceptually similar, it has to do all the work twice.

Test Time Training fixes this!
If AlphaZero updated its weights while searching, it could transfer learnings between the subtrees!

I'm sure we'll start seeing a lot of TTT architectures in the near...

4/5

03.12.2024 05:42 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Obviously having a pretrained cnt[from][to] array wouldn't be helpful at all in chess, as moves may be good or bad entirely dependent on the position.

But because the butterfly table is reset at every search, it encodes "local information".

AlphaZero meanwhile doesn't learn anything while...

3/5

03.12.2024 05:42 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Post image

Chess engines like Stockfish will keep a so-called butterfly board, keeping track of how often a move was chosen in the search tree. _Independently of the position_.

This is data is considered elsewhere in the search tree to decide how much time to spend considering the move.

Why do this?

2/5

03.12.2024 05:42 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Test Time Training promises to finally unify learning and search. As always, chess is a good place to study such ideas:

AlphaZero generalized and simplified most of the tricks in chess engines like Stockfish, but one category is missing: history heuristics...

1/5

03.12.2024 05:42 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Your o1 supports images?

30.11.2024 19:40 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Making a wiki style website is a good way to do this, while encouraging others from. The community to contribute and keep it updated.

In fact, writing good Wikipedia articles for your field might be the best way to spread this knowledge.

30.11.2024 15:51 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Clever use of the KV-cache: Writing in the margins (arxiv.org/abs/2408.14906) at Neurips next week.

By "taking notes" as you read, ypu reduce the complexity from N^3 (N tokens at N^2 cost) to N^3/3 (1+4+9+...+N^2).

29.11.2024 16:27 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@thomasahle is following 20 prominent accounts