Ben Walker

@benjamincwalker.bsky.social

🎓 Machine Learning PhD 🌍 Mathematical Institute, Oxford 📈 Researching Neural Differential Equations & Rough Path Theory 📧 Email: MLBenjaminWalker@gmail.com 🌐 GitHub: Benjamin-Walker

32 Followers  |  58 Following  |  11 Posts  |  Joined: 20.10.2024

Latest posts by benjamincwalker.bsky.social on Bluesky


Really enjoyed chatting with @oxfordmathematics.bsky.social
about AI in mathematics, where it can genuinely help, and what some of the limitations are.

13.02.2026 07:56 - 👍 0    🔁 0    💬 0    📌 0

Alternative Title: ‘A Timely Series of Talks on Time Series.’

Was too proud of this one so had to post it somewhere!

03.03.2025 17:54 - 👍 2    🔁 0    💬 0    📌 0

Just wrapped up my short course ‘Time Series Modelling: From Foundations to Frontiers’ at the Oxford Internet Institute @oii.ox.ac.uk

Huge thanks to @ammaox.bsky.social for the invitation - had some really engaging discussions!

Looking forward to being back at the OII soon.

03.03.2025 17:54 - 👍 1    🔁 0    💬 1    📌 0

Sonnet-3.7 has me vibe coding for the first time 🎧💻

Never written HTML, CSS, or JavaScript before, but I’ve created the website I’ve always wanted, featuring an optional command line interface ✨

BenWalker.co.uk

25.02.2025 13:57 - 👍 1    🔁 0    💬 0    📌 0

Looking forward to presenting this work at #NeurIPS2024 !

Come find us on Thursday from 11-2 @ West Ballroom A-D #6907

09.12.2024 23:19 - 👍 3    🔁 0    💬 0    📌 0

D&D Combinatorics xkcd.com/3015

23.11.2024 00:59 - 👍 26879    🔁 2156    💬 201    📌 109

Huge thanks to my incredible co-authors Nicola Cirone, Antonio Orvieto, Cristopher Salvi, and Terry Lyons!

#NeurIPS2024 #MachineLearning #DeepLearning #StateSpaceModels

🧵6/6

23.11.2024 09:04 - 👍 2    🔁 0    💬 0    📌 0

S4, Mamba, and Transformers need 4 blocks just to compose 12 permutations!

In contrast, using a dense state-transition matrix (IDS4/Linear CDE) or a non-linear state-transition (RNN) allows for state-tracking with only 1 layer.

🧵5/6

23.11.2024 09:04 - 👍 1    🔁 0    💬 1    📌 0
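
A minimal sketch of the one-layer claim in the 5/6 post, not the paper's code: when each input token selects a dense state-transition matrix (here a permutation matrix), a single linear recurrence H_t = P(x_t) H_{t-1} holds the composed permutation in its state. It draws from all permutations of five elements rather than only A5 to stay short, and the helper names `perm_matrix`, `matmul`, `track_state`, and `compose_reference` are hypothetical.

```python
import itertools
import random


def perm_matrix(perm):
    """Dense 0/1 matrix P with P[perm[i]][i] = 1, so P maps e_i to e_perm[i]."""
    n = len(perm)
    return [[1 if perm[col] == row else 0 for col in range(n)] for row in range(n)]


def matmul(a, b):
    """Plain square matrix product on nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def track_state(perms):
    """One linear recurrence H_t = P(x_t) @ H_{t-1}, starting from the identity."""
    n = len(perms[0])
    state = [[int(i == j) for j in range(n)] for i in range(n)]
    for perm in perms:
        state = matmul(perm_matrix(perm), state)
    # Column c holds a single 1; its row index is the image of element c.
    return tuple(next(r for r in range(n) if state[r][c]) for c in range(n))


def compose_reference(perms):
    """Reference answer: apply the permutations one after another."""
    current = list(range(len(perms[0])))
    for perm in perms:
        current = [perm[c] for c in current]
    return tuple(current)


# Assumption for illustration: 12 random permutations of 5 elements.
rng = random.Random(0)
all_perms = list(itertools.permutations(range(5)))
sequence = [rng.choice(all_perms) for _ in range(12)]
assert track_state(sequence) == compose_reference(sequence)
```

The state here is an n-by-n matrix, so the recurrence tracks where every element ends up; a learned model would have to find such transition matrices from data, which this sketch does not attempt.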

An excellent empirical example of this limited capacity is the A5 benchmark, from “The Illusion of State in State-Space Models” by Merrill et al.

The benchmark tests state-tracking, a crucial ability for tasks involving permutation composition like chess.

The results? 👇

🧵4/6

23.11.2024 09:04 - 👍 1    🔁 0    💬 1    📌 0

We rigorously show that Mamba’s selectivity mechanism boosts expressiveness.

However, we also show that using a diagonal state-transition matrix, while drastically reducing computational costs, significantly limits the model's capacity.

🧵3/6

23.11.2024 09:04 - 👍 1    🔁 0    💬 1    📌 0
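
One informal way to see the cost of a diagonal transition, offered as an illustration rather than the paper's argument: diagonal matrices always commute, so a purely multiplicative recurrence h_t = A(x_t) h_{t-1} with diagonal A (a simplification that drops the input term of a real SSM) ends in the same state whatever the input order, while permutation matrices generally do not commute. The helper names below are hypothetical.

```python
def matmul(a, b):
    """Plain square matrix product on nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def diag(values):
    """Diagonal matrix with the given entries."""
    n = len(values)
    return [[values[i] if i == j else 0 for j in range(n)] for i in range(n)]


def perm_matrix(perm):
    """Dense 0/1 matrix P with P[perm[i]][i] = 1."""
    n = len(perm)
    return [[1 if perm[col] == row else 0 for col in range(n)] for row in range(n)]


def commutes(a, b):
    """True when the order of the two transitions does not matter."""
    return matmul(a, b) == matmul(b, a)


d1, d2 = diag([2, 3, 5]), diag([7, 11, 13])
swap01, swap12 = perm_matrix((1, 0, 2)), perm_matrix((0, 2, 1))

print(commutes(d1, d2))          # True: diagonal transitions ignore order
print(commutes(swap01, swap12))  # False: dense transitions can encode order
```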

In this paper, we introduce a unified framework for state-space models using Rough Path Theory, providing a rigorous theoretical foundation for why the Mamba recurrence outperforms other SSMs, and precisely where their expressiveness may be limited.

🧵2/6

23.11.2024 09:04 - 👍 1    🔁 0    💬 1    📌 0
Preview: Theoretical Foundations of Deep Selective State-Space Models
Structured state-space models (SSMs) such as S4, stemming from the seminal work of Gu et al., are gaining popularity as effective approaches for modeling sequential data. Deep SSMs demonstrate outstan...

Want to know why Mamba beats other state-space models, and where it falls short?

Then check out our #NeurIPS 2024 paper: "Theoretical Foundations of Deep Selective State-Space Models."

🔗 Read the paper: arxiv.org/abs/2402.19047
💻 Access the code: github.com/Benjamin-Walke…

🧵1/6

23.11.2024 09:04 - 👍 21    🔁 2    💬 2    📌 1
