Reuben Bond

@rbn.bsky.social

Distributed Systems & databases person. Works at Microsoft on Orleans & Aspire

2,993 Followers  |  246 Following  |  114 Posts  |  Joined: 24.04.2023  |  1.9026

Latest posts by rbn.bsky.social on Bluesky

F3: The Open-Source Data File Format for the Future
SIGMOD 2025

Our SIGMOD paper with our friends at Tsinghua + @wesmckinney.com + @pateljm.bsky.social on creating a next-generation open-source data file format is out. F3 is a future-proof file format that avoids the mistakes of Parquet.
📄 Paper: db.cs.cmu.edu/papers/2025/...
Code: github.com/future-file-...

01.10.2025 13:49 | 👍 70    🔁 21    💬 4    📌 5

Why, hello there! 🪟

Thanks @rbn.bsky.social :D

16.09.2025 19:07 | 👍 2    🔁 1    💬 0    📌 0

I have an Aer one. You can get decent Tomtoc ones which mimic the designs of popular brands for less $. I'm unsure of the quality, but I imagine they are decent.

31.08.2025 23:11 | 👍 3    🔁 0    💬 1    📌 0

Async/await was a developer experience regression compared with blocking APIs, but a huge advancement over the APIs it replaced, which were callback based.
I'm optimistic about virtual threads as a best of both worlds, if we can retain the benefits of futures & structured concurrency

04.08.2025 17:02 | 👍 7    🔁 0    💬 1    📌 0

Ah, thanks. How much of it is attributable to BEAM itself, I wonder. Using AOT instead of the JIT cuts memory usage in .NET by about half, for ex. IIRC, the numbers below are steady state under load, not startup, but either way

04.08.2025 16:42 | 👍 1    🔁 0    💬 0    📌 0

Thanks. Perhaps there is still low-hanging fruit for Elixir/Phoenix/BEAM when it comes to reducing startup memory consumption. I'm sure it would still consume more than a Rust app, given the nature of the runtimes.

04.08.2025 16:30 | 👍 0    🔁 0    💬 0    📌 0

What is MB RES?

04.08.2025 16:14 | 👍 0    🔁 0    💬 2    📌 0

Pat Helland delivered a presentation on this paper: hpts.ws/papers/2024/...

09.04.2025 16:06 | 👍 14    🔁 4    💬 1    📌 0

In the last week I probably explained Rateless Set Reconciliation to a dozen other scientists. What an amazing paper and result, and already one year old.

07.04.2025 18:55 | 👍 51    🔁 6    💬 6    📌 2

It's just a toy / proof-of-concept, really, but it might be useful to refer to.

03.03.2025 17:51 | 👍 0    🔁 0    💬 0    📌 0
Reuben Bond - Orleans under the hood (Dotnetos Conference 2021) YouTube video by Dotnetos

Thank you, Jeremy! I gave a presentation about some of the design considerations in the serialization & RPC system: www.youtube.com/live/kgRag4E...

29.01.2025 04:32 | 👍 4    🔁 0    💬 1    📌 0

All things agents. I'm interested in builder communities rather than per-product servers

04.01.2025 22:29 | 👍 1    🔁 0    💬 1    📌 0

What's a good discord server for people building things in the AI/LLM space?

04.01.2025 22:08 | 👍 7    🔁 0    💬 2    📌 0

Committing the first value is safe as-is, but after playing with this more I believe subsequent Fast Rounds require additional safety rules:
1. Proposers only propose updates to known-committed values
2. Acceptors require that the value's version (akin to a slot number) has increased or that the values are identical

03.01.2025 17:33 | 👍 0    🔁 0    💬 1    📌 0
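The two rules in the post above can be sketched roughly as follows. This is only an illustration of the stated rules, not the author's implementation: the names `Versioned`, `next_proposal`, and `Acceptor` are hypothetical, and ballot/round handling, quorums, and recovery are deliberately left out.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Versioned:
    version: int  # akin to a slot number
    value: str


def next_proposal(committed: Versioned, new_value: str) -> Versioned:
    # Rule 1 (assumed reading): a proposal is always built on top of a
    # value the proposer knows to be committed.
    return Versioned(committed.version + 1, new_value)


class Acceptor:
    def __init__(self) -> None:
        self.accepted: Optional[Versioned] = None

    def accept(self, proposal: Versioned) -> bool:
        current = self.accepted
        # Rule 2: accept if nothing is held yet or the version has increased...
        if current is None or proposal.version > current.version:
            self.accepted = proposal
            return True
        # ...otherwise accept only when the value is identical to the one held.
        return proposal.value == current.value
```

A conflicting value at the same version is rejected, while a retry of the same value is treated as idempotent.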

Did he have his onion juice with him or a couple of pocket eggs? If not, he's ngmi

03.01.2025 07:18 | 👍 0    🔁 0    💬 1    📌 0

gotta max out those T levels somehow

03.01.2025 04:19 | 👍 0    🔁 0    💬 1    📌 0

Feel free to dm

16.12.2024 20:38 | 👍 0    🔁 0    💬 0    📌 0

Is there a doc describing it?

15.12.2024 18:54 | 👍 0    🔁 0    💬 1    📌 0

Link?

15.12.2024 18:43 | 👍 0    🔁 0    💬 2    📌 0

"Nothing more" goes a bit too far: CRDTs have limitations on their behavior. You can't implement just anything using CRDTs. Their utility is quite limited, and hence they are not widely deployed within datacenter-based apps (I know of none, and no one has provided examples yet).

15.12.2024 18:43 | 👍 0    🔁 0    💬 0    📌 0

It depends on the CRDT, but that's not my area of expertise. The journaled grains use event sourcing (modified for geo distributed environments), and I believe you could implement CRDTs on top fairly easily, but the API also exposes coordination/sync operations for consistency.

15.12.2024 18:01 | 👍 0    🔁 0    💬 2    📌 0

What I described is not a CRDT. I am talking about ACID database transactions using optimistic concurrency control.

15.12.2024 17:10 | 👍 0    🔁 0    💬 1    📌 0

You can version the data and check for conflicts at commit time, similar to a database transaction with optimistic concurrency control.

15.12.2024 16:48 | 👍 0    🔁 0    💬 1    📌 0
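A minimal sketch of the "version the data and check for conflicts at commit time" idea from the post above, assuming a single in-memory store. `VersionedStore` and `ConflictError` are hypothetical names used only for illustration; a real database would do this check atomically inside a transaction.

```python
class ConflictError(Exception):
    """Raised when the data changed since the caller read it."""


class VersionedStore:
    def __init__(self):
        self._data = {}  # key -> (version, value)

    def read(self, key):
        # Unknown keys read as version 0 with no value.
        return self._data.get(key, (0, None))

    def commit(self, key, expected_version, new_value):
        current_version, _ = self._data.get(key, (0, None))
        # Conflict check at commit time: the write only succeeds if nobody
        # else committed since this caller read the key.
        if current_version != expected_version:
            raise ConflictError(
                f"{key}: expected v{expected_version}, found v{current_version}"
            )
        self._data[key] = (current_version + 1, new_value)
        return current_version + 1
```

A writer that loses the race gets a `ConflictError` and can re-read and retry, rather than silently overwriting the other update.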

In the online case you can coordinate changes. In the offline case you cannot: you have divergent data replicas being updated independently. The changes need to be merged eventually and there is ambiguity as to how.

15.12.2024 16:41 | 👍 0    🔁 0    💬 1    📌 0

That's CRDT territory, but the goalposts just shifted

15.12.2024 16:23 | 👍 0    🔁 0    💬 1    📌 0

Interesting workshop and a lovely community, would very much recommend!

15.12.2024 16:04 | 👍 8    🔁 7    💬 1    📌 0

The sources can change, but each one is authoritative (e.g. for FX rates), so you can assign a sequence number to each update, which can be used to ensure consistency.

15.12.2024 16:14 | 👍 0    🔁 0    💬 1    📌 0
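One way to read the per-source sequence number idea in the post above is sketched below, assuming each authoritative source stamps its own updates with an increasing integer. `SourceTracker` and the source name `"ecb"` are hypothetical and used only for illustration.

```python
class SourceTracker:
    def __init__(self):
        self._latest = {}  # source -> (sequence number, value)

    def apply(self, source, seq, value):
        # Ignore stale or duplicate updates: only a strictly newer sequence
        # number from the same source replaces the current value.
        current = self._latest.get(source)
        if current is not None and seq <= current[0]:
            return False
        self._latest[source] = (seq, value)
        return True

    def snapshot(self):
        # The sequence numbers identify exactly which inputs a result used.
        return {source: seq for source, (seq, _) in self._latest.items()}


tracker = SourceTracker()
tracker.apply("ecb", 1, 1.08)
tracker.apply("ecb", 3, 1.10)
tracker.apply("ecb", 2, 1.09)  # arrives late and is dropped
```

Recording the snapshot alongside a computed result makes it possible to tell later whether the result was based on the current inputs.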

One way would be to leave the import in a staging area until the user has completed their part of the workflow against a point-in-time snapshot. There is nothing you can do with a CRDT here which you cannot do without one: there's no replication in this scenario.

15.12.2024 15:42 | 👍 0    🔁 0    💬 1    📌 0

What makes you say it's impossible without CRDTs? I don't see why it would be any more complex without CRDTs.

15.12.2024 15:19 | 👍 0    🔁 0    💬 1    📌 0

Hi everyone! I'm a co-founder @dbos.dev, where we're building a serverless platform for highly reliable applications. I love conversations about databases, distributed systems, and anything technical. Thanks @qianli.dev for introducing me to Bluesky, and looking forward to meeting people here! 🦋

13.12.2024 00:58 | 👍 31    🔁 4    💬 0    📌 2
