Matt Rasmussen's Avatar

Matt Rasmussen

@mattrasmus.bsky.social

SVP Software Engineering, insitro Workflow engines, data engineering, genomics, phylogenetics, note taking. Web: http://mattrasmus.com/ ARGweaver: https://github.com/mdrasmus/argweaver redun: https://github.com/insitro/redun KeepNote: http://keepnote.org

162 Followers  |  463 Following  |  7 Posts  |  Joined: 03.02.2024  |  1.7137

Latest posts by mattrasmus.bsky.social on Bluesky

Post image

We are excited to share GPN-Star, a cost-effective, biologically grounded genomic language modeling framework that achieves state-of-the-art performance across a wide range of variant effect prediction tasks relevant to human genetics.
www.biorxiv.org/content/10.1...
(1/n)

22.09.2025 05:29 β€” πŸ‘ 174    πŸ” 90    πŸ’¬ 4    πŸ“Œ 5
Build systems are awesome, terrifying, and unloved. They are used by every developer around the world, but are rarely the object of study. In this paper we offer a systematic, and executable, framework for developing and comparing build systems, viewing them as related points in landscape rather than as isolated phenomena.

By teasing apart existing build systems, we can recombine their components, allowing us to prototype new build systems with desired properties.

Build systems are awesome, terrifying, and unloved. They are used by every developer around the world, but are rarely the object of study. In this paper we offer a systematic, and executable, framework for developing and comparing build systems, viewing them as related points in landscape rather than as isolated phenomena. By teasing apart existing build systems, we can recombine their components, allowing us to prototype new build systems with desired properties.

Finally got around to reading Build Systems Γ  la Carte (dl.acm.org/doi/pdf/10.1...), it's really great (like everyone says).

I love papers like this, that take a well-known, messy, unloved area and propose a useful organizing principle for it.

22.09.2025 06:27 β€” πŸ‘ 21    πŸ” 8    πŸ’¬ 1    πŸ“Œ 0

8/ Absolutely DETERMINED to make mutable collections key in a dict? Make them the default parameter of a lambda:

>>> {{}: {}}
TypeError: unhashable type: 'dict'
>>> x = lambda y={}: y
>>> x()[1] = 2
>>> x()
{1: 2}
>>> {x: {}}
{<function <lambda> at 0x000001E45E80E440>: {}}

02.04.2025 18:30 β€” πŸ‘ 42    πŸ” 7    πŸ’¬ 2    πŸ“Œ 3
Explicit note types that match and advance stages of sensemaking.

Explicit note types that match and advance stages of sensemaking.

β€œPatterns of Hypertext-Augmented Sensemaking” by Zhu et al. (2024)

dl.acm.org/doi/pdf/10.1...

23.03.2025 19:29 β€” πŸ‘ 9    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Text screenshot:
"Ventures?"
"β€”Please choose an optionβ€”"
[group group-801]
[object Object]
[/group]
"Sign me up for your mailing list"

Text screenshot: "Ventures?" "β€”Please choose an optionβ€”" [group group-801] [object Object] [/group] "Sign me up for your mailing list"

This field is mandatory. I don’t know what it means.

Waste my time, I’m gonna waste yours.

18.02.2025 05:00 β€” πŸ‘ 165    πŸ” 10    πŸ’¬ 7    πŸ“Œ 0

As our debut Bluesky post, we’re excited to share our new paper (first author Zachary McCaw) in HGG Advances on scrutinizing the practice of using a ratio trait (numerator / denominator) for GWAS. www.cell.com/hgg-advances...

11.02.2025 14:41 β€” πŸ‘ 5    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0
Post image

Our insitrocytes are the heartbeat of our shared vision. Together, as builders, scientists, engineers, we solve complex challenges every day, laser-focused on delivering better medicines for patients. Hear directly from some of our colleagues about how we work.
Visit www.insitro.com/people/.

12.02.2025 16:17 β€” πŸ‘ 4    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Program

0900-1000: Compilers

    Database query compilation: our journey
    Viktor Leis & Thomas Neumann
    A YJIT interview
    Maxime Chevalier-Boisvert
    Can we democratize JIT compilers?
    Haoran Xu
    Safe and productive performance with user-schedulable languages
    Jonathan Ragan-Kelley

1000-1100: Understanding programs

    How debuggers work
    Sy Brand
    Debugging compiler-optimized code: how it works and doesn't
    Stephen Kell
    Side-Eye: ask your programs anything
    Andrei Matei
    Let’s run a million benchmarks
    Yao Yue
    Rocket science of simulation testing!
    Aleksey Kladov

1100-1200: Wild ideas

    Back to modularity
    Daniel Jackson
    DB usability: as if
    Jonathan Edwards
    Twizzler and far out memory sharing: precise abstractions
    Daniel Bittman
    Programming without pointers
    Andrew Kelley
    Throwing it all away - how extreme rewriting changed the way I build databases
    Tyler Neely

1200-1230: Programmers are people

    A case for feminism in programming language design
    Felienne Hermans
    Malloy, mic drop, peace!
    Michael Toy

1230-1300: Lightning talk buffet

1300-1400: Query languages

    ???
    Jeff Shute
    ???
    Tobias Brandt
    ???
    Yury Selivanov
    AquaLang: a dataflow programming language
    Klas Segeljakt
    A polymorphic data model for SQL using algebraic types
    Steve McCanne

1400-1430: Databases

    Use of time in distributed databases β€”don’t fall behind the times
    Murat Demirbas
    ???
    Thomas Ballinger
    Thank goodness we're not building a database
    Peter van Hardenburg

1430-1500: Wasm

    Thinking in wit
    Dan Gohman
    Bringing the WebAssembly standard up to speed with SpecTec
    Dongjun Youn

Program 0900-1000: Compilers Database query compilation: our journey Viktor Leis & Thomas Neumann A YJIT interview Maxime Chevalier-Boisvert Can we democratize JIT compilers? Haoran Xu Safe and productive performance with user-schedulable languages Jonathan Ragan-Kelley 1000-1100: Understanding programs How debuggers work Sy Brand Debugging compiler-optimized code: how it works and doesn't Stephen Kell Side-Eye: ask your programs anything Andrei Matei Let’s run a million benchmarks Yao Yue Rocket science of simulation testing! Aleksey Kladov 1100-1200: Wild ideas Back to modularity Daniel Jackson DB usability: as if Jonathan Edwards Twizzler and far out memory sharing: precise abstractions Daniel Bittman Programming without pointers Andrew Kelley Throwing it all away - how extreme rewriting changed the way I build databases Tyler Neely 1200-1230: Programmers are people A case for feminism in programming language design Felienne Hermans Malloy, mic drop, peace! Michael Toy 1230-1300: Lightning talk buffet 1300-1400: Query languages ??? Jeff Shute ??? Tobias Brandt ??? Yury Selivanov AquaLang: a dataflow programming language Klas Segeljakt A polymorphic data model for SQL using algebraic types Steve McCanne 1400-1430: Databases Use of time in distributed databases β€”don’t fall behind the times Murat Demirbas ??? Thomas Ballinger Thank goodness we're not building a database Peter van Hardenburg 1430-1500: Wasm Thinking in wit Dan Gohman Bringing the WebAssembly standard up to speed with SpecTec Dongjun Youn

The program is up and tickets are live at www.hytradboi.com/2025/#program.

I'll keep updating this thread as the last 10 talks hit camera-ready.

01.02.2025 02:46 β€” πŸ‘ 50    πŸ” 27    πŸ’¬ 1    πŸ“Œ 7

Such a package index would encourage open source contribution of prompts plus pre-made code fragments

26.01.2025 07:15 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

This might really help when building apps using coding assistants. Having assistants write so much code from scratch still feels unpredictable. What I really want is to be able to suggest from an index of code fragments and prompts which things to glue into my app.

26.01.2025 07:15 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

But if you wanted to package up a partial table schema and some prompt about how it could be rendered in a UI, then the package manager+coding assistant could auto merge the partial schema into the user's existing schema and even make the right UI updates to show the new fields

26.01.2025 07:15 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Libraries and packages indexes (especially language specific ones) have been a boon for code reuse. But they can only be used for code that can be cleanly abstracted. I often encounter chunks of code I wish could reuse as a lib, but I can't because I can't abstract it well enough

26.01.2025 07:15 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

I've been thinking about what libs and pkg indexes might look like as coding assistants become the norm. What if instead of packages providing a post_install.sh script (to regex config, add symlinks), they had a post_install.md prompt to help integrate the code into your app.

26.01.2025 07:15 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Building Bauble I made something that I think is pretty neat, and I want to tell you about it. This is a little hot air balloon made out of alternating layers of brass and bronze that lock together with these angle...

building bauble

ianthehenry.com/posts/bauble...

10.01.2025 22:21 β€” πŸ‘ 128    πŸ” 30    πŸ’¬ 6    πŸ“Œ 13
Preview
Jupyter Kernel Architecture - Blog by Roman Glushko How Jupyter executes, inspects, completes and debug code

I stumbled upon this excellent post on the @projectjupyter.bsky.social kernel architecture, that is probably the nicest narrative writeup out there I've seen on the topic (please let me know of others!). I'm going to make a PR on our docs to link to it:

www.romaglushko.com/blog/jupyter...

11.01.2025 23:29 β€” πŸ‘ 43    πŸ” 9    πŸ’¬ 1    πŸ“Œ 0

OK. Practical question times. How are you adjusting your research given progress in reasoning style models? Also how are you adjusting the way you work?

22.12.2024 07:39 β€” πŸ‘ 60    πŸ” 4    πŸ’¬ 11    πŸ“Œ 1
Preview
SQL Workbench - Rapid prototyping SQL Queries & Data Visualizations An online SQL Workbench based on DuckDB that can query and visualize remote CSV, JSON, Parquet and Arrow data, as well as local files.

I rebuilt sql-workbench.com over Christmas.

It now uses CodeMirror instead of Monaco, thus working on mobile now, too. As well as an update to Perspective v3 and moving to Shadcn UI sidebar from a custom one…

28.12.2024 20:27 β€” πŸ‘ 35    πŸ” 3    πŸ’¬ 0    πŸ“Œ 0
Preview
Groove Salad from SomaFM A nicely chilled plate of ambient/downtempo beats and grooves.

current vibe: somafm.com/groovesalad/

28.12.2024 23:19 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 2    πŸ“Œ 0
Preview
AI-generated tools can make programming more fun

New blog post! About how using AI to build custom debug views can make it more fun to do the programming ourselves.

www.geoffreylitt.com/2024/12/22/m...

22.12.2024 21:47 β€” πŸ‘ 36    πŸ” 4    πŸ’¬ 2    πŸ“Œ 2
Diagram with large number: 2.7.123
First β€œ2” is commented: Proud version. Bump when you are proud of the release
Second β€œ7” is commented: Default version. Just normal/okay releases
Third β€œ123” is commented: Shame version. Bump when fixing things too embarrassing to admit

Diagram with large number: 2.7.123 First β€œ2” is commented: Proud version. Bump when you are proud of the release Second β€œ7” is commented: Default version. Just normal/okay releases Third β€œ123” is commented: Shame version. Bump when fixing things too embarrassing to admit

I propose we replace semantic versioning with pride versioning

21.12.2024 19:07 β€” πŸ‘ 2558    πŸ” 746    πŸ’¬ 34    πŸ“Œ 52
Preview
Tree Calculus One operator. Trivial semantics. Turing complete. Intensional.

So, this is cool:

treecalcul.us

A very simple intensional calculus. A simple, privileged programming language where functions can be analyzed, serialized, etc.

10.12.2024 00:46 β€” πŸ‘ 71    πŸ” 12    πŸ’¬ 7    πŸ“Œ 0
Post image

Foursquare places data is live in the hive 🐝 🍯

@hachej.bsky.social @seifert.blue

30.11.2024 00:04 β€” πŸ‘ 38    πŸ” 7    πŸ’¬ 6    πŸ“Œ 1

A lot of no-code lab software (LIMS, automation scheduling) suffers from trying to reinvent a software practice concepts in a GUI. Things like branches, test environments, API integrations are either not supported or reinvented in strange ways.

27.11.2024 21:30 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Implicit in the design of most live and visual programming interfaces is the ethos β€œif everyone would just pull the rest of the world into my environment…”

27.11.2024 20:52 β€” πŸ‘ 11    πŸ” 1    πŸ’¬ 3    πŸ“Œ 1
Post image

I'm reviewing LLM frameworks and every. single. one.:

- declares that a prompt is some function of state that produces an output
- then invents a convoluted DSL

So why is this not the obvious way to do it?
- self-documenting
- typed inputs and outputs
- easy separation of template from context

27.11.2024 00:13 β€” πŸ‘ 16    πŸ” 1    πŸ’¬ 1    πŸ“Œ 1
Preview
Introduction - Model Context Protocol Get started with the Model Context Protocol (MCP)

Anthropic released an interesting thing today: an attempt at a standard protocol for LLM tools to talk to services that provide tools and extra context to be used other the models modelcontextprotocol.io

25.11.2024 16:37 β€” πŸ‘ 378    πŸ” 52    πŸ’¬ 19    πŸ“Œ 6
Preview
DASL β€” Data-Addressed Structures & Links A small set of simple, standard primitives to work on content-addressed data.

Have you ever wanted to use content-addressing primitives like CIDs or something IPLD/CBOR-like but you found the optionality in IPFS a little unwieldy and expensive to implement? Or, you know, interested in ATProto?

We've stared drafting a little thing for you: dasl.ing

25.11.2024 14:49 β€” πŸ‘ 118    πŸ” 35    πŸ’¬ 12    πŸ“Œ 5



go.bsky.app/UZB3Ndi

24.11.2024 03:38 β€” πŸ‘ 7    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0
Post image

my braindump on how Observable Notebooks work at the runtime level
observablehq.com/@tomlarkwort...

23.11.2024 10:11 β€” πŸ‘ 6    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
Preview
Reconcile All The Things Visual programming for coders.

Another nice blog post comparing workflow engine compute to the logic React uses to cache UI changes
acko.net/blog/reconci...

26.11.2024 05:57 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@mattrasmus is following 19 prominent accounts