We are excited to share GPN-Star, a cost-effective, biologically grounded genomic language modeling framework that achieves state-of-the-art performance across a wide range of variant effect prediction tasks relevant to human genetics.
www.biorxiv.org/content/10.1...
(1/n)
22.09.2025 05:29 β π 174 π 90 π¬ 4 π 5
Build systems are awesome, terrifying, and unloved. They are used by every developer around the world, but are rarely the object of study. In this paper we offer a systematic, and executable, framework for developing and comparing build systems, viewing them as related points in landscape rather than as isolated phenomena.
By teasing apart existing build systems, we can recombine their components, allowing us to prototype new build systems with desired properties.
Finally got around to reading Build Systems Γ la Carte (dl.acm.org/doi/pdf/10.1...), it's really great (like everyone says).
I love papers like this, that take a well-known, messy, unloved area and propose a useful organizing principle for it.
22.09.2025 06:27 β π 21 π 8 π¬ 1 π 0
8/ Absolutely DETERMINED to make mutable collections key in a dict? Make them the default parameter of a lambda:
>>> {{}: {}}
TypeError: unhashable type: 'dict'
>>> x = lambda y={}: y
>>> x()[1] = 2
>>> x()
{1: 2}
>>> {x: {}}
{<function <lambda> at 0x000001E45E80E440>: {}}
02.04.2025 18:30 β π 42 π 7 π¬ 2 π 3
Explicit note types that match and advance stages of sensemaking.
βPatterns of Hypertext-Augmented Sensemakingβ by Zhu et al. (2024)
dl.acm.org/doi/pdf/10.1...
23.03.2025 19:29 β π 9 π 1 π¬ 0 π 0
Text screenshot:
"Ventures?"
"βPlease choose an optionβ"
[group group-801]
[object Object]
[/group]
"Sign me up for your mailing list"
This field is mandatory. I donβt know what it means.
Waste my time, Iβm gonna waste yours.
18.02.2025 05:00 β π 165 π 10 π¬ 7 π 0
As our debut Bluesky post, weβre excited to share our new paper (first author Zachary McCaw) in HGG Advances on scrutinizing the practice of using a ratio trait (numerator / denominator) for GWAS. www.cell.com/hgg-advances...
11.02.2025 14:41 β π 5 π 2 π¬ 0 π 0
Our insitrocytes are the heartbeat of our shared vision. Together, as builders, scientists, engineers, we solve complex challenges every day, laser-focused on delivering better medicines for patients. Hear directly from some of our colleagues about how we work.
Visit www.insitro.com/people/.
12.02.2025 16:17 β π 4 π 1 π¬ 0 π 0
Program
0900-1000: Compilers
Database query compilation: our journey
Viktor Leis & Thomas Neumann
A YJIT interview
Maxime Chevalier-Boisvert
Can we democratize JIT compilers?
Haoran Xu
Safe and productive performance with user-schedulable languages
Jonathan Ragan-Kelley
1000-1100: Understanding programs
How debuggers work
Sy Brand
Debugging compiler-optimized code: how it works and doesn't
Stephen Kell
Side-Eye: ask your programs anything
Andrei Matei
Letβs run a million benchmarks
Yao Yue
Rocket science of simulation testing!
Aleksey Kladov
1100-1200: Wild ideas
Back to modularity
Daniel Jackson
DB usability: as if
Jonathan Edwards
Twizzler and far out memory sharing: precise abstractions
Daniel Bittman
Programming without pointers
Andrew Kelley
Throwing it all away - how extreme rewriting changed the way I build databases
Tyler Neely
1200-1230: Programmers are people
A case for feminism in programming language design
Felienne Hermans
Malloy, mic drop, peace!
Michael Toy
1230-1300: Lightning talk buffet
1300-1400: Query languages
???
Jeff Shute
???
Tobias Brandt
???
Yury Selivanov
AquaLang: a dataflow programming language
Klas Segeljakt
A polymorphic data model for SQL using algebraic types
Steve McCanne
1400-1430: Databases
Use of time in distributed databases βdonβt fall behind the times
Murat Demirbas
???
Thomas Ballinger
Thank goodness we're not building a database
Peter van Hardenburg
1430-1500: Wasm
Thinking in wit
Dan Gohman
Bringing the WebAssembly standard up to speed with SpecTec
Dongjun Youn
The program is up and tickets are live at www.hytradboi.com/2025/#program.
I'll keep updating this thread as the last 10 talks hit camera-ready.
01.02.2025 02:46 β π 50 π 27 π¬ 1 π 7
Such a package index would encourage open source contribution of prompts plus pre-made code fragments
26.01.2025 07:15 β π 0 π 0 π¬ 0 π 0
This might really help when building apps using coding assistants. Having assistants write so much code from scratch still feels unpredictable. What I really want is to be able to suggest from an index of code fragments and prompts which things to glue into my app.
26.01.2025 07:15 β π 0 π 0 π¬ 1 π 0
But if you wanted to package up a partial table schema and some prompt about how it could be rendered in a UI, then the package manager+coding assistant could auto merge the partial schema into the user's existing schema and even make the right UI updates to show the new fields
26.01.2025 07:15 β π 0 π 0 π¬ 1 π 0
Libraries and packages indexes (especially language specific ones) have been a boon for code reuse. But they can only be used for code that can be cleanly abstracted. I often encounter chunks of code I wish could reuse as a lib, but I can't because I can't abstract it well enough
26.01.2025 07:15 β π 0 π 0 π¬ 1 π 0
I've been thinking about what libs and pkg indexes might look like as coding assistants become the norm. What if instead of packages providing a post_install.sh script (to regex config, add symlinks), they had a post_install.md prompt to help integrate the code into your app.
26.01.2025 07:15 β π 1 π 0 π¬ 1 π 0
Jupyter Kernel Architecture - Blog by Roman Glushko
How Jupyter executes, inspects, completes and debug code
I stumbled upon this excellent post on the @projectjupyter.bsky.social kernel architecture, that is probably the nicest narrative writeup out there I've seen on the topic (please let me know of others!). I'm going to make a PR on our docs to link to it:
www.romaglushko.com/blog/jupyter...
11.01.2025 23:29 β π 43 π 9 π¬ 1 π 0
OK. Practical question times. How are you adjusting your research given progress in reasoning style models? Also how are you adjusting the way you work?
22.12.2024 07:39 β π 60 π 4 π¬ 11 π 1
SQL Workbench - Rapid prototyping SQL Queries & Data Visualizations
An online SQL Workbench based on DuckDB that can query and visualize remote CSV, JSON, Parquet and Arrow data, as well as local files.
I rebuilt sql-workbench.com over Christmas.
It now uses CodeMirror instead of Monaco, thus working on mobile now, too. As well as an update to Perspective v3 and moving to Shadcn UI sidebar from a custom oneβ¦
28.12.2024 20:27 β π 35 π 3 π¬ 0 π 0
AI-generated tools can make programming more fun
New blog post! About how using AI to build custom debug views can make it more fun to do the programming ourselves.
www.geoffreylitt.com/2024/12/22/m...
22.12.2024 21:47 β π 36 π 4 π¬ 2 π 2
Diagram with large number: 2.7.123
First β2β is commented: Proud version. Bump when you are proud of the release
Second β7β is commented: Default version. Just normal/okay releases
Third β123β is commented: Shame version. Bump when fixing things too embarrassing to admit
I propose we replace semantic versioning with pride versioning
21.12.2024 19:07 β π 2558 π 746 π¬ 34 π 52
Tree Calculus
One operator. Trivial semantics. Turing complete. Intensional.
So, this is cool:
treecalcul.us
A very simple intensional calculus. A simple, privileged programming language where functions can be analyzed, serialized, etc.
10.12.2024 00:46 β π 71 π 12 π¬ 7 π 0
Foursquare places data is live in the hive π π―
@hachej.bsky.social @seifert.blue
30.11.2024 00:04 β π 38 π 7 π¬ 6 π 1
A lot of no-code lab software (LIMS, automation scheduling) suffers from trying to reinvent a software practice concepts in a GUI. Things like branches, test environments, API integrations are either not supported or reinvented in strange ways.
27.11.2024 21:30 β π 1 π 0 π¬ 0 π 0
Implicit in the design of most live and visual programming interfaces is the ethos βif everyone would just pull the rest of the world into my environmentβ¦β
27.11.2024 20:52 β π 11 π 1 π¬ 3 π 1
I'm reviewing LLM frameworks and every. single. one.:
- declares that a prompt is some function of state that produces an output
- then invents a convoluted DSL
So why is this not the obvious way to do it?
- self-documenting
- typed inputs and outputs
- easy separation of template from context
27.11.2024 00:13 β π 16 π 1 π¬ 1 π 1
Introduction - Model Context Protocol
Get started with the Model Context Protocol (MCP)
Anthropic released an interesting thing today: an attempt at a standard protocol for LLM tools to talk to services that provide tools and extra context to be used other the models modelcontextprotocol.io
25.11.2024 16:37 β π 378 π 52 π¬ 19 π 6
DASL β Data-Addressed Structures & Links
A small set of simple, standard primitives to work on content-addressed data.
Have you ever wanted to use content-addressing primitives like CIDs or something IPLD/CBOR-like but you found the optionality in IPFS a little unwieldy and expensive to implement? Or, you know, interested in ATProto?
We've stared drafting a little thing for you: dasl.ing
25.11.2024 14:49 β π 118 π 35 π¬ 12 π 5
go.bsky.app/UZB3Ndi
24.11.2024 03:38 β π 7 π 1 π¬ 0 π 0
my braindump on how Observable Notebooks work at the runtime level
observablehq.com/@tomlarkwort...
23.11.2024 10:11 β π 6 π 1 π¬ 1 π 0
Reconcile All The Things
Visual programming for coders.
Another nice blog post comparing workflow engine compute to the logic React uses to cache UI changes
acko.net/blog/reconci...
26.11.2024 05:57 β π 3 π 0 π¬ 0 π 0
Genomicist, computational biologist. Assistant professor @ MGH, HMS. Associate member @ Broad Institute
https://klab.is
π§π»βπ» AI Engineer in bioimaging π§¬
Bronx boy. Cubs fan. Dad, husband, writer, podcaster and cable news host.
The Sirensβ Call: How Attention Became the Worldβs Most Endangered Resource out now.
https://sirenscallbook.com/
Husband, dad, veteran, writer, and proud Midwesterner. 19th US Secretary of Transportation and former Mayor of South Bend.
ClickHouse is the fastest open-source OLAP database β‘
Download: http://clickhouse.com
GitHub: http://clickhou.se/GitHub
Slack: http://clickhouse.com/slack
software engineer at ClickHouse; prev: Figma, WePay
long-form βοΈ: https://expertofobsolescence.substack.com
AI, robotics, and other stuff. Currently AI @ agility robotics
Former Hello Robot, NVIDIA, Meta.
Writing about robots https://itcanthink.substack.com/
All opinions my own
Google Chief Scientist, Gemini Lead. Opinions stated here are my own, not those of Google. Gemini, TensorFlow, MapReduce, Bigtable, Spanner, ML things, ...
More good things for everyone. Public sector appreciator. Tax and welfare policy knower. Hyperinflation doubter.
K8s SIG Security Co-Chair
container escape artist
aurora chaser
goose in the machine
Minneapolis. They/them.
Stay punk π΄
Statistical Genetics @insitro.bsky.social | PhD @emoryuniversity.bsky.social
Statistical genetics and machine learning @ insitro | previously goldfinchbio | mgh | uw
Interested in helping cure diseases; machine learning for chemistry and biology
ML research engineer, dad, widower, lover of cats and coffee
Biostatistician and machine learning scientist
Genomics, big data, open science, diversity. Director of the Centre for Population Genomics, focused on building a more equitable future for genomic medicine. Opinions my own.