Robert Lesser's Avatar

Robert Lesser

@rlesser.bsky.social

Engineer at Nomic AI | building tools for seeing in a high-dimensional world

60 Followers  |  76 Following  |  15 Posts  |  Joined: 24.05.2023  |  1.8125

Latest posts by rlesser.bsky.social on Bluesky

Video thumbnail

What are some great datasets hosted on Hugging Face? We just added a way to quickly import, embed, and build interactive visualizations from them. Would love to get some hidden gems in. E.g. here's 650,000 1933 newspaper articles from LOC, extracted by Melissa Dell. atlas.nomic.ai/data/nomic/a...

30.01.2025 18:20 β€” πŸ‘ 20    πŸ” 4    πŸ’¬ 0    πŸ“Œ 0

I think that’s true, and also people find ways to rationalize support for things they genuinely enjoy using/doing

14.01.2025 03:39 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
View trending community notebooks | ObservableThis notebook is a forkThis notebook is a forkThis notebook is a forkThis notebook is a forkThis notebook is a forkThis notebook is a forkThis notebook is... See trending notebooks that you can fork and build from. Get started quickly with inspiring community examples.

I think trending is still up? observablehq.com/trending

But it’s been massively hidden, I’m not sure if there’s a single public-facing link to it on the whole site. Definitely a bummer

13.01.2025 22:58 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
For Maurice DiMaggio, an electrical contractor, morning drives from Matawan, N.J., into Manhattan via the Lincoln Tunnel usually take two hours. Today's took only one. "I would rather have an hour not in the vehicle, in traffic," DiMaggio, 45, said from inside his work van. If congestion pricing proves a permanent "commute clipper" for DiMaggio (who is related via his
grandfather to Joltin' Joe), he is all for it. "I can't run my business off the subway, but maybe this whole congestion pricing thing is keeping people from driving."

For Maurice DiMaggio, an electrical contractor, morning drives from Matawan, N.J., into Manhattan via the Lincoln Tunnel usually take two hours. Today's took only one. "I would rather have an hour not in the vehicle, in traffic," DiMaggio, 45, said from inside his work van. If congestion pricing proves a permanent "commute clipper" for DiMaggio (who is related via his grandfather to Joltin' Joe), he is all for it. "I can't run my business off the subway, but maybe this whole congestion pricing thing is keeping people from driving."

every single time, congestion pricing becomes way more popular after it’s implemented

traffic sucks, but people refuse to believe it will go away until it actually does

06.01.2025 16:08 β€” πŸ‘ 399    πŸ” 54    πŸ’¬ 7    πŸ“Œ 4

To absolute’s credit, nowhere else I know of let you get a Thai iced tea along with your bagel. That was really something

13.12.2024 21:33 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

This is a real loss, but my undisputed UWS GOAT Broadway Bagel on 101 remains very much alive. Long live the egg everything BEC

13.12.2024 21:33 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

My sense is that both absolute and relative time could be meaningful over time - β€œwhat does decay look like for 10 year time gaps” but also β€œwhat does the decay trajectory look like for sites from 2014?”

With enough data you could make a nice β€œMedusa chart” showing both. Cc @toph.me

09.12.2024 06:03 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

You know a side project is getting out of hand when you start making a settings page before anyone has actually used it

05.12.2024 14:43 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
A heatmap of years and states, showing a large blue cluster in the center where Southern states vote Democratic. and other geographical patterns.

A heatmap of years and states, showing a large blue cluster in the center where Southern states vote Democratic. and other geographical patterns.

A heatmap of years and states in alphabetical order, showing no clear global structure

A heatmap of years and states in alphabetical order, showing no clear global structure

New blog post! Updated for 2024, my favorite example of why alphabetical ordering is bad for geographic features -- US presidential results since 1828. The left image shows regional patterns in a geographic ordering that the right (alphabetical) simply loses. benschmidt.org/post/2024-11...

01.12.2024 13:46 β€” πŸ‘ 127    πŸ” 21    πŸ’¬ 12    πŸ“Œ 5

What a beautiful win. The look on Ryan's face in that final moment made the whole season worth it!!!

30.11.2024 21:07 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Adventures in DuckDB + Huggingface + ArrowJS - If you try to stream arrow IPC out of HF with DuckDB, for some reason the batches come back in ~random order! ArrowJS decided to explode when this happens.

Not even sure who's at fault here, but excited for this ecosystem to continue to mature #databs

20.11.2024 16:46 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Clippy, the godfather of generative AI

Clippy, the godfather of generative AI

This problem was solved years ago

19.11.2024 21:02 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

TMD Web Team Alum Starter Pack πŸ‘€

18.11.2024 19:57 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I am a broken record on this but LLM text embeddings are an incredible breakthrough, and the ability for anyone to build pretty good classifiers with structured output could be insanely useful.

Trying to build NLP interfaces is taking my team an extremely long time and is extremely brittle

13.11.2024 15:49 β€” πŸ‘ 20    πŸ” 4    πŸ’¬ 1    πŸ“Œ 2
Wordle, 15 Million Tweets Later Since the start of the year, the online word game Wordle has overtaken β€œcrossword” (by 10x), β€œolympics” (2x) and even β€œcovid” (1.5x) in Google Trends data. News outlets cover the game down to each day...

Back in 2022 I published this post analyzing 15M tweets with Wordle results, with some fascinating results: observablehq.com/@rlesser/wor...

The hardest part by far was gathering a huge dataset from a platform hostile to such analysis. Very excited that Bluesky encourages this type of exploration!

13.11.2024 14:45 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
graph getRelationships does not behave as lexicon says it should Β· Issue #2919 Β· bluesky-social/atproto Describe the bug The lexicon for getRelationships states that at-identifers can be used in the query for both actor and others, and that the response will return dids as the actor. This is not true...

Probably the getRelationships endpoint not working correctly: github.com/bluesky-soci...

13.11.2024 12:42 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Great write up on how Val Town built Townie, probably the best LLM coding experience I’ve used.

A huge part of what makes it so nice is how little boilerplate/infrastructure the VT environment needs. Easier for people and easier for LLMs when everyone can focus on the business logic alone.

08.11.2024 22:07 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

One of the most exciting things I’ve worked on at Nomic.

There is huge untapped potential in taking people on a journey through a dataset, especially for text and image sets that are currently so hard to reason about

30.10.2024 17:04 β€” πŸ‘ 6    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

This is now a force-directed-graph-posting account, all other content posted is anomalous behavior

24.05.2023 18:36 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

@rlesser is following 20 prominent accounts