What are some great datasets hosted on Hugging Face? We just added a way to quickly import, embed, and build interactive visualizations from them. Would love to get some hidden gems in. E.g. here's 650,000 1933 newspaper articles from LOC, extracted by Melissa Dell. atlas.nomic.ai/data/nomic/a...
30.01.2025 18:20 β π 20 π 4 π¬ 0 π 0
I think thatβs true, and also people find ways to rationalize support for things they genuinely enjoy using/doing
14.01.2025 03:39 β π 1 π 0 π¬ 0 π 0
For Maurice DiMaggio, an electrical contractor, morning drives from Matawan, N.J., into Manhattan via the Lincoln Tunnel usually take two hours. Today's took only one. "I would rather have an hour not in the vehicle, in traffic," DiMaggio, 45, said from inside his work van. If congestion pricing proves a permanent "commute clipper" for DiMaggio (who is related via his
grandfather to Joltin' Joe), he is all for it. "I can't run my business off the subway, but maybe this whole congestion pricing thing is keeping people from driving."
every single time, congestion pricing becomes way more popular after itβs implemented
traffic sucks, but people refuse to believe it will go away until it actually does
06.01.2025 16:08 β π 399 π 54 π¬ 7 π 4
To absoluteβs credit, nowhere else I know of let you get a Thai iced tea along with your bagel. That was really something
13.12.2024 21:33 β π 2 π 0 π¬ 0 π 0
This is a real loss, but my undisputed UWS GOAT Broadway Bagel on 101 remains very much alive. Long live the egg everything BEC
13.12.2024 21:33 β π 2 π 0 π¬ 1 π 0
My sense is that both absolute and relative time could be meaningful over time - βwhat does decay look like for 10 year time gapsβ but also βwhat does the decay trajectory look like for sites from 2014?β
With enough data you could make a nice βMedusa chartβ showing both. Cc @toph.me
09.12.2024 06:03 β π 1 π 0 π¬ 0 π 0
You know a side project is getting out of hand when you start making a settings page before anyone has actually used it
05.12.2024 14:43 β π 2 π 0 π¬ 1 π 0
A heatmap of years and states, showing a large blue cluster in the center where Southern states vote Democratic. and other geographical patterns.
A heatmap of years and states in alphabetical order, showing no clear global structure
New blog post! Updated for 2024, my favorite example of why alphabetical ordering is bad for geographic features -- US presidential results since 1828. The left image shows regional patterns in a geographic ordering that the right (alphabetical) simply loses. benschmidt.org/post/2024-11...
01.12.2024 13:46 β π 127 π 21 π¬ 12 π 5
What a beautiful win. The look on Ryan's face in that final moment made the whole season worth it!!!
30.11.2024 21:07 β π 0 π 0 π¬ 0 π 0
Adventures in DuckDB + Huggingface + ArrowJS - If you try to stream arrow IPC out of HF with DuckDB, for some reason the batches come back in ~random order! ArrowJS decided to explode when this happens.
Not even sure who's at fault here, but excited for this ecosystem to continue to mature #databs
20.11.2024 16:46 β π 2 π 0 π¬ 0 π 0
Clippy, the godfather of generative AI
This problem was solved years ago
19.11.2024 21:02 β π 1 π 0 π¬ 1 π 0
TMD Web Team Alum Starter Pack π
18.11.2024 19:57 β π 2 π 0 π¬ 0 π 0
I am a broken record on this but LLM text embeddings are an incredible breakthrough, and the ability for anyone to build pretty good classifiers with structured output could be insanely useful.
Trying to build NLP interfaces is taking my team an extremely long time and is extremely brittle
13.11.2024 15:49 β π 20 π 4 π¬ 1 π 2
Wordle, 15 Million Tweets Later
Since the start of the year, the online word game Wordle has overtaken βcrosswordβ (by 10x), βolympicsβ (2x) and even βcovidβ (1.5x) in Google Trends data. News outlets cover the game down to each day...
Back in 2022 I published this post analyzing 15M tweets with Wordle results, with some fascinating results: observablehq.com/@rlesser/wor...
The hardest part by far was gathering a huge dataset from a platform hostile to such analysis. Very excited that Bluesky encourages this type of exploration!
13.11.2024 14:45 β π 4 π 0 π¬ 0 π 0
Great write up on how Val Town built Townie, probably the best LLM coding experience Iβve used.
A huge part of what makes it so nice is how little boilerplate/infrastructure the VT environment needs. Easier for people and easier for LLMs when everyone can focus on the business logic alone.
08.11.2024 22:07 β π 3 π 0 π¬ 0 π 0
One of the most exciting things Iβve worked on at Nomic.
There is huge untapped potential in taking people on a journey through a dataset, especially for text and image sets that are currently so hard to reason about
30.10.2024 17:04 β π 6 π 1 π¬ 0 π 0
This is now a force-directed-graph-posting account, all other content posted is anomalous behavior
24.05.2023 18:36 β π 1 π 0 π¬ 1 π 0
Dataviz at @civio.es Β· Passionate about #dataviz #maps #creativeCoding alwaysLearning Β· π©π»βπ»π³οΈβπ she/her
https://observablehq.com/user/@carmen-tm
Steelers beat writer at The Athletic.
Building @sidequery.dev
Director of Engineering atm.com
nicoritschel.com
Bot that posts the top trending notebook from https://observablehq.com/trending
source code online https://observablehq.com/@endpointservices/twitter-bot
advocating widespread dissatisfaction with computing.
Politics, math, culture, whatever.
A Mathematician dabbling in Data Science, especially unsupervised learning and data exploration. UMAP, HDBSCAN, PyNNDescent, DataMapPlot. (He/Him)
Jayvik shipper since season 1
todepond.com
β΅ London ΰ·΄ @tldraw.com
what's up sickos and freaks
Writer, The Ringer | Co-host, The Press Box & Ringer Tailgate | Former host, Slow Burn seasons 3, 6, 8 | Nationβs fastest 10-year-old in 1988 | Dez & Laniβs dad.
Endowed chair of the Tocqueville-Rand Freedom Enterprise Markets Innovation Center (disputed). Bound but not protected, I lie but I do not pretend. π° π
Applied scientist trying to make the internet a little better. PhD. Trust & safety, platform manipulation, networks, fingerstyle guitar. I use my hair to express myself. He/they