Joe Hamman

@jhamman.bsky.social

Climate scientist, engineer and human. Co-founder/CTO @earthmoverHQ. @xarray_dev / @pangeo_data dev. Formerly @carbonplanorg, @NCAR_Science, and @uw_cee. He/him.

577 Followers  |  117 Following  |  52 Posts  |  Joined: 22.11.2023  |  2.2272

Latest posts by jhamman.bsky.social on Bluesky

Super excited to see #icechunk v1.0 ship today. Stable format, stable API, and ready for production. Take it for a spin and let us know how it goes. 🚀🚀🚀

10.07.2025 19:15 - 👍 4    🔁 0    💬 0    📌 0

Just back from #CNG2025, I wrote up some thoughts on the traction the @zarr.dev format is gaining in the broader geospatial industry (specifically Earth Observation). I also summarize what features and tools the user and developer communities are asking for to fully leverage Zarr. 👇👇👇

07.05.2025 15:58 - 👍 3    🔁 2    💬 0    📌 0

I'll be at the CNG conference in Snowbird next week. I wrote a short blog post about what the Earthmover team will be up to.

tldr; we'll be talking about @zarr.dev, #icechunk, @xarray.bsky.social and cloud-native data cubes.

Details in the blog post 👇

28.04.2025 20:19 - 👍 1    🔁 0    💬 0    📌 0

Most people think of @zarr.dev as a "file format". With #Icechunk, we've turned Zarr into a database. @functionth.bsky.social's post shows how Icechunk can be used to solve a problem where transactional databases are often required.

24.04.2025 18:40 - 👍 1    🔁 0    💬 0    📌 0

Learning about Icechunk consistency with a clichéd but instructive example - Earthmover
In this post we’ll show what can happen when more than one process write to the same Icechunk repository concurrently, and how Icechunk uses transactions and conflict resolution to guarantee consisten...

1/ 🚨 New Blog Post Alert: "Learning About Icechunk Consistency with a Clichéd but Instructive Example"

👉 Read it here: earthmover.io/blog/learnin...

23.04.2025 15:30 - 👍 5    🔁 1    💬 1    📌 5

There's still time to sign up for tomorrow's webinar on ⚡ Flux ⚡

👇
share.hsforms.com/1FLrE9CpwSLq...

21.04.2025 18:58 - 👍 0    🔁 0    💬 0    📌 0

@zarr.dev and #icechunk are amazing but they are not magic. They are part of a thoughtfully designed cloud-native data architecture. @tegnicholas.bsky.social peels back the covers on cloud-optimized scientific data formats in our latest "Fundamentals" post 👇

21.04.2025 17:12 - 👍 2    🔁 1    💬 0    📌 0

We found similar results when we first benchmarked #icechunk. Our conclusion: doing IO with a Rust backend is much faster than Python.

👇 Really exciting to see @kylebarron.dev's Obstore backend for Zarr-Python ship today.

21.04.2025 16:52 - 👍 11    🔁 2    💬 0    📌 0

Announcing Flux: The API Layer for Geospatial Data Delivery - Earthmover
TLDR Earthmover’s new product–Flux–adds a whole new layer to our platform. Flux allows you to serve geospatial data from Arraylake via standard API protocols–including WMS (web map service), EDR (envi...

📝 More details: earthmover.io/blog/announc...
📅 Webinar April 22: share.hsforms.com/1FLrE9CpwSLq...
💬 DMs open - happy to chat or schedule a demo.

16.04.2025 15:10 - 👍 0    🔁 0    💬 0    📌 0

No more bespoke APIs. No glue code.
Just fast, interoperable access via tools you already use - QGIS, ArcGIS, web maps, Python, R, MATLAB, and more.
Flux is a cheat code for building data products.

16.04.2025 15:10 - 👍 0    🔁 0    💬 1    📌 0

Flux serves data directly from Icechunk via standard APIs like EDR, WMS, and OPeNDAP.
It’s stateless, scalable, and cloud-native.
From raw data to production-grade API endpoints in seconds.

16.04.2025 15:10 - 👍 0    🔁 0    💬 1    📌 0

We built Flux to solve a core pain point: turning massive geospatial datasets into usable APIs was slow, brittle, and distracting.
Flux makes it trivial for any data producer to become a data provider.

16.04.2025 15:10 - 👍 0    🔁 0    💬 1    📌 0

🚀 Big launch today: introducing Flux - a game-changing way to serve geospatial data via standard APIs in seconds. Built for scale, speed, and simplicity.

16.04.2025 15:10 - 👍 5    🔁 3    💬 1    📌 0

Training AI models at scale from data stored in cloud object storage requires thinking carefully about both bandwidth and concurrency. In this post, @functionth.bsky.social gets into the details of concurrent reads at scale, showing how #Icechunk and S3 can easily scale beyond 200k requests/second!

09.04.2025 19:55 - 👍 1    🔁 0    💬 0    📌 0

Exploring Icechunk scalability: untangling S3's prefix story | Earthmover
We show Icechunk can scale to extremely high concurrency levels, and explain how it achieves this in modern object stores.

📣 Blog post alert! "Exploring Icechunk scalability: untangling S3's prefix story". This technical post by @functionth.bsky.social dives deep into the internals of how S3 shards data, showing that distributed Icechunk can easily perform 230,000 object reads/sec and beyond. earthmover.io/blog/explori...

09.04.2025 15:27 - 👍 5    🔁 4    💬 2    📌 3

We’ll look into it!

04.04.2025 02:00 - 👍 0    🔁 0    💬 0    📌 0

We often see folks try to convince tabular data tools to perform well with multi-dimensional array data. This post by @rabernat.bsky.social explains, from first principles, why this rarely works. It's a good one! 👇👇👇

03.04.2025 21:10 - 👍 3    🔁 1    💬 1    📌 0

Super exciting to see @copernicusecmwf.bsky.social starting to adopt @zarr.dev!

02.04.2025 22:52 - 👍 3    🔁 0    💬 0    📌 0

🚨 New blog post 🚨

In it, we show off our recent work deploying #icechunk on top of #NASA's existing archives of Earth observation data. The results: 100x speed up when extracting time series from existing datasets stored as netCDF.

28.03.2025 15:30 - 👍 12    🔁 4    💬 0    📌 1

This session is going to be a blast! If you are headed to CNG next month (and you should be!), consider joining us for this workshop on @xarray.bsky.social, @zarr.dev, and #icechunk. 👇👇👇

25.03.2025 19:50 - 👍 0    🔁 0    💬 0    📌 0

1/ As a data platform company, we get a unique view of how other companies use data. In light of the recent cuts at @noaa.gov, our CEO, @rabernat.bsky.social, reached out to our customers to get their perspective on the value of NOAA’s data and services.

18.03.2025 15:07 - 👍 10    🔁 4    💬 1    📌 0

I'll be giving a short update tomorrow on the state of Zarr-Python following the 3.0 release back in January. Swing by the Pangeo showcase to catch the latest progress.

@zarr.dev

05.03.2025 01:22 - 👍 6    🔁 0    💬 0    📌 0

Denver Geospatial Happy Hour · Luma
Come have a drink, grab a bite, and nerd out with the Earthmover team on geospatial data. Earthmover will provide food and drink tickets. Our event will be…

Denver folks - join us for a Geospatial Happy Hour on Wednesday, March 12th at 7 PM MST. Come enjoy drinks and tacos, discuss geospatial data, and meet the Earthmover team. Register: lu.ma/o6r39p6j

27.02.2025 21:41 - 👍 5    🔁 3    💬 0    📌 0

We now have a public Slack channel for VirtualiZarr development and questions. Feel free to join!

earthmover-community.slack.com/archives/C08...

27.02.2025 15:40 - 👍 5    🔁 4    💬 0    📌 0

Very, very stoked to have @tegnicholas.bsky.social join the Earthmover team!

27.02.2025 16:06 - 👍 4    🔁 0    💬 0    📌 0

I'm curious what sort of issues you are seeing. I'm able to open datasets on s3 without any problems.

24.02.2025 21:24 - 👍 0    🔁 0    💬 2    📌 0

Pre race team photo

I rode the race with my good friend Nico! Together, we rode about 320 miles over 24 hours! www.strava.com/activities/1...

24.02.2025 17:21 - 👍 3    🔁 0    💬 0    📌 0

24HOP Open Men Duo Podium

Here on Bluesky, I follow two topics: 1) software & data infrastructure and 2) cycling. I mostly post about software but today, I can share a bit about a fun cycling adventure of my own 🚵‍♂️.

Last week, I got to ride in the 24 Hours in the Old Pueblo - placing 4th in the Open Men Duo category:

24.02.2025 17:21 - 👍 15    🔁 1    💬 2    📌 0

Zarr-Python 3 included a bunch of performance improvements that are now landing in @xarray.bsky.social. Check out @davisvbennett.bsky.social's post on what we changed in Xarray/Zarr and how it is speeding up the "time to first byte" for large datasets 👇

24.02.2025 16:56 - 👍 4    🔁 0    💬 0    📌 0
