mdsumner's Avatar

mdsumner

@mdsumner.bsky.social

maxing the pixels to glow meshes, R, Python, GDAL, virtual Zarr, coordinate systems, southern ocean research

288 Followers  |  376 Following  |  496 Posts  |  Joined: 30.10.2024  |  2.1537

Latest posts by mdsumner.bsky.social on Bluesky

Preview
Australia’s gambling harm is likely underreported – and authorities are still failing to act New research shows Australian gambling harms, including suicide, may have been systematically under reported.

"a lot like a protection racket for the gambling industry" is a very polite way of describing current state of gov't action.
#Auspol

Australia’s gambling harm is likely underreported – and authorities are still failing to act
theconversation.com/australias-g...

08.10.2025 03:07 — 👍 1    🔁 4    💬 0    📌 0

#rstats what do folks do when converting from raster to terra with saved data objects? active bindings is a possibility

github.com/ropensci/gra...

interested in any perspectives, I don't want to change the entire way the package works (I'm not the maintainer, just toying with ideas)

07.10.2025 23:00 — 👍 3    🔁 2    💬 0    📌 0
Post image

New blog post! kylebarron.dev/blog/fast-py...

This dives into performance pitfalls of Python bindings, why projects like Lonboard are so fast, and why I care about Apache Arrow so much. The post is high level and you don't need to know C/C++/Rust to understand it!

07.10.2025 14:05 — 👍 13    🔁 4    💬 1    📌 1

aye to that

07.10.2025 10:52 — 👍 0    🔁 0    💬 0    📌 0

The solution was to not use coord_sf(). I need to remember to not use sf in any project unless absolutely necessary.

07.10.2025 09:47 — 👍 0    🔁 1    💬 1    📌 0

in short:
it’s complicated and we’re fucked

#climatecrisis

07.10.2025 10:26 — 👍 5    🔁 1    💬 0    📌 0
Preview
Make fossil fuel exporters pay for the damage they are causing, not ordinary Australians Climate change is making natural disasters like fires, floods, heatwaves and drought worse, costing Australians billions every year. Make fossil fuel producers pay a climate disaster levy to help pay...

Make the fossil fuel barons pay, not the tax payers
#auspol

07.10.2025 10:21 — 👍 4    🔁 4    💬 0    📌 0

Great story on ABC #4Corners now on how online gambling companies operate, the damage they do to the community & how they keep the major political parties in the pockets.

06.10.2025 09:43 — 👍 88    🔁 34    💬 2    📌 3
Preview
Optimal GeoParquet Partitioning Strategy My quest for one-size-fits-all format

Definitely depends what you're up to! People are definitely experimenting with serving features directly from Parquet files for visualization (e.g., medium.com/center-for-c... )

06.10.2025 03:14 — 👍 2    🔁 2    💬 1    📌 1

is Scott Morrison in jail yet?

Impressive to outdo John Howard for evil

06.10.2025 10:23 — 👍 0    🔁 0    💬 0    📌 0

This evening, President Trump is ordering 400 members of the Texas National Guard for deployments to Illinois, Oregon, and other locations within the United States. No officials from the federal government called me directly to discuss or coordinate.

06.10.2025 01:44 — 👍 13505    🔁 4310    💬 707    📌 401
Video thumbnail

Watch | Israeli occupation forces have released new drone footage from Gaza, revealing the mass destruction and the erasure of every means of life in the Strip.

06.10.2025 00:52 — 👍 22    🔁 19    💬 2    📌 9

Amsterdam, we salute you!
#Auspol

06.10.2025 09:40 — 👍 9    🔁 4    💬 0    📌 0

The claim seems widely overstated because in the end it is about selecting the partitioning in order to push as much computation into storage layer as possible. Selecting the partitioning is mostly about access pattern just like db indexes

05.10.2025 15:45 — 👍 0    🔁 1    💬 0    📌 0

Oh yes I'm using gdalwarp here.

User draws on wmts and it looks at a .vrt file to match which geotiffs it overlays. That way I don't have to display the tifs.

Thinking of adding more crs options for the output and choice between geotiff and jpg2000. Just prototyping this

05.10.2025 10:38 — 👍 1    🔁 1    💬 0    📌 0
Preview
Why GeoParquet Is A Poor Cloud Native Format · cloudnativegeo cloud-optimized-geospatial-formats-guide · Discussion #82 (Geo)Parquet is an extremely useful column oriented data format. When working with local, massive data sets having many millions of features, the performance of this format is second to none. Plent...

Found a rich seam here: github.com/cloudnativeg...

05.10.2025 10:31 — 👍 3    🔁 1    💬 1    📌 0

Got the gdal clip working. First time hearing about ALB, security groups, ECS, services, task definitions, cloudfront, a bunch of other bs. I think I can scale this (so says Claude 🤷‍♂️)

03.10.2025 23:42 — 👍 2    🔁 1    💬 1    📌 0
Data – GeoArrow

- FGB is great! Lookups with small ranges work well but full scans are much slower (particularly if zipped!)
- SedonaDB, DuckDB, and pyarrow with fsspec can all do partial Parquet reads from http (give geoarrow.org/data a try!)

05.10.2025 01:39 — 👍 1    🔁 1    💬 1    📌 0

I did a filter on the ibis example

I think you're talking about tooling problems

04.10.2025 23:57 — 👍 0    🔁 0    💬 0    📌 0

Nice! Could use gdalwarp (gdal raster reproject) to clip and write (and resize or change crs) in one step fwiw

(gdal raster reproject has a --bbox-crs arg, and this can be pipelined or written in json as a GDALG virtual dataset)

04.10.2025 20:31 — 👍 1    🔁 0    💬 1    📌 0
draw custom polygon over wmts imagery service in web mercator, converts to UTM zone 19 and clips the corresponding geotiff (4band, 15cm, 16bit) and delivers to user

draw custom polygon over wmts imagery service in web mercator, converts to UTM zone 19 and clips the corresponding geotiff (4band, 15cm, 16bit) and delivers to user

Hacking together a Clip-Zip-Ship tool: draw custom polygon over wmts imagery service in web mercator, converts to UTM zone 19 coordinates and clips the corresponding geotiff (4band, 15cm, 16bit) and delivers to user.

Next: Select town as clipping boundary, give 100m buffer

04.10.2025 00:05 — 👍 4    🔁 1    💬 2    📌 0

(open issue in nanoparquet was the unfinished bit there)

04.10.2025 20:16 — 👍 0    🔁 0    💬 0    📌 0

right, you mean arrow open_dataset() won't open a URI?

I don't know why that is (or if it's an isolated problem, it's an open issue in )

gist.github.com/mdsumner/e55...

I don't see any issues with queries otherwise in ibis or GDAL

not sure what you mean otherwise

04.10.2025 20:14 — 👍 0    🔁 0    💬 2    📌 0

If you sozip that FGB, it becomes 2GB as well.

Bonus points. Try accessing that GeoParquet remotely on a simple web server. You can't. Not without a backend. Uncompressed FGB? No problem.

04.10.2025 05:10 — 👍 0    🔁 1    💬 1    📌 0

lgtm (it surely can take time for support to roll out for new facilities, and downstream noise always complicates the scene, but it's not inherently bound to a backend)

gist.github.com/mdsumner/e55...

04.10.2025 11:46 — 👍 2    🔁 0    💬 1    📌 0

I cannot say enough about DataFusion...in order to build an engine that considers spatial types at every level we needed to customize types, functions, optimizer rules, joins, Parquet pruning, and more. DataFusion not only made this possible but documented even the most obscure bits. So cool!

25.09.2025 01:35 — 👍 25    🔁 3    💬 0    📌 0

Interesting...I think of GeoParquet and GeoArrow as tech that allows more people to avoid the cloud (i.e., use your laptop to do things that everyone thought they needed "the cloud" for five years ago). When your 14 GB .fgb files become 2GB GeoParquets, you can fit a whole lot more of them locally!

03.10.2025 20:47 — 👍 11    🔁 4    💬 1    📌 1

my most charitable opinion is that I also don't love that basically all software engineering currently requires using bespoke tooling and renting clock time from 1 of 3 corps

but that does mean that everyone everywhere needs tools for dealing with those corp's tech... not just the hyperscalers

03.10.2025 11:42 — 👍 4    🔁 1    💬 0    📌 0
Video thumbnail

🚨 “They put black people in one van and immigrants an another… Trucks and military style vans were used to separate children from their parents.” THIS. IS. HAPPENING. People need to see this. We all need to see it. Nobody gets a pass here. Please share with your networks. 1/2

03.10.2025 00:47 — 👍 10119    🔁 7570    💬 776    📌 678
Preview
R-multiverse: a new way to publish R packages · Community Call Community Call about R-multiverse

You might also be interested in the R-multiverse built on top of R-universe.
ropensci.org/commcalls/r-...

03.10.2025 12:27 — 👍 3    🔁 2    💬 0    📌 0

@mdsumner is following 20 prominent accounts