Sid's Avatar

Sid

@sid-sub.bsky.social

Founder terrafloww.com, Data engineer Loves nature, space, and geo tech

46 Followers  |  148 Following  |  15 Posts  |  Joined: 29.11.2024  |  1.9558

Latest posts by sid-sub.bsky.social on Bluesky

⚑️Quick stat, Rasteret library gets 1 year's worth of #landsat satellite's #NDVI time series for a small farm is less than 4 seconds!

Blog - blog.terrafloww.com/rasteret-a-l...

Code - github.com/terrafloww/r...

#geospatial

12.01.2025 17:46 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Thanks to @maxlenormand.bsky.social , Soumya ranjan from @developmentseed.org , my colleague Gajesh Ladhar from Satsure , for thier early feedbacks.

12.01.2025 07:34 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
GitHub - terrafloww/rasteret: A library for fast reads of Cloud Optimized Geotiff satellite images, using GeoParquet as COG metadata cache A library for fast reads of Cloud Optimized Geotiff satellite images, using GeoParquet as COG metadata cache - terrafloww/rasteret

Been working on 'Rasteret' since the last blog I wrote, its out now as an early release.

More details - blog.terrafloww.com/rasteret-a-l...

Open to feedback and contributions, there is much more exciting work to do!
github.com/terrafloww/r...

#geospatial #cloudnativegeo #opensource

12.01.2025 06:37 β€” πŸ‘ 5    πŸ” 2    πŸ’¬ 1    πŸ“Œ 1

Sorry to hear about this. Take care.

10.01.2025 15:46 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

True totally agreed that its tough. But im wondering how PyPI gets funded by donations from both community developers and companies like Meta and so on, i guess it just due to sheer numbers? Geo is pretty small compared to general Python.

09.01.2025 13:11 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

It doesn't need to another private company setting it up and owning it. It can be truly open and community + private industry funded. Unlike PyPI which hosts full libraries, the geo dataset registry doesn't need to keep STACs or any datasets inside it. A few Postgres/ES instances might be enough.

09.01.2025 12:12 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

I mean PyPI registry's framework/design and governance is close to what can be done for geo as well.

09.01.2025 12:02 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

I think PyPI is interesting. Its completely funded by donations. Controlled by python software foundation. Also, since I thinking there will just be pointers to STACs and Non-STACs I don't think it will be costly to maintain.

09.01.2025 12:01 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0

Similar to PyPI, data producers can push just metadata of datasets in toml files, to a central registry. It would only contain a "summary metadata" of entire STACs, parquet/csv files, APIs like OSM. With thier total bbox, quality metrics etc. A registry like this can be more easily governed i feel?

09.01.2025 08:08 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Christopher Beddow on LinkedIn: #gischat #maps #api #data #geospatial | 14 comments A universal map data API aggregator? Does it exist? I saw this for flight and travel APIs and some others. You can subscribe to this service, and it acts as… | 14 comments on LinkedIn

@cbed.bsky.social had put up a question on LinkedIn about finding all datasets STAC or not. And his idea was an aggregator of datasets in YAML. Similarly I feel PyPI registry is a good thing to emulate, with geopip to get data and pass it to pystac/duckdb/requests based on details sent via toml

09.01.2025 07:20 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 3    πŸ“Œ 0
A cropping of the immigration flow chart that reads β€œBorn in India? Y -> DENIED”

A cropping of the immigration flow chart that reads β€œBorn in India? Y -> DENIED”

I know it shouldn’t but this bit is so petty it made me lol

03.01.2025 16:29 β€” πŸ‘ 24    πŸ” 1    πŸ’¬ 2    πŸ“Œ 1

The completely open source code of iceberg and the basic REST catalog still provides great features for most people. Read/filter, write/append cloud based data (usually parquet) using just Pyiceberg. Merge/update rows are possible via Trino/Spark engines, it should come soon to Pyiceberg as well.

17.12.2024 08:09 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
A journey of optimization of cloud-based geospatial data processing | Hacker News

Feels good to have #geospatial blog on Hacker News front page!

Chime in, on the comments if you wish!

news.ycombinator.com/item?id=4241...

15.12.2024 17:36 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

As I get closer to releasing the open source library which I will call, "Rasteret" , short for raster-retrieval, I reflected on the blog I published last week, and edited it to include some more details regarding compute and its code samples.

Check it out and do share your thoughts!

#geospatial

08.12.2024 13:58 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Could you add me please? Thanks

06.12.2024 05:22 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Efficient Cloud-Native Raster Data Access: An Alternative to Rasterio/GDAL A journey of optimization of cloud-based geospatial data processing. Introducing a new approach to raster data access, harnessing the power of STAC GeoParquet and cloud-native workflows to push the bo...

We discovered a 5x faster approach to querying raster time series data at Terrafloww.

Inspired by #cloudnativegeospatial (GeoParquet, COG, GeoArrow), we built a novel query method.
Read more & share your thoughts!

blog.terrafloww.com/efficient-cl...

#geospatial #cloudnativegeo

30.11.2024 09:01 β€” πŸ‘ 6    πŸ” 0    πŸ’¬ 0    πŸ“Œ 1

@sid-sub is following 20 prominent accounts