Sid's Avatar

Sid

@sid-sub.bsky.social

Founder terrafloww.com, Data engineer Loves nature, space, and geo tech

47 Followers  |  148 Following  |  18 Posts  |  Joined: 29.11.2024  |  1.784

Latest posts by sid-sub.bsky.social on Bluesky

Preview
We must stop selling files: The case for streaming tensor... Rasteret sped up geo image reads by 10x. But users still spend 80% time doing ETL before even feeding GPUs. Its like having a fast car on roads full of potholes. We investigate why, and share our solu...

Broadcast to sell, buyers see the feed and stream the pixels they want not files, monitor and monetize each byte.

If you are building for GeoAI, your data infrastructure shouldn't be your product, focus on your science and logic. Read our full blog here: blog.terrafloww.com/streaming-te...

22.01.2026 08:51 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Introducing Rasteret SDK & Terrafloww Platform.
We are killing the "search, discover, and file download" ceremony.

β€’ Rasteret SDK: Stream EO images from S3 to GPU.
β€’ Terrafloww Platform: Data Discovery & Metering

Think "YouTube for Geo Imagery"

22.01.2026 08:49 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Defense "ate" everything in geospatial in 2025. They bought proprietary systems that just work. The commercial EO market is left with "Download to Desktop" buttons and draw polygons on map to get data.
Fresh out of the AWS Space Accelerator, we're launching the counter-move. 🧡

22.01.2026 08:46 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

⚑️Quick stat, Rasteret library gets 1 year's worth of #landsat satellite's #NDVI time series for a small farm is less than 4 seconds!

Blog - blog.terrafloww.com/rasteret-a-l...

Code - github.com/terrafloww/r...

#geospatial

12.01.2025 17:46 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Thanks to @maxlenormand.bsky.social , Soumya ranjan from @developmentseed.org , my colleague Gajesh Ladhar from Satsure , for thier early feedbacks.

12.01.2025 07:34 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
GitHub - terrafloww/rasteret: A library for fast reads of Cloud Optimized Geotiff satellite images, using GeoParquet as COG metadata cache A library for fast reads of Cloud Optimized Geotiff satellite images, using GeoParquet as COG metadata cache - terrafloww/rasteret

Been working on 'Rasteret' since the last blog I wrote, its out now as an early release.

More details - blog.terrafloww.com/rasteret-a-l...

Open to feedback and contributions, there is much more exciting work to do!
github.com/terrafloww/r...

#geospatial #cloudnativegeo #opensource

12.01.2025 06:37 β€” πŸ‘ 5    πŸ” 2    πŸ’¬ 1    πŸ“Œ 1

Sorry to hear about this. Take care.

10.01.2025 15:46 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

True totally agreed that its tough. But im wondering how PyPI gets funded by donations from both community developers and companies like Meta and so on, i guess it just due to sheer numbers? Geo is pretty small compared to general Python.

09.01.2025 13:11 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

It doesn't need to another private company setting it up and owning it. It can be truly open and community + private industry funded. Unlike PyPI which hosts full libraries, the geo dataset registry doesn't need to keep STACs or any datasets inside it. A few Postgres/ES instances might be enough.

09.01.2025 12:12 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

I mean PyPI registry's framework/design and governance is close to what can be done for geo as well.

09.01.2025 12:02 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

I think PyPI is interesting. Its completely funded by donations. Controlled by python software foundation. Also, since I thinking there will just be pointers to STACs and Non-STACs I don't think it will be costly to maintain.

09.01.2025 12:01 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0

Similar to PyPI, data producers can push just metadata of datasets in toml files, to a central registry. It would only contain a "summary metadata" of entire STACs, parquet/csv files, APIs like OSM. With thier total bbox, quality metrics etc. A registry like this can be more easily governed i feel?

09.01.2025 08:08 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Christopher Beddow on LinkedIn: #gischat #maps #api #data #geospatial | 14 comments A universal map data API aggregator? Does it exist? I saw this for flight and travel APIs and some others. You can subscribe to this service, and it acts as… | 14 comments on LinkedIn

@cbed.bsky.social had put up a question on LinkedIn about finding all datasets STAC or not. And his idea was an aggregator of datasets in YAML. Similarly I feel PyPI registry is a good thing to emulate, with geopip to get data and pass it to pystac/duckdb/requests based on details sent via toml

09.01.2025 07:20 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 3    πŸ“Œ 0
A cropping of the immigration flow chart that reads β€œBorn in India? Y -> DENIED”

A cropping of the immigration flow chart that reads β€œBorn in India? Y -> DENIED”

I know it shouldn’t but this bit is so petty it made me lol

03.01.2025 16:29 β€” πŸ‘ 24    πŸ” 1    πŸ’¬ 2    πŸ“Œ 1

The completely open source code of iceberg and the basic REST catalog still provides great features for most people. Read/filter, write/append cloud based data (usually parquet) using just Pyiceberg. Merge/update rows are possible via Trino/Spark engines, it should come soon to Pyiceberg as well.

17.12.2024 08:09 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
A journey of optimization of cloud-based geospatial data processing | Hacker News

Feels good to have #geospatial blog on Hacker News front page!

Chime in, on the comments if you wish!

news.ycombinator.com/item?id=4241...

15.12.2024 17:36 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

As I get closer to releasing the open source library which I will call, "Rasteret" , short for raster-retrieval, I reflected on the blog I published last week, and edited it to include some more details regarding compute and its code samples.

Check it out and do share your thoughts!

#geospatial

08.12.2024 13:58 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Could you add me please? Thanks

06.12.2024 05:22 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Efficient Cloud-Native Raster Data Access: An Alternative to Rasterio/GDAL A journey of optimization of cloud-based geospatial data processing. Introducing a new approach to raster data access, harnessing the power of STAC GeoParquet and cloud-native workflows to push the bo...

We discovered a 5x faster approach to querying raster time series data at Terrafloww.

Inspired by #cloudnativegeospatial (GeoParquet, COG, GeoArrow), we built a novel query method.
Read more & share your thoughts!

blog.terrafloww.com/efficient-cl...

#geospatial #cloudnativegeo

30.11.2024 09:01 β€” πŸ‘ 6    πŸ” 0    πŸ’¬ 0    πŸ“Œ 1

@sid-sub is following 20 prominent accounts