Virtual Zarr Support #11
Comments
Hi @alxmrs! Thanks for sharing. I am not sure how feasible it is to access It would be feasible to write client libraries for other programming languages, but the For VirtualiZarr, is this a server-side implementation that returns data in chunks (like Apache Arrow Feather)? Or is the general idea to read Additionally, this data repository requires metadata attributes that are hard-coded in the primary Open-Meteo repository, e.g. information about the data grid, the time resolution, and the length of each time chunk. These would be easy to expose as a JSON file.
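As a rough illustration of what exposing those constants as a JSON file could look like, here is a minimal Python sketch. The function name `build_grid_metadata` and every field name are hypothetical, and the values are placeholders; the real constants live in the primary Open-Meteo repository and are not reproduced here.

```python
import json


def build_grid_metadata(nx, ny, time_resolution_seconds, time_chunk_length):
    """Collect the constants a client would need to interpret the data files.

    All keys are hypothetical; they only mirror the three kinds of information
    named in the comment above (data grid, time resolution, time-chunk length).
    """
    return {
        "grid": {"nx": nx, "ny": ny},
        "time_resolution_seconds": time_resolution_seconds,
        "time_chunk_length": time_chunk_length,
    }


# Placeholder values for illustration only, not taken from any real dataset.
example = build_grid_metadata(
    nx=4, ny=2, time_resolution_seconds=3600, time_chunk_length=24
)

# Serialize to the JSON text that would be written next to the data files.
text = json.dumps(example, indent=2, sort_keys=True)
print(text)
```

A client in any language could then fetch this one file and learn the grid and chunk layout without depending on the hard-coded values.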
Hey Patrick, Thanks for your response! A few thoughts:
It may not be feasible... yet. But, I think it could be. The beauty of Zarr is that it's more of an array protocol than a file format. Given a few modifications -- namely ZEP003, I think it will be possible to directly read In this scenario, VirtualiZarr would be run up-front in a batch setting to provide metadata files to a bucket somewhere (say, this open data on S3). From there, Zarr-clients should be able to read the files directly. And, ideally, folks would be able to read I'm happy to hear that the open-meteo constants can be exposed in JSON; that makes me suspect such an integration with Zarr is more feasible.
That may be. But, I think you've addressed a really important access pattern that I suspect folks in the Zarr community would want to integrate with (xref: google-research/arco-era5#12). One major benefit I see with having
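To make the "metadata files in a bucket" idea above more concrete, here is a minimal Python sketch of the kind of document a Zarr V3 array carries in its `zarr.json`. The helper name `zarr_v3_array_metadata` is hypothetical, the shapes are placeholders, and the `bytes` codec is only a stand-in, since the actual encoding of the existing files is not described in this thread. It uses a regular chunk grid and does not attempt anything that would depend on ZEP003.

```python
import json


def zarr_v3_array_metadata(shape, chunk_shape, data_type="float32"):
    """Build a Zarr V3 array metadata document as a plain dict.

    Sketch only: a real integration would need codecs and chunk keys that
    match how the existing files are actually laid out.
    """
    return {
        "zarr_format": 3,
        "node_type": "array",
        "shape": list(shape),
        "data_type": data_type,
        "chunk_grid": {
            "name": "regular",
            "configuration": {"chunk_shape": list(chunk_shape)},
        },
        "chunk_key_encoding": {
            "name": "default",
            "configuration": {"separator": "/"},
        },
        "fill_value": "NaN",
        # Placeholder codec: raw little-endian bytes, no compression.
        "codecs": [{"name": "bytes", "configuration": {"endian": "little"}}],
        "attributes": {},
    }


# Placeholder shapes for illustration: (time, y, x).
metadata = zarr_v3_array_metadata(shape=(48, 2, 4), chunk_shape=(24, 2, 4))
document = json.dumps(metadata, indent=2)
print(document)
```

Writing a document like this once, in a batch job, is what would let generic Zarr clients discover the array layout without any Open-Meteo-specific code.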
This is quite a catalog of weather data! Congrats, I’m really impressed.
If you wanted this catalog to be truly overpowered (e.g., useful to weather researchers, geospatial scientists, etc.), I recommend finding a way to make all this data accessible via Zarr. With Zarr V3 around the corner, you should be able to add a few metadata files at the top of each bucket to make everything Zarr-accessible (in a language-agnostic way). Here are a few pointers to get started:
Patrick, please reach out to me over email — I’d love to collaborate with you on what you’re building. (al(at)merose(dot)com).