Created May 31, 2020 20:51
I thought carefully about the Zarr issue, and I think you're absolutely right that Zarr is not the right fit for your output data. My comments about potential support for uneven chunks are not really relevant. We should absolutely be using Parquet for this. Perhaps you could go one step further and actually use Parquet in the post?
In order to do this, we would just have to convert to a dask dataframe. Have you tried that?
@nbren - sorry if I came off as overly critical today. It's really an excellent post about an important use case. Sometimes I have to wear the hat of Pangeo PR manager, which means I'm [overly] sensitive to the language used to describe the things we are working on.
Here are a few specific examples of how you might rephrase some sentences in a more optimistic way. "Fail" in particular is, IMO, a very strong word that should be used sparingly, as it carries quite negative connotations.
Here I would definitely link out to dask/dask#5544. I believe we can really get this fixed.
More thoughts soon...