Store the NW2 baseline simulations on the cloud #8
Comments
I think we can ask OSN for more space. I will do that and follow up here when I have a response.
Thanks for checking with OSN @rabernat! Have you heard back from them?
It would be good if we could make progress on this issue soon, since the revisions of several manuscripts depend on having the data openly available. If OSN is not an option, what are the alternatives?
@NoraLoose Thanks for that! I agree we need to find a solution since we have to submit the revisions of NW2 (soonish).
Sorry for not replying sooner. Yes, we can store the data in OSN via Pangeo Forge! To ingest the data, the next step is to open an issue here: https://github.com/pangeo-forge/staged-recipes/issues @NoraLoose - if you can get that started, I will chime in and help move it forward.
Things have evolved considerably since I wrote that. Pangeo Forge didn't even exist yet. 🙃 The instructions are not wrong per se, but we have better systems now. This page explains what Pangeo Forge is and why we are building it: https://pangeo-forge.readthedocs.io/en/latest/what_is_pangeo_forge.html
@gustavo-marques, @adcroft, @LaureZanna and others: Before we start the process over at pangeo-forge, let's agree on which of the NeverWorld2 output files we want to make available on the cloud. We have 8 experiments (4 different resolutions, with
Do we want to make all of these available? Am I missing some?
@NoraLoose, thank you very much for working on this! The only file you missed is Vertical_coordinate.nc, but I do not know if anyone is using this file in their analysis. We do not use it in the NW2 description paper. @ElizabethYankovsky, please let us know if you use this file in your analysis. We might want to combine ocean.stats.* into single files since these files are small. The 0.25 deg runs have a lot of restart files. Perhaps we only need to upload a few of them (every 10 years?).
Thanks @NoraLoose and @gustavo-marques! No, I'm not using the Vertical_coordinate.nc file in my analysis.
Looks good to me @NoraLoose!
Yes, I think we can skip the Could you work on combining the ocean.stats.* into a single file? (Simply concatenating the files will result in double time stamps.) Then I will get the process going at pangeo-forge in the meantime. As for the restart files: Maybe we only need the last (and possibly first) restart file, so people can extend the NW2 time series for each resolution?
@NoraLoose: yes, I will combine the ocean.stats.* for each experiment. Thanks!
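For reference, the duplicate time stamp problem mentioned above can be handled by dropping repeated times while concatenating. The following is only a minimal sketch in plain Python, not the procedure actually used for the NW2 files: it assumes each segment has already been read into a list of (time, value) records, and the record layout, the function name `combine_segments`, and the sample numbers are all made up for illustration.

```python
from typing import Iterable, List, Tuple

# Hypothetical record layout: (time in days, some diagnostic value).
Record = Tuple[float, float]


def combine_segments(segments: Iterable[List[Record]]) -> List[Record]:
    """Concatenate segments in order, keeping the first record per time stamp."""
    combined: List[Record] = []
    seen = set()
    for segment in segments:
        for time, value in segment:
            if time in seen:
                # This time stamp was already written by an earlier segment.
                continue
            seen.add(time)
            combined.append((time, value))
    return combined


# Two made-up segments; the second one repeats the last time stamp of the first.
seg_a = [(0.0, 1.0), (5.0, 1.1), (10.0, 1.2)]
seg_b = [(10.0, 1.2), (15.0, 1.3)]

combined = combine_segments([seg_a, seg_b])
times = [t for t, _ in combined]
print(times)
```

Keeping the first occurrence is a choice made for this sketch; if the record at the start of a restarted segment is the one to trust, the later occurrence could be kept instead. Matching on exact float equality only works when the repeated stamps are written identically.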
@NoraLoose: I've combined the |
Great to see this proposal! The main blocker here is getting the data off of glade. In order for Pangeo Forge to access the data, CISL needs to upgrade their Globus installation to V5. Fortunately, according to CISL, that should be very soon (days or weeks). @gustavo-marques and @NoraLoose - could you let me know your UCAR usernames so I can have you added to the Globus trial?
My username is noraloose.
Mine is gmarques.
Our Cloud Data Guide states that the CPT has access to a 10 TB allocation on OSN. To store all the baselines we will need ~17.5 TB of storage (assuming the compression when converting the data into Zarr won't matter too much). We only saved the "full output" over the last 2,000 days of simulation, so there is not much data cleaning to be done. Any ideas on how we should proceed?
cc'ing @rabernat @adcroft @NoraLoose
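To put numbers on the gap, here is a rough back-of-the-envelope sketch using only the two figures quoted above (the ~17.5 TB estimate and the 10 TB allocation). The variable names are illustrative, and the real compression achieved by the Zarr conversion is unknown at this point.

```python
# Figures quoted in this issue; everything derived from them is an estimate.
required_tb = 17.5    # approximate size of all NW2 baselines
allocation_tb = 10.0  # current OSN allocation

# Extra space needed if the Zarr conversion does not shrink the data at all.
shortfall_tb = required_tb - allocation_tb

# Fraction of the original size the data would have to shrink to
# in order to fit into the existing allocation without asking for more.
max_size_fraction = allocation_tb / required_tb

print(f"shortfall: {shortfall_tb:.1f} TB")
print(f"data must shrink to {max_size_fraction:.0%} of its size to fit")
```

In other words, unless the conversion brings the data down to a bit under 60% of its current size, the existing allocation is not enough.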