Data provenance and preparation #875

Open
4 tasks
Tracked by #974
abelsiqueira opened this issue Oct 14, 2024 · 1 comment
Labels
epic Epic issues (collection of smaller tasks towards a goal)

Comments

@abelsiqueira
Member

abelsiqueira commented Oct 14, 2024

Description

Based on our in-person discussion on 14/Oct/2024, here is an overview of what we need.
Note that these steps generally happen before TulipaEnergyModel, or even TulipaClustering, is involved.

Sub issues

  • UI-Friendly TulipaIO
    There needs to be a way to convert from a user-oriented data file (e.g., wide format) to a Tulipa-oriented format
  • Find a data storage solution
    The EU data is too large and needs to be moved somewhere that makes it easier to download and avoids downloading it with every clone.
    Furthermore, we want users to be able to easily download the data for the examples.
  • Limit data-loading to only the necessary queries
    Somewhere (where?), the full dataset will be stored (e.g., 50 years of data). We only want to load a selection of it into memory (e.g., 1 year). How do we do that? Does this imply a particular file format or layout for the stored data?
  • "Source of truth"
    How can we set up a "source of truth" for the data? Consider the situation where the correct data is made available by some "higher" entity. This data is collected by some TNO employee and then moved into another place/format that is considered the "truth".
    What can be used for this? Microsoft Access is the current solution.
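
To make the first and third sub issues more concrete, here is a minimal sketch in Python using only the standard library. It is not TulipaIO code and does not reflect an agreed design. The column names, the `profiles` table, and the helper functions `wide_to_long`, `build_store`, and `load_year` are all hypothetical, and SQLite stands in for whatever storage backend is eventually chosen. The idea is to reshape a wide user file into a long table once, and then let a filtered query decide what reaches memory.

```python
import csv
import io
import sqlite3

# Hypothetical wide-format user file: one row per timestep, one column per asset.
WIDE_CSV = """year,timestep,wind,solar
2029,1,0.50,0.00
2029,2,0.60,0.10
2030,1,0.40,0.00
2030,2,0.30,0.20
"""

# Columns that identify a row; every other column is treated as an asset profile.
ID_COLUMNS = ("year", "timestep")


def wide_to_long(wide_text):
    """Yield (year, timestep, asset, value) tuples from wide-format CSV text."""
    reader = csv.DictReader(io.StringIO(wide_text))
    for row in reader:
        for column, value in row.items():
            if column in ID_COLUMNS:
                continue
            yield int(row["year"]), int(row["timestep"]), column, float(value)


def build_store(rows):
    """Put the long-format rows into an in-memory SQLite table (assumed schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE profiles (year INTEGER, timestep INTEGER, asset TEXT, value REAL)"
    )
    conn.executemany("INSERT INTO profiles VALUES (?, ?, ?, ?)", rows)
    conn.commit()
    return conn


def load_year(conn, year):
    """Fetch only the rows for one year instead of reading the whole table."""
    cursor = conn.execute(
        "SELECT timestep, asset, value FROM profiles "
        "WHERE year = ? ORDER BY timestep, asset",
        (year,),
    )
    return cursor.fetchall()


store = build_store(wide_to_long(WIDE_CSV))
selection = load_year(store, 2030)
print(selection)
```

With a file-based database instead of `:memory:`, the same `WHERE` clause would keep the other years on disk. Whether that is the right storage format is exactly the open question above.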

Related:

@abelsiqueira abelsiqueira added the epic Epic issues (collection of smaller tasks towards a goal) label Oct 14, 2024
@clizbe
Member

clizbe commented Mar 6, 2025

For data storage, I think we should consider Zenodo. It takes data in all forms, complete or in-progress. We can also keep it private at first if we want, and only share it with certain people.
