Orfan

Open Repository for Academic kNowledge

Orfan is a data storage system. The system has a simple structure, all the dataset are stored in a folder structure, with potential sub folders, where each dataset is in one folder. The dataset folder has to contain one "meta.json" file (further descried below) that has the relevant meta data of the data set. This folder structure is then automatically scraped using the orfan.py script to generate a data.js file that is then used by a web page to produce a searchable web page of all the datasets.

In our setup all the datasets are located under the "data" folder and the searchable web page in is found in html/index.html.

How to add a new dataset

Create a folder with the dataset inside the "data" folder. Preferable in one of the subdirectories.
Add representative thumbnails to the dataset folder
Copy the "meta.json" template file from the root to the new folder
Fill in all the relevant information.

Meta data

The "meta.json" file contains a list of meta data that we have considered useful. Some is required and some are optional.

name (required) A name of the dataset, try and be descriptive.
origin (required) Entity or Person that has created/published the dataset.
license (required) How are you allowed to use the dataset.
contact (required) Our contact person / the person that added the dataset.
tags (required) A list of descriptive that for the dataset. First try and use existing tags before adding new ones.
description Information about the dataset.
link A link to the origin if available.
notes Other relevant information.
software A lost of software that can be used to open the dataset. A special "software.json" file is located in the "data" folder that describe different softwares in more detail. In this list we just reefer to items in that file.
acknowledgements A explanation of what you should to acknowledge the creators of the dataset in the case that you use it. For example add something to the acknowledgements section of the paper, or cite this articles.
citations A list of papers that use this dataset. Note this is not the papers that you should site for acknowledgements, they are in the acknowledgements section. A citation contains a type which can be either "plain", "DOI", or "BibTex". And a payload holding the actual citation.
thumbnails (at least one required) A list of thumbnails for the dataset. A thumbnail contains a filename and a caption.
files A list of the relevant files in the dataset. A item in the file list contains
- name (required) The file name. This can be a pattern including wildcards to match multiple files at the same time. For example "dataset-???.dat" can be used to match a series of files. The supported wildcards can be found at https://docs.python.org/3/library/glob.html. This can also be list of names / pattern. The complete list of matched files will be expanded automatically.
- description (required) information about the file.
- format data format.
- resolution resolution of the dataset of available.
- tags list of that for that file

Name		Name	Last commit message	Last commit date
Latest commit History 59 Commits
doc		doc
html		html
orfan		orfan
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
meta.json		meta.json
orfan.py		orfan.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Orfan

Open Repository for Academic kNowledge

How to add a new dataset

Meta data

About

Releases

Packages

Contributors 3

Languages

License

SciVis/Orfan

Folders and files

Latest commit

History

Repository files navigation

Orfan

Open Repository for Academic kNowledge

How to add a new dataset

Meta data

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages