Disclaimer: The authors of this document are not in charge of the MIP development and its deployment scripts. They have limited knowledge of most of the elements that are deployed. No guaranties are offered as to the correctness of this document.
This repository contains alternative deployment scripts written in order to demonstrate the feasibility of a simpler deployment procedure.
By using more comprehensively the docker technology, it allows for:
- a simpler installation and deployment process,
- lower software requirements on the host server,
- less infrastructure services as part of the platform, and finally,
- improved security as all the services are isolated from the physical network, unless explicitly published.
If anything, the current scripts contains still too many tunable options, which should be simplified.
At this time, and due to the lack of documentation, the following scripts deploy the MIP-local platform without the Data Factory, as it can be done based on version 2.5.3, from Dec 14 2017.
See also the official documentation of the deployment scripts project on Github:
- README file,
- Installation instructions, and some
- More documentation.
These scripts deploy the following software:
-
On the host machine to support the platform:
-
As docker images as services:
-
As docker images, to setup the various databases required:
- create-databases
- woken-db-setup
- Metadata, one of:
- Packaged Data sets: Public, requires sample-meta-db-setup
- Packaged Data sets: Private, requires mip-cde-meta-db-setup
- Ubuntu 16.04 system or RHEL 7.3+
- git
-
Install the software required
$ sudo ./install-$OS.sh # OS=ubuntu or OS=redhat, depending on your system
-
Configure the platform for anything you wish or need to modify by adding your settings in settings.local.sh
For reference, you can see all the parameters which can be tuned in
settings.default.sh
. -
Optional, you can start a web interface to control the docker engine:
$ ./portainer.sh
-
Load the research data into the
LDSM
If you intend to load the research data stored on gitlab (adni, edsd and ppmi datasets), login first:
$ docker login registry.gitlab.com
Then
$ ./load_data.sh
- If necessary, adapt the Database configuration options in
settings.local.sh
. - Check
settings.default.sh
to see the databases which are currently used by default. You can adapt the listDB_SETUP_LIST
to this effect.
- If necessary, adapt the Database configuration options in
-
Start the platform with
$ ./run.sh up -d
-
Add in the folder pointed by
${DB_DATASETS}
your CSV files, PostgresRAW will pick the up and create tables you can query right away automatically.If your clinical data contains variables used as part of the MIP CDEs, these will be taken into account automatically as long as you name your file
harmonized_clinical_data.csv
.
NOTES: if you set SHOW_SETTINGS=true
a printout of all the settings which will be used will be printed before doing anything.
All the settings have default values, but you can change them by either exporting in your shell the setting with its value, or creating settings.local.sh
in the same folder as settings.sh
:
: ${VARIABLE:="Your value"}
Note: To find the exhaustive list of parameters available please take a look at settings.default.sh
.
Settings are taken in the following order of precedence:
- Shell Environment, or on the command line
- Deployment-specific
settings.local.sh
- Default settings
settings.default.sh