We provide instructions for recreating the SNOME-CT Core dataset used in our work.
- Apply for a UMLS license here.
- Download the 2020AA release of the UMLS. Store the top-level directory (2020AA) in
robust-kg-completion/data/UMLS/
. - Download the August 2020 version of the CORE Problem List Subset Data Files from here. Store the data files in
robust-kg-completion/data/UMLS/SNOMEDCT_CORE_SUBSET_202008
- Run
./scripts/extract_snomed.sh
.