Skip to content

myrandaGoesToSpace/NCEI_metadatabase

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

NCEI Metadatabase

Constructing a graph database for NCEI metadata

Overview

This repository demonstrates the creation of a graph metadatabase using metadata from NCEI's Geoportal (https://www.ncei.noaa.gov/metadata/geoportal/#searchPanel). The metadata is collected using a Catalogue Service for the Web (CSW) and converted to Resource Description Framework (RDF) format for import into a graph database.

How to Use

Libraries

The scripts in this repo use Python 3.7 and a combination of packages listed below. You can download these using pip install:

  • Beautiful Soup
  • KGLab
  • RDFLib
  • Numpy
  • Pandas
  • Matplotlib

NOTE: An environment package will be posted in future updates to eliminate the need to install multiple packages.

Visualization

The graph visualization is demonstrated in visualization_example.ipynb. Run this notebook in Jupyter Notebook to visualize a portion of the NCEI metadata graph.

Scripts

The script request_to_rdf.py converts a CSW request to an RDF file which can then be used in a graph database. To run this script, open the file, edit the URL and desired number of results accordingly (default is to search for first 10 items on https://www.ncei.noaa.gov/metadata/geoportal/), and run the script with the command python request_to_rdf.py.

Planned Updates

  • Environment package
  • Expanded options of vocabularies for metadata

About

Linking NCEI metadata to enable better data discovery

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published