Knesset data pipelines

Data processing pipelines for loading, processing and visualizing data about the Knesset

Uses the datapackage pipelines and DataFlows frameworks.

Quickstart for data science

Follow this method to get started quickly with exploration, processing and testing of the knesset data.

Running using Docker

Install Docker for Windows, Mac or Linux

Pull the latest Docker image

docker pull orihoch/knesset-data-pipelines

Create a directory which will be shared between the host PC and the container:

sudo mkdir -p /opt/knesset-data-pipelines

Start the Jupyter lab server:

docker run -it -p 8888:8888 --entrypoint jupyter \
           -v /opt/knesset-data-pipelines:/pipelines \
           orihoch/knesset-data-pipelines lab --allow-root --ip 0.0.0.0 --no-browser \
                --NotebookApp.token= --NotebookApp.custom_display_url=http://localhost:8888/

Access the server at http://localhost:8888/

Open a terminal inside the Jupyter Lab web-ui, and clone the knesset-data-pipelines project:

git clone https://github.com/hasadna/knesset-data-pipelines.git .

You should now see the project files on the left sidebar.

Access the jupyter-notebooks directory and open one of the available notebooks.

You can now add or make modifications to the notebooks, then open a pull request with your changes.

You can also modify the pipelines code from the host machine and it will be reflected in the notebook environment.

Contributing

Looking to contribute? check out the Help Wanted Issues or the Noob Friendly Issues for some ideas.

Useful resources for getting acquainted:

DPP documentation
Code for the periodic execution component
Info on available data from the Knesset site
Living document with short list of ongoing project activities

Name		Name	Last commit message	Last commit date
Latest commit History 765 Commits
bills		bills
bin		bin
committees		committees
data_samples		data_samples
datapackage_pipelines_knesset		datapackage_pipelines_knesset
jupyter-notebooks		jupyter-notebooks
knesset		knesset
laws		laws
lobbyists		lobbyists
members		members
people		people
plenum		plenum
votes		votes
votes_kmember		votes_kmember
web_ui		web_ui
.dockerignore		.dockerignore
.dpp_spec_ignore		.dpp_spec_ignore
.gitignore		.gitignore
.travis-deploy.sh		.travis-deploy.sh
.travis.yml		.travis.yml
Dockerfile		Dockerfile
Dockerfile.full		Dockerfile.full
LICENSE		LICENSE
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
README.md		README.md
boto.config		boto.config
dataservice_collection_grafana_dashboard.json		dataservice_collection_grafana_dashboard.json
gsutil_cp_data.sh		gsutil_cp_data.sh
rename_resource.py		rename_resource.py
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Knesset data pipelines

Quickstart for data science

Running using Docker

Contributing

About

Releases

Packages

Languages

License

tzoof/knesset-data-pipelines

Folders and files

Latest commit

History

Repository files navigation

Knesset data pipelines

Quickstart for data science

Running using Docker

Contributing

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages