subich/spark-in-docker

Docker Image for Apache Spark

This repo contains a Dockerfile, Spark supporting files, and a compose file for running a complete Spark cluster in Docker.

The image is based on Alpine Linux and installs everything required to run Python 3 applications on Spark.

The compose file starts a master node, a worker node, and the Spark history server.

The cluster can be scaled by passing --scale worker=n to the docker compose up command, where n is the desired number of worker nodes.
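As a rough sketch of the scaling step, the commands below bring the cluster up with three workers. The value 3 and the `-d` (detached) flag are illustrative choices, not taken from this repo. The commands assume they are run from the directory containing the compose file, and that the worker service does not pin a container name or a fixed host port, since either would prevent Compose from starting more than one replica.

```shell
# Start the master, the history server, and three worker containers
# in the background. "worker" is the service name used in this README;
# 3 is an arbitrary example value for n.
docker compose up -d --scale worker=3

# List the running containers to confirm that three workers started.
docker compose ps
```

With the older standalone `docker-compose` binary, the same `--scale worker=3` argument should work in place of the `docker compose` plugin form.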
