Dockerized hadoop environment with spark, jupyter, livy(REST apis for spark)
- Apache Hadoop 2.8.4
- Apache Spark 2.4.0
- Jupyter Notebook
- Conda Environment
- Apache Livy (REST apis for spark)
Pull docker image from docker hub repository
$ docker pull bhavik9243/hadoop-spark-jupyter:latest
$ docker run -itd --name hadoop_cluster --hostname localhost -v /Users/bhavik/work/notebooks:/root/notebooks -p 8888:8888 -p 8998:8998 -p 4040:4040 -p 50070:50070 -p 50075:50075 -p 8088:8088 -p 8042:8042 bhavik9243/hadoop-spark-jupyter:latest
$ docker start hadoop_cluster
$ docker stop hadoop_cluster
HDFS : http://127.0.0.1:50070
YARN : http://127.0.0.1:8088
Jupyter Notebook : http://127.0.0.1:8888
Password :
letmein
LIVY UI : http://127.0.0.1:8998
Explore more about LIVY : Livy Documentation