Skip to content

Instantly share code, notes, and snippets.

@rothnic
Created April 7, 2017 11:14
Show Gist options
  • Save rothnic/9654071815933696881464c3f5164021 to your computer and use it in GitHub Desktop.
Save rothnic/9654071815933696881464c3f5164021 to your computer and use it in GitHub Desktop.
Native Spark Cluster on Docker with Jupyter Notebook and PySpark
master:
image: jupyter/pyspark-notebook
command: bash -c "/usr/local/spark/bin/spark-class org.apache.spark.deploy.master.Master -h master && \
start-notebook.sh"
hostname: master
environment:
MASTER: spark://master:7077
SPARK_CONF_DIR: /conf
SPARK_PUBLIC_DNS: localhost
expose:
- 7001
- 7002
- 7003
- 7004
- 7005
- 7006
- 7077
- 6066
ports:
- 4040:4040
- 6066:6066
- 7077:7077
- 8080:8080
- 8888:8888
- 10000:10000
volumes:
- ./conf/master:/conf
- ./data:/home/jovyan/work
worker:
image: jupyter/pyspark-notebook
command: /usr/local/spark/bin/spark-class org.apache.spark.deploy.worker.Worker spark://master:7077 -d /home/jovyan/work
hostname: worker
environment:
SPARK_CONF_DIR: /conf
SPARK_WORKER_CORES: 2
SPARK_WORKER_MEMORY: 1g
SPARK_WORKER_PORT: 8881
SPARK_WORKER_WEBUI_PORT: 8081
SPARK_PUBLIC_DNS: localhost
links:
- master
expose:
- 7012
- 7013
- 7014
- 7015
- 7016
- 8881
ports:
- 8081:8081
volumes:
- ./conf/worker:/conf
- ./data:/home/jovyan/work
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment