@itayB
Created March 7, 2021 17:24
Dockerfile for Jupyter Notebook + Spark
FROM itayb/spark:3.1.1-hadoop-3.2.0-aws
RUN pip install \
    notebook==6.2.0 \
    ipynb==0.5.1 \
    sparkmonitor==1.1.1 \
    pyspark==3.1.1
# Install and enable the sparkmonitor extension to monitor Spark jobs from the notebook
RUN jupyter nbextension install sparkmonitor --py --user --symlink
RUN jupyter nbextension enable sparkmonitor --py
RUN jupyter serverextension enable --py --user --debug sparkmonitor
# Load the sparkmonitor kernel extension in the default IPython profile
RUN ipython profile create && \
    echo "c.InteractiveShellApp.extensions.append('sparkmonitor.kernelextension')" >> "$(ipython profile locate default)/ipython_kernel_config.py"
# Expose the sparkmonitor listener jar to Spark (the path assumes the base image ships Python 3.8)
RUN ln -s /usr/local/lib/python3.8/site-packages/sparkmonitor/listener_2.12.jar /opt/spark/jars/listener_2.12.jar
VOLUME /home/notebook/
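# Note: the empty token disables notebook authentication; only expose port 8888 on a trusted network
CMD jupyter notebook --port=8888 --ip=0.0.0.0 --no-browser --allow-root --NotebookApp.token='' --notebook-dir=/home/notebook/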
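
One possible way to build and run this image is sketched below. The image tag `jupyter-spark:3.1.1` and the host directory `./notebooks` are assumptions chosen for illustration and do not come from the gist; the container path `/home/notebook` and port 8888 are taken from the Dockerfile above. The port is published on 127.0.0.1 only, because the notebook server starts without a token.

```shell
# Hypothetical image tag and host notebook directory; adjust both as needed.
IMAGE="jupyter-spark:3.1.1"
NOTEBOOK_DIR="$PWD/notebooks"

# Build the image from the directory that contains the Dockerfile.
build_image() {
  docker build -t "$IMAGE" .
}

# Start the notebook server with the host directory mounted at /home/notebook.
# Publishing on 127.0.0.1 keeps the unauthenticated server off the network.
run_notebook() {
  mkdir -p "$NOTEBOOK_DIR"
  docker run --rm \
    -p 127.0.0.1:8888:8888 \
    -v "$NOTEBOOK_DIR:/home/notebook" \
    "$IMAGE"
}
```

Running `build_image && run_notebook` and then opening http://localhost:8888 should bring up the notebook, assuming the base image `itayb/spark:3.1.1-hadoop-3.2.0-aws` can be pulled.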