Tiago tiagotele

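#!/bin/bash
# Abort on the first failed command so "Done!" is only printed after a successful download.
set -euo pipefail
SPARK_VERSION='spark-3.1.2'
# Spark build without bundled Hadoop jars, fetched from the Apache archive.
SPARK_URL="https://archive.apache.org/dist/spark/$SPARK_VERSION/$SPARK_VERSION-bin-without-hadoop.tgz"
echo "Downloading pre-built PySpark..."
# -P sets the directory the file is saved into.
wget "$SPARK_URL" -P /tmp
echo "Done!"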
# Unpack PySpark
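#!/bin/bash
# Abort on the first failed command so "Done!" is only printed after a successful download.
set -euo pipefail
HADOOP_VERSION='3.3.1'
# Older releases such as 3.3.1 are served from archive.apache.org; dlcdn.apache.org only carries current releases.
HADOOP_URL="https://archive.apache.org/dist/hadoop/common/hadoop-$HADOOP_VERSION/hadoop-$HADOOP_VERSION.tar.gz"
echo "Downloading Hadoop jars..."
# -P sets the directory the file is saved into.
wget "$HADOOP_URL" -P /tmp
echo "Done!"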
# Unpack Hadoop
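
Both scripts stop at an "Unpack" comment and never show the extraction step. The following is a minimal sketch of one way that step could look, not the author's actual code. The helper name `unpack_archive` and the `$HOME/opt` destination in the commented examples are assumptions chosen for illustration; only the `/tmp` download location and the archive names come from the scripts above.

```shell
#!/bin/bash
set -euo pipefail

# Hypothetical helper (not part of the original scripts): extract a
# gzip-compressed tarball into a destination directory, creating the
# directory first if it does not exist.
unpack_archive() {
  local archive="$1"
  local dest="$2"
  if [ ! -f "$archive" ]; then
    echo "Archive not found: $archive" >&2
    return 1
  fi
  mkdir -p "$dest"
  # -x extract, -z gunzip, -f archive file, -C change into the destination first.
  tar -xzf "$archive" -C "$dest"
}

# Example calls, assuming the downloads landed in /tmp and that
# "$HOME/opt" is the desired install location (an assumption):
# unpack_archive /tmp/spark-3.1.2-bin-without-hadoop.tgz "$HOME/opt"
# unpack_archive /tmp/hadoop-3.3.1.tar.gz "$HOME/opt"
```

Checking that the archive exists before calling `tar` gives a clearer error message when the download step was skipped or failed.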