Rakesh Nagar codspire
  • Washington D.C.
codspire / running-flink-locally-on-windows-10.md
Last active January 4, 2024 12:42
Running Flink Locally on Windows 10
codspire / spark-standalone-pyspark-on-ec2.md
Last active June 9, 2020 16:17
Running Standalone Spark, PySpark on EC2
codspire / getting-started-with-superset-airbnb-data-exploration-platform.md
Last active February 12, 2024 21:41
Getting Started With Superset: Airbnb’s data exploration platform

Update Python and PIP versions on EC2 (Amazon AMI)

At the time of writing, Python v3.5 and PIP v9.0.1 were available on AWS EC2.

sudo yum update -y
sudo yum install python35 -y
codspire / making-zeppelin-work-on-windows.md
Last active December 2, 2021 03:54
Making Zeppelin, Spark, pyspark work on Windows

Zeppelin, Spark, PySpark Setup on Windows (10)

I wish running Zeppelin on Windows wasn't as hard as it is. Things go haywire if you already have Spark installed on your computer. Zeppelin's embedded Spark interpreter does not work well with an existing Spark installation, and you may need to perform the steps below (hacks!) to make it work. I am hoping that these will be fixed in newer Zeppelin versions.

If you try to run Zeppelin after extracting the package, you might encounter "The filename, directory name, or volume label syntax is incorrect."

A Google search led me to https://issues.apache.org/jira/browse/ZEPPELIN-1584. The issue was helpful but wasn't enough to get Zeppelin working.

Below is what I had to do to make it work on my Windows 10 computer.

codspire / PythonCLIOnGitBash.md
Last active June 12, 2017 00:56
Make Python CLI interpreter work on GitBash

Add the lines below to your ~/.bashrc file

alias python="winpty python.exe"

Restart the console or run source ~/.bashrc
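
If you set this up on several machines, the edit to ~/.bashrc can be scripted so that the alias is not appended twice. The following is only a sketch: `add_line_once` is a hypothetical helper name, not something provided by Git Bash or winpty, and it assumes the target file ends with a newline.

```shell
#!/bin/bash
# Hypothetical helper (not part of Git Bash or winpty):
# append a line to a file only if that exact line is not already present.
add_line_once() {
  local line="$1" file="$2"
  touch "$file"                       # create the file if it does not exist yet
  # -x matches whole lines, -F treats the text literally, -q suppresses output
  grep -qxF -- "$line" "$file" || printf '%s\n' "$line" >> "$file"
}

# Example call (commented out so that sourcing this sketch changes nothing):
# add_line_once 'alias python="winpty python.exe"' "$HOME/.bashrc"
# source "$HOME/.bashrc"
```

Running the example call a second time leaves ~/.bashrc unchanged, because the exact alias line is already there.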

Access EC2 from behind a firewall that blocks port 22

If your corporate firewall blocks port 22 and therefore does not allow ssh to EC2 instances, you can use the workaround below as a temporary way to ssh to them.

Essentially, you need to change the sshd configuration to use port 80, which most firewalls leave open for HTTP access.
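
As a rough illustration of the sshd change described above (not necessarily the exact script used in this gist), the port can be switched by rewriting the `Port` directive in the sshd configuration file. `set_sshd_port` is a hypothetical helper name, the `/etc/ssh/sshd_config` path and the `service sshd restart` command are assumptions about a typical Amazon Linux instance, and `sed -i -E` assumes GNU sed.

```shell
#!/bin/bash
# Hypothetical helper: point the Port directive of an sshd config file at a new port.
# Usage: set_sshd_port <config-file> <port>
set_sshd_port() {
  local config_file="$1" port="$2"
  if grep -qE '^[#[:space:]]*Port[[:space:]]+[0-9]+' "$config_file"; then
    # Replace an existing (possibly commented-out) Port line, e.g. "#Port 22"
    sed -i -E "s/^[#[:space:]]*Port[[:space:]]+[0-9]+.*/Port ${port}/" "$config_file"
  else
    # No Port line found: append one
    printf 'Port %s\n' "$port" >> "$config_file"
  fi
}

# Example (assumed path and service name; needs root, so it is left commented out):
# set_sshd_port /etc/ssh/sshd_config 80 && service sshd restart
```

The instance's security group must also allow inbound traffic on port 80, and you then connect with `ssh -p 80`.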

Step 1: Launch a new EC2 instance

Step 2: In the "3. Configure Instance" step, expand the "Advanced Details" section and enter the code below