Skip to content

Instantly share code, notes, and snippets.

View kranthi1128's full-sized avatar

Kranthi Reddy kranthi1128

View GitHub Profile
-----------Submitting your own Spark Application--------------
* create directory
- run $ mkdir myspark
- run $ cd myspark
* Our application depends on the Spark API, so we’ll also include an sbt configuration file, simple.sbt which explains that Spark is a dependency.
- run $ vim simple.sbt
* In editor
name := "Word Count"
---------Installing Spark Standalone---------
Prerequistes :
* Install java
Set JAVA_HOME in environment variables
* passwordless SSH between master and worker nodes
- $ ssh-keygen -- enter
- Copy the SSH Public Key (id_rsa.pub) to the root account on your target hosts.
.ssh/id_rsa .ssh/id_rsa.pub
- Add the SSH Public Key to the authorized_keys file on your target hosts.
--------- Installing sbt on linux------------
curl https://bintray.com/sbt/rpm/rpm | sudo tee /etc/yum.repos.d/bintray-sbt-rpm.repo
sudo yum install sbt