#Data Mining
##Knowledge Discovery in Databases
- Types:
- Association Rules**
- Causality (Interestingness, Conviction)
- Clustering
- Classification
- Sequential Patterns
- Association Rules
#Data Mining
##Knowledge Discovery in Databases
This will be a quick guide to get you introduced with one of the most popular and effective tools used for working with big data. Apache Spark is a cluster computing platform designed to be fast and general-purpose. On the speed side, Spark extends the popular MapReduce model to efficiently suport more types of computations, including interactive queries and stream processing.
spark-notebook.py
in /home/vagrant
directory.python spark_notebook.py
. This wi