#Big Data and Spark Notes
blogs:
- http://hortonworks.com/blog/introduction-to-data-science-with-apache-spark/
- https://engineering.linkedin.com/distributed-systems/log-what-every-software-engineer-should-know-about-real-time-datas-unifying
- https://databricks.com/blog/2015/03/20/using-mongodb-with-spark.html
- https://github.com/mongodb/mongo-hadoop/wiki/Spark-Usage
- https://github.com/chimpler/blog-spark-streaming-log-aggregation
- https://chimpler.wordpress.com/2014/07/01/implementing-a-real-time-data-pipeline-with-spark-streaming/
- http://www.slideshare.net/Hadoop_Summit/building-a-unified-data-pipeline-in-apache-spark
##MongoDB & Spark
- How to Install MongoDB on Mac - https://www.youtube.com/watch?v=iT0datgVcfs
- https://www.mongodb.com/blog/post/using-mongodb-hadoop-spark-part-1-introduction-setup
- https://www.mongodb.com/blog/post/using-mongodb-hadoop-spark-part-2-hive-example
- https://www.mongodb.com/blog/post/using-mongodb-hadoop-spark-part-3-spark-example-key-takeaways