ganeshchand/big-data-spark-notes.md

## big-data-spark-notes.md

      
# Big Data and Spark Notes
Blogs:

- http://hortonworks.com/blog/introduction-to-data-science-with-apache-spark/
- https://engineering.linkedin.com/distributed-systems/log-what-every-software-engineer-should-know-about-real-time-datas-unifying
- https://databricks.com/blog/2015/03/20/using-mongodb-with-spark.html
- https://github.com/mongodb/mongo-hadoop/wiki/Spark-Usage
- https://github.com/chimpler/blog-spark-streaming-log-aggregation
- https://chimpler.wordpress.com/2014/07/01/implementing-a-real-time-data-pipeline-with-spark-streaming/
- http://www.slideshare.net/Hadoop_Summit/building-a-unified-data-pipeline-in-apache-spark


## MongoDB & Spark

- How to Install MongoDB on Mac: https://www.youtube.com/watch?v=iT0datgVcfs
- https://www.mongodb.com/blog/post/using-mongodb-hadoop-spark-part-1-introduction-setup
- https://www.mongodb.com/blog/post/using-mongodb-hadoop-spark-part-2-hive-example
- https://www.mongodb.com/blog/post/using-mongodb-hadoop-spark-part-3-spark-example-key-takeaways