@jaceklaskowski
Last active March 24, 2017 11:43
Spark Summit West 2016 Sparked My Interest -- Spark Summit West 2016 in San Francisco (to review at the earliest convenience)

Things that Sparked My Interest

Agenda

Links to Review

Grouped by topic

Apache Spark 2.0

  1. (video) Apache Spark 2.0 by Matei Zaharia (Databricks)

Deep Learning / Machine Learning

  1. Spark Summit keynote explores structured streaming, innovation in deep learning | #SparkSummit
  2. (video) AI: The New Electricity by Andrew Ng (Baidu)

Kafka

  1. spark-kafka-writer

Others

  1. (video) Sparking the Intelligent Cloud by Joseph Sirosh (Microsoft)
  2. (video) Big Data in Production — Lessons from Running in the Cloud by Marvin Theimer (Amazon)
  3. What’s old is new again: Combining old and new data to increase value | #SparkSummit
  4. (video) Disrupting Big Data with Apache Spark in the Cloud by Ali Ghodsi (Databricks)
  5. SPARK-ON-HBASE: DATAFRAME BASED HBASE CONNECTOR
  6. Spark Summit: Databricks’ Community Edition, Splice Machine moves to open source, and Hortonworks’ Spark-HBase Connector
  7. Examining Spark 2.0 with Reynold Xin
  8. Microsoft, MapR announce new Apache Spark-based releases
  9. Microsoft eyes spiking industry for data science | #SparkSummit
  10. silex -- something to help you spark
  11. SparkSummit, Spark 2.0, Data Fellas and more
  12. Databricks Community Edition is now Generally Available -- A free learning platform for Apache Spark
  13. Another Record-Setting Spark Summit Event Brings Together 2500+ Apache Spark Users to San Francisco
  14. Spark Summit keynote: Combating gaps in real-time analytics | #SparkSummit
  15. Will Spark’s continuous app innovation eclipse other data tools? | #SparkSummit
  16. Spark Summit 2016 Demo
  17. A look into the future of Apache Spark 2.0
  18. GPU Support in Spark and GPU/CPU Mixed Resource Scheduling at Production Scale
  19. A scalable machine learning library on Apache Spark from LinkedIn
  20. Open Sourcing Photon ML - LinkedIn’s Scalable Machine Learning Library for Spark
  21. CaffeOnSpark
  22. Redis Labs Empowers Real time Big Data Insights for Apache Spark Users
  23. 3 Emerging Open Source Data Analytics Tools Beyond Apache Spark
  24. SNOWFLAKE SPARKLES WITH NATIVE APACHE™ SPARK CONNECTOR
  25. Accelerate Your Analytics with the New MapR Platform including Spark
  26. Microsoft expands its commitment to Apache Spark big-data framework
  27. Microsoft announces major commitment to Apache Spark
  28. #SparkSummit West 2016 preview: The power of 2.0
  29. Announcing the New Couchbase Spark Connector
  30. Apache Spark for Azure HDInsight now generally available
  31. Livy is an open source REST interface for interacting with Apache Spark from anywhere
  32. http://livy.io/
  33. Spark Connector 1.2
  34. Apache Spark Integrated with Jupyter and Spark Job Server
  35. Simplifying streaming with Spark 2.0: The easier way to continuous applications | #SparkSummit
  36. Introducing the Neo4j 3.0 Apache Spark Connector
  37. MCSD - MapR Certified Spark Developer
  38. Building Realtime Data Pipelines with Kafka Connect and Spark Streaming (video) + slides
  39. Spark Salesforce Connector Tutorial Using JDBC
  40. CDAP – Taking Spark Apps from Prototype to Production
  41. Don’t leave data behind: Spark as a translator for data | #SparkSummit
  42. Get started: Create Apache Spark cluster on HDInsight Linux and run interactive queries using Spark SQL
  43. GPUEnabler -- Provides GPU awareness to Spark

Not necessarily Spark (yet closely related)

  1. Parallel builds in Maven 3
  2. docker-squash -- Squash docker images to make them smaller