Spark Packages

The folks from Databricks have launched Spark Packages, a community site hosting modules that are not (directly) part of the Apache Spark project. At time of writing the site contains 16 packages including stuff like launch scripts (GCE, Azure, etc.), integrations (for Kafka, Avro, etc.), utils (testing, RDDs, etc.) and…

Read more

Spark Tutorial University of Maryland

This is a two-and-a-half day tutorial on the distributed programming framework Apache Spark. The class will include introductions to the many Spark features, case studies from current users, best practices for deployment and tuning, future development plans, and hands-on exercises. http://lintool.github.io/SparkTutorial/…

Read more

Databricks Spark Reference Applications

Reference Applications demonstrating Apache Spark - brought to you by Databricks: http://databricks.gitbooks.io/databricks-spark-reference-applications/…

Read more

Spark in the cloud

Databricks announced at the recent Spark Summit 2014 a new way to use Spark: Databricks Cloud, a fully managed Spark-as-a-Service offering including metrics and analytics dashboards. You can now sign up for the beta!…

Read more

Spark SQL performance news

The Databricks team has put together a preview on upcoming performance improvements in Spark SQL.…

Read more
Proudly published with Ghost