Spark Packages

The folks from Databricks have launched Spark Packages, a community site hosting modules that are not (directly) part of the Apache Spark project. At time of writing the site contains 16 packages including stuff like launch scripts (GCE, Azure, etc.), integrations (for Kafka, Avro, etc.), utils (testing, RDDs, etc.) and…

Read more

Spark 1.2 released

Today, Apache Spark 1.2 has been released with the following highlights: Improved Spark Core Spark Streaming now more or less fully available via Python as well MLLib has been improved GraphX is now stable Big congrats and thanks to the team!…

Read more

The Essential Apache Spark Cheat Sheet

DZone provides now an Apache Spark Cheat Sheet: This Refcard introduces Spark, explains its place in the big data ecosystem, walks through setup and creation of a basic Spark application, and explains commonly used actions and operations.…

Read more

Spark Tutorial University of Maryland

This is a two-and-a-half day tutorial on the distributed programming framework Apache Spark. The class will include introductions to the many Spark features, case studies from current users, best practices for deployment and tuning, future development plans, and hands-on exercises. http://lintool.github.io/SparkTutorial/…

Read more

Databricks Spark Reference Applications

Reference Applications demonstrating Apache Spark - brought to you by Databricks: http://databricks.gitbooks.io/databricks-spark-reference-applications/…

Read more

Spark Panel Discussion with Cloudera, MapR & Pivotal

The Los Angeles Spark Users Group recently hosted a panel discussion on Spark, featuring respresentatives of three Big Data vendors: http://inside-bigdata.com/2014/10/08/spark-panel-discussion-cloudera-mapr-pivotal/…

Read more

Next-generation web analytics processing

Folks from Adobe Research made a prototype Spark-based web analytics query engine called Spindle available via GitHub.…

Read more

Spark Summit 2014 training material

The Spark Summit 2014 training material is now available. The course material (2GB in total!) includes slide decks, videos and more—an invaluable source for uptraining in the Spark space!…

Read more

Seattle Spark Meetup Roundup

The most recent Seattle Spark Meetup covered topics as wide as: the Spark Summit 2014, xPatterns, Mesos, Tachyon, IPython Notebox and Machine Learning.…

Read more
Proudly published with Ghost