Sign up for our newsletter and get the latest big data news and analysis.

7 Steps for a Developer to Learn Apache Spark

This ebook introduces steps for a developer to understand Spark, at a deeper level, and speaks to the Spark 2.x’s three themes— easier, faster, and smarter. Whether you’re getting started with Spark or already an accomplished developer, this ebook will arm you with the knowledge to employ all of Spark 2.x’s benefits. To learn more download: 7 Steps for a Developer to Learn Apache Spark

The Definitive Guide to Evaluating Cloud-based Apache Spark Platforms

This guide is designed to help you focus on your overall company goals. Do you want to build and manage your own Spark environment or leverage the best possible choice on the market? Find a solution you can use as an effective tool for the real work of getting business value from big data analytics. To learn more download this definitive guide to evaluating cloud-based Apache Spark platforms.

Apache Spark Survey 2016 Report

More than 1,600 members of the Apache Spark community from over 900 organizations have spoken, and Spark continues to be the most active open-source project in the big data space today. The 2016 Databricks Apache Spark Survey shows a rise in production deployments of Spark in the public cloud, as well as an increased usage […]

InsideBIGDATA: An Insider’s Guide to Apache Spark

Apache Spark is an open source cluster computing framework originally developed in 2009 at the AMPLab at University of California, Berkeley but was later donated in 2013 to the Apache Software Foundation where it remains today. Spark allows for quick analysis and model development, plus it provides access to the full data set thus avoiding the need to subsample, as often needed in environments like R.