Understanding Intention – Using Content, Context, and the Crowd to Build Better Search Applications

This white paper by enterprise search specialists Lucidworks points out that, unlike consumer search, which has become a seamless part of our everyday lives, enterprise search might as well still be running Windows 95. Imagine if Amazon, Google, or Facebook treated every user the same, regardless of who they are, where they are, what they’re searching for, and what they’ve clicked. Your users expect the sophistication of consumer search in their enterprise apps.

The Data Scientist’s Guide to Apache Spark™

For data scientists looking to apply Apache Spark’s advanced analytics techniques and deep learning models at scale, Databricks is happy to provide The Data Scientist’s Guide to Apache Spark. Download this eBook to: Learn the fundamentals of advanced analytics and receive a crash course in machine learning. Get a deep dive on MLlib, the primary […]

7 Steps for a Developer to Learn Apache Spark

This ebook introduces steps for a developer to understand Spark at a deeper level and speaks to Spark 2.x’s three themes: easier, faster, and smarter. Whether you’re getting started with Spark or are already an accomplished developer, this ebook will arm you with the knowledge to employ all of Spark 2.x’s benefits. To learn more, download 7 Steps for a Developer to Learn Apache Spark.

The Definitive Guide to Evaluating Cloud-based Apache Spark Platforms

This guide is designed to help you focus on your overall company goals. Do you want to build and manage your own Spark environment or leverage the best possible choice on the market? Find a solution you can use as an effective tool for the real work of getting business value from big data analytics. To learn more, download this definitive guide to evaluating cloud-based Apache Spark platforms.

Apache Spark Survey 2016 Report

More than 1,600 members of the Apache Spark community from over 900 organizations have spoken, and Spark continues to be the most active open-source project in the big data space today. The 2016 Databricks Apache Spark Survey shows a rise in production deployments of Spark in the public cloud, as well as an increased usage […]

InsideBIGDATA: An Insider’s Guide to Apache Spark

Apache Spark is an open source cluster computing framework originally developed in 2009 at the AMPLab at the University of California, Berkeley, and donated in 2013 to the Apache Software Foundation, where it remains today. Spark allows for quick analysis and model development, and it provides access to the full data set, avoiding the need to subsample, as is often required in environments like R.