Sign up for our newsletter and get the latest big data news and analysis.

Data Exploration with Databricks

The “Data Exploration on Databricks” jump start video below will show you how go from data source to visualization in a few easy steps. Specifically, you’ll see how to take semi-structured logs, easily extract and transform them, analyze and visualize the data using Spark SQL, so you can quickly understand your data.

Spark MLlib: Making Practical Machine Learning Easy and Scalable

In this talk, Xiangrui Meng of Databricks shares his experience in developing MLlib. The talk covers both higher-level APIs, ML pipelines, that make MLlib easy to use, as well as lower-level optimizations that make MLlib scale to massive data sets.

Advanced Apache Spark

Big data is going Spark crazy! Here’s a whopping 6 hour intensive, fast-paced and vendor agnostic look at Spark Core presented by Sameer Farooqui, a client services engineer at Databricks.

Apache Spark is the Smartphone of Big Data

In this special guest feature, Denny Lee of Databricks, talks about the versatility of Spark – essentially comparing it to the Swiss Army Knife of on your camping tri​p, called​ Big Data/Analytics.

Apache Spark Outgrowing Hadoop as Users Increasingly Move to the Cloud

Databricks, the company founded by the creators of Apache Spark, released the findings of a survey of more than 1,400 respondents from the Spark community to identify how organizations and users are utilizing the data analytics and processing engine.

Databricks Launches New Features to Bring Apache Spark to More Enterprise Users

Databricks, the company founded by the creators of Apache Spark, launched major enhancements to its cloud-based platform with the 2.0 release.

Databricks Announces General Availability of Its Cloud Platform

Databricks, the company behind Apache Spark, today announced the general availability of its cloud-hosted data platform (formerly known as Databricks Cloud).

5 Reasons Data Analytics in the Cloud Will Take Center Stage in 2015

In this special guest feature, Dave Wang of Databricks enumerates the main reasons that data analytics in the cloud are becoming a top priority for enterprises in 2015.

Tresata Sparks Anti Money Laundering Revolution With Databricks Cloud

Databricks — the company founded by the creators of the popular open-source big data processing engine Apache Spark with its flagship product, Databricks Cloud — and Tresata Inc., a provider of Hadoop-powered predictive analytics software, announced a joint solution that combines Databricks Cloud with Tresata TEAK, a predictive, real-time Anti Money Laundering (AML) solution.

Databricks Announces ‘Jobs’ Feature for Databricks Cloud

Databricks — the company founded by the creators of the popular open-source big data processing engine Apache Spark with its flagship product, Databricks Cloud — introduced “Jobs,” a feature for Databricks Cloud at the inaugural Spark Summit East.