Sign up for our newsletter and get the latest big data news and analysis.

Looking at Spark from a Hadoop Lens


This article is the third in a series that explores a high-level view of how and why many companies are deploying Apache Spark as a solution for their big data technology requirements.

Big Data Humor: Do You Have a Hiring Advantage?


Only a data scientist can predict the unpredictable!   Download the insideBIGDATA Guide to Scientific Research

Spark MLlib: Making Practical Machine Learning Easy and Scalable


In this talk, Xiangrui Meng of Databricks shares his experience in developing MLlib. The talk covers both higher-level APIs, ML pipelines, that make MLlib easy to use, as well as lower-level optimizations that make MLlib scale to massive data sets.

Updated WANdisco Fusion Platform Offers Hybrid Cloud, Active Back-up for Enterprises


WANdisco, (LSE: WAND) a leading provider of continuous-availability software for global enterprises to meet the challenges of Big Data, announced major updates to its flagship WANdisco Fusion Platform.

Splice Machine Announces Version 2.0 of its RDBMS: A Hybrid In-Memory Architecture Powered by Hadoop and Spark


Splice Machine announced the 2.0 version of its RDBMS, a hybrid in-memory RDBMS powered by Hadoop and Spark. Splice Machine’s version 2.0 delivers a database solution that incorporates the proven scalability of Hadoop, ANSI SQL, ACID transactions, and the in-memory performance of Spark.

Datameer Selected by to Optimize Customer Website Experience with Big Data Analytics


In an effort to improve its overall customer experience,, a leader in online and mobile travel, has selected Datameer’s self-service big data analytics solution to better understand customer behavior, analyze offer effectiveness, and improve operational processes.

Argyle Data Outlines Five Key 2016 Predictions for Native Hadoop Applications


Vikash Varma, President and CEO of Argyle Data, a leader in native Hadoop applications for threat analytics in mobile communications, has provided a perspective on the company’s momentum and industry outlook for 2016. Varma’s predictions for substantially accelerated growth are fueled by the growing need for data-driven applications and fraud analytics for the mobile communications industry that run natively on Hadoop.

Why is Apache Spark So Hot?


An Insider’s Guide to Apache Spark is a useful new resource directed toward enterprise thought leaders who wish to gain strategic insights into this exciting new computing framework. As one of the most exciting and widely adopted open-source projects, Apache Spark in-memory clusters are driving new opportunities for application development as well as increased intake of IT infrastructure. This article is the second in a series that explores a high-level view of how and why many companies are deploying Apache Spark as a solution for their big data technology requirements.

Advanced Apache Spark


Big data is going Spark crazy! Here’s a whopping 6 hour intensive, fast-paced and vendor agnostic look at Spark Core presented by Sameer Farooqui, a client services engineer at Databricks.

Dutch Utility Deploys Software Defined Power Plant by AutoGrid Systems


AutoGrid Systems, a leader in big data analytics for the electricity and energy industry, today announced that Eneco Group, the leading Dutch sustainable energy group dedicated to helping its clients to save, use, exchange or sell energy, which serves more than 2 million residential and business users, has selected its Predictive Controls™ technology and its flagship application, DROMS, to build and deploy the industry’s first Software Defined Power Plant™.