Research Firm Advises Analytics Stakeholders and Security Professionals to Build Plans for Securing Hadoop-based Assets

Dataguise, a technology leader in secure business execution, announced its inclusion in a Gartner report titled “Rethink and Extend Data Security Policies to Include Hadoop.” The report provides best practices for addressing data security concerns related to Apache Hadoop deployments and highlights several leading vendors in the category that support these efforts.

Monte Carlo Simulations in Ad-Lift Measurement Using Spark

In this talk from Spark Summit East 2016, Prasad Chalasani explores some of the challenges that arise in setting up scalable simulations in a specific application, and shares some solutions and lessons learned along the way, in the realms of mathematics and programming.

Interview: Natalia Hernandez, Data Scientist at Foodpairing

I recently caught up with Natalia Hernandez, Data Scientist at Foodpairing, to discuss how her company’s data scientists mine public online data for general trend insights and use consumer intelligence and molecular analysis of ingredients to forecast the next big flavors in the food industry.

Splice Machine’s New OLAP Engine Adds Columnar Storage and In-Memory Caching to its Hybrid Relational Data Platform

Splice Machine, provider of the open-source SQL RDBMS powered by Apache Hadoop® and Apache Spark™, announced the release of version 2.5 of its industry-leading data platform for intelligent applications. The new version strengthens its ability to concurrently run enterprise-scale transactional and analytical workloads, frequently referred to as HTAP (Hybrid Transactional and Analytical Processing).

Hortonworks Advances Cloud Strategy with Availability of Hortonworks Data Cloud for Amazon Web Services

Hortonworks, Inc. ® (NASDAQ: HDP), a leading innovator of open and connected data platforms, announced the availability of Hortonworks Data Cloud on the Amazon Web Services (AWS) Cloud. Hortonworks Data Cloud for AWS enables users to harness the agility and elasticity of Apache® Hadoop™ and Apache® Spark™ in the cloud for powering new workloads and analytic applications. The new cloud service, powered by open source, delivers the most popular enterprise-grade capabilities of Hortonworks Data Platform (HDP®) with both hourly and annual billing options available on the AWS Marketplace.

AtScale 5.0 – Modern Business Intelligence Platform, Enables BI on Hadoop, On Premises and Cloud

AtScale, a company that provides enterprises with a fast and secure self-service BI platform for Big Data, announced a significant expansion of its services, from BI on Hadoop to BI on Big Data. With this announcement, the company introduces a Modern BI Platform that enables businesses to work seamlessly across all of Big Data, on premises and in the cloud. In addition to Hadoop, the AtScale platform now supports Teradata data warehouses, Google Dataproc, and BigQuery. This expands on the company’s existing support for Microsoft Azure and HDInsight.

ODPi Publishes Operations Specification Providing Developers Consistency Across Application Management Tools

ODPi, a nonprofit organization accelerating the open ecosystem of big data solutions, announced the availability of ODPi 2.0, which includes the first release of the ODPi Operations Specification and the Runtime Specification 2.0, to standardize the development model for big data solution and application providers and help enterprises improve installation and management of Hadoop-based applications.

MapR Event Recap: It’s All About Digital Transformation

Last week saw two compelling local big data events here in SoCal, both sponsored by MapR. I thought I’d provide a short recap of the events for those who were unable to attend. I was on a panel for the first event, “Digital Transformation in Big Data,” and the discussion revolved around MapR’s unique vision for the “3 Keys to Digital Transformation.” These points are described in detail in a recent blog post.

Manage Deploys MemSQL for Real-Time Analytics

MemSQL, provider of the database platform for real-time analytics, revealed that Manage, a leader in programmatic mobile marketing and advertising, has deployed MemSQL to power real-time analytics. Manage helps enterprises like Uber, Wish, and Amazon manage global mobile marketing campaigns by buying real-time programmatic inventory to drive the most engaging users to mobile applications.

Interview: Gwen Shapira, System Architect at Confluent

I recently caught up with Gwen Shapira, System Architect at Confluent, to talk about the market dynamics of new “fast data” technologies and what is driving their rapid adoption across large companies.