Sign up for our newsletter and get the latest big data news and analysis.

7 Steps for a Developer to Learn Apache Spark

This ebook introduces steps for a developer to understand Spark, at a deeper level, and speaks to the Spark 2.x’s three themes— easier, faster, and smarter. Whether you’re getting started with Spark or already an accomplished developer, this ebook will arm you with the knowledge to employ all of Spark 2.x’s benefits. To learn more download: 7 Steps for a Developer to Learn Apache Spark

Configuration for Big-Data-as-a-Service

This white paper describes a new solution for Big-Data-as-a-Service combining the BlueData EPIC (Elastic Private Instant Clusters) software platform with the HPE Elastic Platform for Big Data Analytics (EPA). BlueData is transforming how enterprises deploy their Big Data applications and infrastructure. To learn more about Big-Data-as-a-Service download this white paper.

Bare-Metal Performance for Big Data Workloads on Docker Containers

In a benchmark study, Intel compared the performance of Big Data workloads running on a bare-metal deployment versus running in Docker containers with the BlueData EPIC software platform. The study found that it is possible to run Big Data workloads in a container-based environment without sacrificing performance. The benefits include agility, flexibility, and cost efficiency. Data science teams can get on-demand Hadoop and Spark clusters, while leveraging enterprise-grade security in a multi-tenant architecture. Get the white paper to learn about this breakthrough benchmark study.

Interactive Analytics, Visualization and Data Modeling on large Hadoop Data Sets

The vision of Hadoop as more than a data store is finally a reality. Thanks to advances from SQL query engines like Spark SQL, Impala, Presto and Hive on Tez, big data technologies are now accessible for business analytics. With Looker, analysts can build a data model across all their data in Hadoop – easily transforming raw data into meaningful metrics and finally allowing business teams to access and explore years of stored data in Hadoop data sets.

Overcome analytic challenges on Hadoop data:

Get more from your Hadoop cluster by analyzing the data where it sits
No need to move or transform the data prior to performing analysis
Interact with your data through a familiar language – SQL on Hadoop
Create a single source of truth for your enterprise that’s governed by a data model
To learn more download this white paper.

How To Scale Business Intelligence With Hadoop- Based Platforms

Analyzing big data poses multiple challenges. Highly parallel distributed data architecture is one solution, but until recently it has been mostly limited to databases, not business intelligence (BI) application servers. In this report, application development and delivery (AD&D) pros working on BI initiatives will learn about the capabilities of distributed BI platforms mostly based on […]

Big Data Cybersecurity Analytics Research Report Ponemon Institute

In this report, Ponemon Institute is pleased to present the findings of Big Data Cybersecurity Analytics, sponsored by Cloudera.

SQL Engine Leads the Heard

In almost every organization, SQL is at the heart of enterprise data used in transactional systems, data warehouses, columnar databases and analytics platforms to name just a few examples. Additionally, a vast number of commercial and in-house developed tools used to access, manipulate and visualize data rely on SQL. SQL is lifeblood of the modern transaction and decision support systems.

TDWI Hadoop Readiness Guide

An organization’s readiness for Hadoop is not a single state held by a single entity. Corporations, government agencies, educational institutions, healthcare providers, and other types of organizations are complex in that they have multiple departments, lines of business, and teams for various business and technology functions. Each function can be at a different state of readiness for Hadoop, and each function can affect the success or failure of Hadoop programs.

Presto: Open Source for the Enterprise

Presto addresses a real need for a portable SQL on Hadoop tool. It is architected from the ground up for high performance interactive query processing. Open source is a fount of continual innovation, especially with regard to big data. In addition, there are strong tools that come with specific Hadoop distributions. The fact is that organizations will deploy multiple tools. For organizations moving toward a Unified Data Architecture, the rationale for adopting Presto is even stronger.

Hadoop Appliances and Teradata

A White paper by Philip Howard, Bloor Research International Ltd on critical considerations for Hadoop deployments and the role of appliances.