Sign up for our newsletter and get the latest big data news and analysis.

The Analytics Frontier of the Hadoop Eco-System

Ted Wilkie

“The Hadoop MapReduce framework grew out of an effort to make it easy to express and parallelize simple computations that were routinely performed at Google. It wasn’t long before libraries, like Apache Mahout, were developed to enable matrix factorization, clustering, regression, and other more complex analyses on Hadoop. Now, many of these libraries and their workloads are migrating to Apache Spark because it supports a wider class of applications than MapReduce and is more appropriate for iterative algorithms, interactive processing, and streaming applications.”

Salesforce Delivers Wave, the Salesforce Analytics Cloud

SALESFORCE_LOGO_NEW

Salesforce (NYSE: CRM), a leading CRM vendor, has announced Wave, the Salesforce Analytics Cloud. Wave is the first cloud analytics platform designed for every business user, making it easier than ever for anyone to explore data, uncover new insights and take action instantly from any device.

Why The Strata + Hadoop World Conference Matters

Sundeep_RPM

In this special guest feature, Sundeep Sanghavi, explains why attending the upcoming Strata + Hadoop World Conference is important – Oct. 15-17, 2014 in NYC. Sundeep Sanghavi is the CEO and Co-Founder of DataRPM.

Adopting Big Data for Finance

Guide to Big Data Finance - Thumbnail

This article is the fourth in an editorial series that has the goal to provide direction for enterprise thought leaders on ways of leveraging big data technologies in support of analytics proficiencies designed to work more independently and effectively in today’s climate of working to increase the value of corporate data assets.

European Commission to Invest €2.5 Billion in Big Data

big

“In this talk we summarize the results of the BIG project including analysis of foundational Big Data research technologies, technology and strategy roadmaps to enable business to understand the potential of Big Data technologies across different sectors, together with the necessary collaboration and dissemination infrastructure to link technology suppliers, integrators and leading user organizations.”

Performance Optimization of Hadoop Using InfiniBand RDMA

DK Panda

“The Hadoop framework has become the most popular open-source solution for Big Data processing. Traditionally, Hadoop communication calls are implemented over sockets and do not deliver best performance on modern clusters with high-performance interconnects. This talk will examine opportunities and challenges in optimizing performance of Hadoop with Remote DMA (RDMA) support, as available with InfiniBand, RoCE (RDMA over Converged Enhanced Ethernet) and other modern interconnects.”

New Survey from GE and Accenture Finds Growing Urgency for Big Data Analytics

big-data_logo

A new global study, “Industrial Internet Insights for 2015,” from GE (NYSE: GE) and Accenture (NYSE:ACN) reveals there is a growing urgency for organizations to embrace big data analytics to advance their Industrial Internet strategy.

LexisNexis Launches HPCC Systems® Developer Contest

New-LexisNexis_logo

LexisNexis® Risk Solutions has announced its inaugural HPCC Systems Developer Contest. Developers and other technical professionals have the opportunity to demonstrate how they leveraged HPCC Systems to solve either a Big Data or Complex Query problem.

Types of In-Memory Computing

insideBIGDATA_Guide_IMC

In this installment we’ll set the stage for in-memory computing technology in terms of its current state as well as its next stage of evolution. We’ll begin with a discussion of the capabilities of in-memory databases (IMDBs) and in-memory data grids (IMDGs), and show how they differ. We’ll finish up the section by demonstrating how neither one is sufficient for a company’s strategic move to IMC; instead, we will explain why a comprehensive in-memory data platform is needed.

Predictive Modeling and Production Deployment

insideBIGDATA_Guide_PA

Using predictive analytics involves understanding and preparing the data, defining the predictive model, and following the predictive process. Predictive models can assume many shapes and sizes, depending on their complexity and the application for which they are designed. The first step is to understand what questions you are trying to answer for your organization.