Interview: Subash D’Souza – Big Data Day LA

Big Data Day LA, coming to Los Angeles on June 27, serves as a model for the industry: its top priority is bringing a quality, no-cost conference to the tech community. I caught up with one of the organizers, Big Data guru Subash D’Souza, to get his thoughts on this conference and what makes it a success.

R vs. Python – The Infographic

I always enjoy a good down-in-the-trenches battle story – Godzilla vs. King Kong, Klingons vs. Romulans, Yankees vs. Red Sox, etc. And now there’s one for data science aficionados – R vs. Python.

Gartner Identifies Cool Vendors in Storage Technologies for 2015

A new Gartner research report, “Cool Vendors in Storage Technologies,” details five emerging vendors that can assist organizations in meeting their storage modernization and cost containment initiatives. For example, Storiant is a multifaceted archiving platform that can support active, compliance and historical archives in a private cloud environment.

Accenture Launches Advanced Analytics Applications Platform

Accenture (NYSE: ACN) is launching the Accenture Analytics Applications Platform to develop industry- and function-specific advanced analytics applications. The applications deliver actionable insights that let users make swift, data-driven decisions, which can create a competitive advantage or improve the bottom line.

Hadoop 101: Simplifying MapReduce Development

This article explores some typical configuration roadblocks found in traditional Hadoop platforms and explains how they can be sidestepped with ScaleOut hServer, using the WordCount application to illustrate the benefits.

The Rise of the Data Lake in Support of the Industrial Internet (Part 2)

The data lake (see Part 1 of this two-part series) is a facilitator of another new movement in the big data industry – the Industrial Internet, a term coined by General Electric that refers to the integration of complex physical machinery with networked sensors and software.

Beta Release of Google Cloud Bigtable Announced

Today, Google has introduced Google Cloud Bigtable – a fully managed, high-performance, extremely scalable NoSQL database service accessible through the industry-standard, open-source Apache HBase API. Under the hood, this new service is powered by Bigtable, the same database that drives nearly all of Google’s largest applications.

AstroCompute in the Cloud Grant Program Launches

The Square Kilometre Array (SKA) Organisation and AWS are launching the AstroCompute in the Cloud grant program to accelerate the development of innovative tools and techniques for processing, storing and analyzing the global astronomy community’s vast amounts of astronomical data in the cloud.

The Rise of the Data Lake in Support of the Industrial Internet (Part 1)

Data lakes are enterprise-wide data management platforms designed for storing and analyzing vast amounts of information from disparate data sources in their native format. The idea is to place data into a data lake in their native structure instead of a repository built for a specific purpose such as a data warehouse or data mart.

DataFest Competition Brings Big Data to College Students

Students from more than 20 prestigious colleges and universities recently tried their hand at “Big Data” analysis at seven different campuses around the country during DataFest, an annual month-long data-analytics competitive event sponsored by the American Statistical Association.