Sign up for our newsletter and get the latest big data news and analysis.

Data Science 101: What’s Coming for Spark in 2015

Spark_logo_feature

Apache Spark took the data science world by storm in 2014 as a technology foundation for big data applications. In the talk below from the Bay Area Spark User Meetup, Patrick Wendell from Databricks speaks about new developments in Spark and identifies areas of focus in the coming year.

Statistics is the Fastest Growing Undergraduate STEM Degree

statistics-logo

Statistics—the science of learning from data—is the fastest-growing science, technology, engineering and math (STEM) undergraduate degree in the United States over the last four years, an analysis of federal government education data conducted by the American Statistical Association (ASA) revealed.

Real-time Object Recognition Using Deep Learning

deeplearning

Here is a status report on some exciting new technology from our friends at NVIDIA. CEO Jen-Hsun Huang showcases the computer vision capabilities of the NVIDIA DRIVE PX deep neural network computer vision auto-pilot computer at the company’s press event kicking off CES 2015.

GPU Accelerated Platforms for Deep Learning

sumit

“NVIDIA will present an update on accelerated computing, in particular, the latest de- velopments in the platform. They will touch upon NVLink, OpenPOWER, ARM64, and new software updates and also cover the broad-sweeping impact that a new field of machine learning, called Deep Learning, is having on applications and domains.”

MapR Unveils Free Hadoop On-Demand Training Program

MapR Logo - New 2014_FEATURE

MapR Technologies, Inc., provider of a leading distribution for Apache™ Hadoop®, announced the availability of free Hadoop On-Demand Training for developers, analysts and administrators.

Data Science 101: Machine Learning – The Basics

The next installment of insideBIGDATA’s Data Science 101 series comes from our friends over at LinkedIn.

The C-Suite Will Demand More Real Estate Data In 2015

BigData_real_estate

More than 95 percent of companies have a formal data and analytics strategy in place with many favoring product development, IT and marketing over corporate real estate—until now. A new, independent study conducted by Forrester Consulting, commissioned by JLL, says that 75 percent of firms see corporate real estate information as a core part of a wider corporate data and analytics strategy.

Data Science 101: Random Forests

Machine_Learning

The Random forests machine learning algorithm is a popular ensemble method used by many data scientists to achieve good predictive performance in the classification regime. Fully understanding the nuances of this statistical learning technique is paramount to getting the most out of this algorithm – unfortunately, this means math. The presentation below is from machine learning course CPSC 540 at The University of British Columbia,

G2 Crowd Publishes Winter 2015 Rankings for Digital Analytics Platforms

Twitter_analytics

The first Grid℠ report for digital analytics platforms, published by business software review site G2 Crowd, ranks 12 products to help purchasers in their selections. The Winter 2015 report is based on more than 420 reviews written by business professionals.

How Enterprises Really Feel About Big Data In The Cloud

Prat_Moghe_pic

In this special guest feature, Prat Moghe of Cazena gives his thoughts about doing big data in the cloud. Prat Moghe is CEO & Founder of Cazena, a company with a mission statement to make gathering information easier for Fortune 2000 companies who want to analyze it.