Sign up for our newsletter and get the latest big data news and analysis.

Ask a Data Scientist: Data Leakage

datascientist2_featured

Welcome back to our series of articles sponsored by Intel – “Ask a Data Scientist.” Once a week you’ll see reader submitted questions of varying levels of technical detail answered by a practicing data scientist – sometimes by me and other times by an Intel data scientist. This week’s question is from a reader who asks for an explanation of data leakage.

NTT Comware Deploys MapR to Power Hadoop-as-a-Service for SmartCloud®

BigData_2015

MapR Technologies, Inc., provider of a leading distribution for Apache™ Hadoop®, has announced that NTT Comware is using the MapR Distribution including Hadoop to power its new SmartCloud service. Launched earlier this month for customers in Japan, SmartCloud provides Hadoop-as-a-service to leverage its big data processing infrastructure running in the cloud.

Visualization of the Week: Dueling Measures of Poverty

Visualization_poverty

This new interactive visualization tells a compelling economic story based on U.S. Census Bureau data measuring the reach of poverty in America.

Big Data Humor: Making Your Data Say Uncle

Humor_torture_data

Torturing your data is NEVER worth it … well maybe sometimes!     Sign up for the free insideBIGDATA newsletter.

Ask a Data Scientist: Unsupervised Learning

Dr. Andrew W. Wicker, Data Scientist,  Intel Corporation

Welcome back to the “Ask a Data Scientist” article series. This week’s question is from a reader who asks for an overview of unsupervised machine learning.

Interview: Spencer Greenberg, Chairman, Rebellion Research

Spencer_Greenberg

In the interview below, Rebellion Research’s Chairman Spencer Greenberg discusses how he feels his company is well-positioned for bringing machine learning and AI based asset management to investors.

Ask a Data Scientist: The Data Science Process

datascientist2_featured

Welcome back to our series of articles sponsored by Intel – “Ask a Data Scientist.” This week’s question is from a reader who wonders if there is a general process for conducting data science projects.

Ask a Data Scientist: The Importance of Exploratory Data Analysis

Q: What is the role of exploratory data analysis in data science?

Ask a Data Scientist: Handling Missing Data

How do you handle missing data? What imputation techniques do you recommend?

Big Data Humor: The Art of Statistics

Humor_statistics

A basic stats class at Coursera goes a LONG way!   Sign up for the free insideBIGDATA newsletter.