Sign up for our newsletter and get the latest big data news and analysis.

Data Science 101: Excel Tutorial on Analyzing Large Data Sets

Data Science

Our friends over at Udemy partnered with data scientist David Taylor (specialist in data spelunking and visualization) to create a fun (and free) Excel tutorial on analyzing large data sets.

MapR Free Hadoop On-Demand Training Exceeds 10,000 Registrants in First Month

MapR Logo - New 2014_FEATURE

MapR Technologies, Inc., provider of a top-ranked distribution for Apache™ Hadoop®, announced its newly available free Hadoop On-Demand Training program has enrolled more than 10,000 registrants worldwide in its first 30 days, signaling strong interest in Hadoop skills training.

Free eBook! Software Defined Storage for Dummies

Software_Defined_Storage_Dummies_feature

Download your FREE copy of “Software Defined Storage for Dummies” today, compliments of IBM Platform Computing! This new learning resource can help enterprise thought leaders better understand the new area of software define storage in support of big data initiatives. Software defined storage is a relatively new concept in the computing and storage industry and […]

Statistics is the Fastest Growing Undergraduate STEM Degree

statistics-logo

Statistics—the science of learning from data—is the fastest-growing science, technology, engineering and math (STEM) undergraduate degree in the United States over the last four years, an analysis of federal government education data conducted by the American Statistical Association (ASA) revealed.

MapR Unveils Free Hadoop On-Demand Training Program

MapR Logo - New 2014_FEATURE

MapR Technologies, Inc., provider of a leading distribution for Apache™ Hadoop®, announced the availability of free Hadoop On-Demand Training for developers, analysts and administrators.

Data Science 101: NoSQL Data Modelling

With the big data education resource below, you can learn about data modelling in a NoSQL environment.

Ask a Data Scientist: Ensemble Methods

datascientist2_featured

Welcome back to our series of articles sponsored by Intel – “Ask a Data Scientist.” This week’s question is from a reader who asks about ensemble methods and how you use them.

Ask a Data Scientist: Confounding Variables

datascientist2_featured

Welcome back to our series of articles sponsored by Intel – “Ask a Data Scientist.” This week’s question is from a reader who asks for an explanation of confounding variables and why they’re important in data science projects.

Data Science 101: Cassandra Tutorial for Beginners

Provided by our friends over at Edureka, Module 1 of their Apache Cassandra course below discusses the fundamental concepts of using a highly-scalable, column-oriented database to implement appropriate use cases.

Data Science 101: Support Vector Machines

Support Vector Machines (SVM) is an important and widely used machine learning algorithm. In order to fully understand SVMs, you need to have a fundamental understanding of how the statistical learning method functions. Here is a useful lecture on SVM coming from MIT OpenCourseware.