MapR Technologies, Inc., provider of a leading distribution for Apache™ Hadoop®, announced the availability of free Hadoop On-Demand Training for developers, analysts and administrators.
The next installment of insideBIGDATA’s Data Science 101 series comes from our friends over at LinkedIn.
More than 95 percent of companies have a formal data and analytics strategy in place with many favoring product development, IT and marketing over corporate real estate—until now. A new, independent study conducted by Forrester Consulting, commissioned by JLL, says that 75 percent of firms see corporate real estate information as a core part of a wider corporate data and analytics strategy.
The Random forests machine learning algorithm is a popular ensemble method used by many data scientists to achieve good predictive performance in the classification regime. Fully understanding the nuances of this statistical learning technique is paramount to getting the most out of this algorithm – unfortunately, this means math. The presentation below is from machine learning course CPSC 540 at The University of British Columbia,
In the presentation below, data scientist, author (“Applied Predictive Modeling” with Kjell Johnson) and R caret package developer Max Kuhn sits down for an in-depth interview with Eduardo Arino de la Rubia sponsored by our friends over at DataScience.LA. They discuss the art and science of predictive modeling in the real world, the multifaceted and […]