Altiscale, Inc., a leading provider of Hadoop-as-a-Service, today announced that Apache Spark is now available on the Altiscale Data Cloud. Altiscale customers can now use Apache Spark on Apache Hadoop to achieve their critical analytical and business objectives.
“NVIDIA will present an update on accelerated computing, in particular, the latest developments in the platform. They will touch upon NVLink, OpenPOWER, ARM64, and new software updates and also cover the broad-sweeping impact that a new field of machine learning, called Deep Learning, is having on applications and domains.”
The next installment of insideBIGDATA’s Data Science 101 series comes from our friends over at LinkedIn.
The random forests machine learning algorithm is a popular ensemble method used by many data scientists to achieve good predictive performance in the classification regime. Fully understanding the nuances of this statistical learning technique is paramount to getting the most out of this algorithm – unfortunately, this means math. The presentation below is from the machine learning course CPSC 540 at The University of British Columbia,
In the presentation below, data scientist, author (“Applied Predictive Modeling” with Kjell Johnson) and R caret package developer Max Kuhn sits down for an in-depth interview with Eduardo Arino de la Rubia sponsored by our friends over at DataScience.LA. They discuss the art and science of predictive modeling in the real world, the multifaceted and […]
Data Science is the key to unlocking insight from Big Data: by combining computer science skills with statistical analysis and a deep understanding of the data and the problem, we can not only make better predictions but also fill in gaps in our knowledge and find answers to questions we hadn’t even thought of yet.