“This talk will provide an overview of challenges in accelerating Hadoop, Spark and Memcached on modern HPC clusters. An overview of RDMA-based designs for multiple components of Hadoop (HDFS, MapReduce, RPC and HBase), Spark, and Memcached will be presented. Enhanced designs for these components to exploit in-memory technology and parallel file systems (such as Lustre) will be presented. Benefits of these designs on various cluster configurations using the publicly available RDMA-enabled packages from the OSU HiBD project (http://hibd.cse.ohio-state.edu) will be shown.”
Deep Learning is a relatively new area of Machine Learning research which has been introduced with the objective of moving Machine Learning closer to one of its original goals: Artificial Intelligence. The video presentation below is from the 2016 Stanford HPC Conference, where Brian Catanzaro from Baidu presents: “Scaling Deep Learning.”
As the use of GPUs continues to rise in fields like deep learning, we are offering the “Introduction to GPU Computing” presentation below for readers not yet familiar with the technology.
I was excited to attend a very compelling Meetup featuring Anthony Goldbloom, co-founder and CEO of Kaggle, who talked about the genesis of his company and what they’ve learned along the way – “What has Kaggle Learned from 2 Million Machine Learning Models?” It was fascinating!
Talend, a global leader in big data integration software, today introduced Talend Data Preparation, a self-service application that enables business users to simplify and expedite the often laborious and time-consuming process of data wrangling, that is, the data manipulation and analysis tasks that are often performed using spreadsheets.
SAS and OSIsoft are showcasing how predictive analytics from SAS and infrastructure-management software from OSIsoft can transform asset data from IoT-connected devices into an optimized grid with the Salt River Project powered by SAS.
In the presentation below, Seth Juarez of DevExpress discusses architecting predictive algorithms for machine learning.
Splunk Inc. (NASDAQ: SPLK), provider of a leading software platform for real-time Operational Intelligence, announced that VenueNext, creator of integrated technology platforms for venues, is using Splunk® Enterprise to help the Levi’s Stadium operational teams make real-time, data-driven decisions that enhance operational effectiveness, precision and control.
To show that the data analysis package R is no scarier than Excel, John Mount of the Win-Vector blog walks through a simple analysis in both Excel and R.
In the talk below, “Recursive Deep Learning for Modeling Compositional and Grounded Meaning,” Richard Socher, founder of MetaMind, describes deep learning algorithms that learn representations for language that are useful for solving a variety of complex language tasks.