Sign up for our newsletter and get the latest big data news and analysis.

Data Science 101: Support Vector Machines

Support Vector Machines (SVM) is an important and widely used machine learning algorithm. In order to fully understand SVMs, you need to have a fundamental understanding of how the statistical learning method functions. Here is a useful lecture on SVM coming from MIT OpenCourseware.

BeyondCore V Delivers One-Click Smart Pattern Discovery in Minutes

BeyondCored_logo

Business analytics innovator BeyondCore has announced the production release of BeyondCore V, one-click business analytics software designed for business users, setting the new industry gold standard for automated business analytics.

Deep Learning, Self-Taught Learning and Unsupervised Feature Learning

The video presentation below is a highly compelling talk by Stanford University professor and Coursera co-founder, Dr. Andrew Ng. Andrew addresses a graduate summer school audience at UCLA’s IPAM (Institute for Pure & Applied Mathematics) on the topic – Deep Learning, Feature Learning.

How ScaleOut Software Provides Operational Intelligence

ScaleOut_logo

In this video, Bill Bain from ScaleOut Software describes how the company’s in-memory data grid technology provides operational intelligence.

Sumo Logic Launches Transaction Analytics to Transform How Companies Learn from Business Events

Sumo Logic, the next generation machine data intelligence company, has announced availability of Transaction Analytics as a component of the Sumo Logic analytics platform to deliver deep visibility into causal relationships across distributed IT systems in order to better inform business decisions.

Data Science 101: Data Agnosticism – Feature Engineering Without Domain Expertise

From the SciPy2013 conference, here is a compelling talk “Data Agnosticism: Feature Engineering Without Domain Expertise” by Nicholas Kridler of Accretive Health in Chicago.

Doug Cutting Interview

In the presentation below, Hadoop luminary Doug Cutting gives us some of his perspectives on the big data industry as well as a high-level overview of the Hadoop technology stack.

The Analytics Frontier of the Hadoop Eco-System

Ted Wilkie

“The Hadoop MapReduce framework grew out of an effort to make it easy to express and parallelize simple computations that were routinely performed at Google. It wasn’t long before libraries, like Apache Mahout, were developed to enable matrix factorization, clustering, regression, and other more complex analyses on Hadoop. Now, many of these libraries and their workloads are migrating to Apache Spark because it supports a wider class of applications than MapReduce and is more appropriate for iterative algorithms, interactive processing, and streaming applications.”

Salesforce Delivers Wave, the Salesforce Analytics Cloud

SALESFORCE_LOGO_NEW

Salesforce (NYSE: CRM), a leading CRM vendor, has announced Wave, the Salesforce Analytics Cloud. Wave is the first cloud analytics platform designed for every business user, making it easier than ever for anyone to explore data, uncover new insights and take action instantly from any device.

European Commission to Invest €2.5 Billion in Big Data

big

“In this talk we summarize the results of the BIG project including analysis of foundational Big Data research technologies, technology and strategy roadmaps to enable business to understand the potential of Big Data technologies across different sectors, together with the necessary collaboration and dissemination infrastructure to link technology suppliers, integrators and leading user organizations.”