Sign up for our newsletter and get the latest big data news and analysis.

Data Science for Social Good

Jake Porway is the founder and executive director of Datakind. In his Strata+Hadoop Keynote, Jake talks about data for the “best of intentions,” or using data to institute radical change to some of the world’s most pressing problems.

Is Data Science the New Snake Oil?

At the recent Web Summit 2015, Vitaly Gordon, Director of Data Science at Salesforce, delivered a short talk on how to identify all the data science “snake oil” sales pitches happening out there.

The Data Science Revolution

From the 2014 Milken Institute Global Conference, the presentation below includes a panel discussion led by Tim O’Reilly of O’Reilly Media. The panel includes representatives from companies like Rubicon Project, eBay, SAS and Ayasdi.

Advanced Data Science for Healthcare Scheduling Optimization

LeanTaaS iQueue is the flagship product of LeanTaaS. It applies advanced data science and machine learning to overcome healthcare scheduling complexity by optimizing the utilization of scarce resources in order to improve patient flow.

Topological Data Analysis for the Working Data Scientist

The talk below, “Topological Data Analysis for the Working Data Scientist” was presented at the SF Data Mining meetup group. Speaker Anthony Bak begins with a short review of the Mapper algorithm and discuss how to think about problems in the topological framework.

Machine Learning: Hottest Tech Trend in the Next 3-5 Years?

The featured talk focused on – by leveraging big data to allow computers to develop evolving behaviors, machine learning is vastly improving pattern recognition, allowing for broad application such as improved facial and speech recognition for application in many industries, especially national security.

Loop AI Labs Cognitive Computing Platform

The talk below by CTO Bart Peintner of Loop AI Labs was presented at the Deep Learning Summit in Boston on May 26, 2015 and coincides with the launch of the Loop Cognitive Computing Platform.

Cloudera Navigator Optimizer Provides Active Data Optimization for Hadoop Workloads

Cloudera, provider of the data management and analytics platform built on Apache Hadoop and the latest open source technologies, announced the availability of Cloudera Enterprise 5.5. This release continues to improve the performance, security, and functionality of analytics on Hadoop and includes the limited beta release of Cloudera Navigator Optimizer for improved workload performance and efficiency.

Sisense Introduces Version 6 to Simplify Business Analytics

Sisense, a leader in simplifying business analytics for complex data, announced the launch of Sisense version 6. This new version of its business analytics software solution significantly expands Sisense’s advanced analytics capabilities, as well as furthers the platform’s ability to integrate across the business analytics ecosystem.

Advanced Apache Spark

Big data is going Spark crazy! Here’s a whopping 6 hour intensive, fast-paced and vendor agnostic look at Spark Core presented by Sameer Farooqui, a client services engineer at Databricks.