Sign up for our newsletter and get the latest big data news and analysis.

Data Exploration with Databricks

The “Data Exploration on Databricks” jump start video below will show you how go from data source to visualization in a few easy steps. Specifically, you’ll see how to take semi-structured logs, easily extract and transform them, analyze and visualize the data using Spark SQL, so you can quickly understand your data.

Altiscale Increases Performance and Reliability of Hadoop and Spark Platform with Updated Altiscale Data Cloud

Altiscale, Inc., a leading provider of Big Data-as-a-Service, introduced Altiscale Data Cloud 4.0, featuring major upgrades to core Hadoop components, such as HDFS and YARN, and an expanded Spark-as-a-Service offering that supports all major versions of Apache Spark.

Data Science 101: Deep Learning – Theory and Applications

Deep Learning is a hot topic in statistical learning and many data scientists are seeking a place to start. Here is a presentation from the July 23rd SF Machine Learning Meetup at the Workday Inc. San Francisco office. The featured speaker is Ilya Sutskever.

Data Science for Social Good

Jake Porway is the founder and executive director of Datakind. In his Strata+Hadoop Keynote, Jake talks about data for the “best of intentions,” or using data to institute radical change to some of the world’s most pressing problems.

Is Data Science the New Snake Oil?

At the recent Web Summit 2015, Vitaly Gordon, Director of Data Science at Salesforce, delivered a short talk on how to identify all the data science “snake oil” sales pitches happening out there.

The Data Science Revolution

From the 2014 Milken Institute Global Conference, the presentation below includes a panel discussion led by Tim O’Reilly of O’Reilly Media. The panel includes representatives from companies like Rubicon Project, eBay, SAS and Ayasdi.

Advanced Data Science for Healthcare Scheduling Optimization

LeanTaaS iQueue is the flagship product of LeanTaaS. It applies advanced data science and machine learning to overcome healthcare scheduling complexity by optimizing the utilization of scarce resources in order to improve patient flow.

Topological Data Analysis for the Working Data Scientist

The talk below, “Topological Data Analysis for the Working Data Scientist” was presented at the SF Data Mining meetup group. Speaker Anthony Bak begins with a short review of the Mapper algorithm and discuss how to think about problems in the topological framework.

Machine Learning: Hottest Tech Trend in the Next 3-5 Years?

The featured talk focused on – by leveraging big data to allow computers to develop evolving behaviors, machine learning is vastly improving pattern recognition, allowing for broad application such as improved facial and speech recognition for application in many industries, especially national security.

Loop AI Labs Cognitive Computing Platform

The talk below by CTO Bart Peintner of Loop AI Labs was presented at the Deep Learning Summit in Boston on May 26, 2015 and coincides with the launch of the Loop Cognitive Computing Platform.