Sign up for our newsletter and get the latest big data news and analysis.

Topological Data Analysis for the Working Data Scientist

Data Science

The talk below, “Topological Data Analysis for the Working Data Scientist” was presented at the SF Data Mining meetup group. Speaker Anthony Bak begins with a short review of the Mapper algorithm and discuss how to think about problems in the topological framework.

Andrew Ng Talks Deep Learning at Bay Learn 2015


In this short presentation, Dr. Andrew Ng (world renowned deep learning luminary, Chief Scientist of Baidu; Chairman and Co-Founder of Coursera; Stanford CS faculty) talks about what’s going on with deep learning and how it is rapidly changing the problem domains that can be addressed with machine learning. In particular, Ng announces a deep learning […]

TensorFlow: Second Generation Deep Learning System


Jeff Dean of Google presented this talk at BayLearn 2015. “In this talk, I’ll highlight some of the lessons we have learned in using our first-generation distributed training system and discuss some of the design choices in our second-generation system. I’ll then discuss ways in which we have applied this work to a variety of problems in Google’s products, usually in close collaboration with other teams.”

Data Science & Analytics at Birchbox

Data Science

Liz Crawford, CTO of Birchbox, presented at Data Driven NYC in October 2015. She gave a behind-the-scenes look at Birchbox’s Data Science & Analytics practice.

Analysis Paralysis – an Overtold Cliché or a Case in Point?

Smita Adhikary

In this special guest feature for our Data Science 101 channel, Smita Adhikary of Big Data Analytics Hires highlights how data scientists sometimes tend to get bogged down in the “how” of a problem rather than the “why” of it, and end up delivering highly predictive, yet essentially meaningless models for the business.

Data Science 101: Getting a Free Mathematics Education for Data Science

Black Square Button with Diploma and Hat

In this special feature, Daniel D. Gutierrez, Managing Editor of insideBIGDATA provides a number of free educational resources designed for budding data scientists to gain a foundation in mathematics.

Data Science 101: An Introduction to scikit-learn – Machine Learning in Python


The tutorial presentation below offers an introduction to the scikit-learn package and to the central concepts of Machine Learning.

Data Science 101: Using the RForcecom R Package for Salesforce

Data Science

In the short instructional video below, you’ll learn how to set up RForcecom, read an opportunity list from Salesforce and utilize a decision tree machine learning algorithm.

Data Science 101: Is Logistic Regression Dead?

Smita Adhikary

In this special guest feature for our Data Science 101 channel, Smita Adhikary of Big Data Analytics Hires shares her thoughts about how the data science community has changed over the years – many useful tips for those just entering the field. Smita is a Managing Consultant at Big Data Analytics Hires – a talent search and recruiting firm focused primarily on Data Science and Decision Science professionals.

Data Science 101: Choosing the Right NoSQL Database

Data Science

NoSQL includes a wide range of different database technologies and was developed as a result of surging volume of data stored. The presentation below covers a number of topics to help you choose the right NoSQL database for your application