Unsupervised statistical learning is growing in importance for the discovery of previously unknown knowledge in enterprise data assets. The presentation below by Derek Kane provides an overview of clustering techniques, including K-Means, Hierarchical Clustering, and Gaussian Mixed Models. He goes through some methods of calibration and diagnostics and then applies the technique on a recognizable data set. The slides for the talk can be found HERE.
Sign up for the free insideBIGDATA newsletter.