Archives for 2013

Clojure for Data Science

Data science/big data exists at the crossroads of traditional analytics and large scale computation. As such, neither the traditional tools of analytics (R, Mathematica, Matlab) nor mainstreams languages (Java, C++, C#) supply its requirements precisely as they cannot simultaneously provide the required mathematical abstractions and real-word platform power. The Clojure language is unique in that it has the potential to provide the best of both worlds.

Machine Learning Research at arXiv.org

I’d like to acquaint you with a tremendous resource for keeping current with the latest research in the field of machine learning. Informally known as the pre-print server, arXix.org is the global repository for the fields of Physics, Mathematics, Computer Science, Quantitative Biology, Quantitative Finance and Statistics.

The 8 Critical Elements of Big Data Success

Big data and data analytics initiatives are large, complex projects with multiple layers and dimensions. In this brief video, Avnet CIO Steve Phillips lays out the eight critical elements needed to maximize the ROI of your big data or data analytics investment.  

DataSift Announces Intelligence Engine for Unstructured Social Data

DataSift, touted as the platform that powers the social economy, recently announced the availability of DataSift VEDO, a new processing engine for the DataSift platform, featuring Programmable Intelligence.

Big Data Humor: Random News

Ever wonder just where our elucidative news sources get their data? Coffee is good for you, coffee is bad for you, coffee is maybe OK for you …

Free 2013 Data Miner Survey

As 2013 draws to a close, a number of year-end surveys are coming out to assess the progress in our industry. I enjoy going through these results to get a pulse of big data and how it’s being received in the business community. Here is one valuable survey published annually for free: Rexer Analytics 6th Data Miner Survey for 2013.

TECH TIP: Generalization in Machine Learning

I recently ran across a blog post that discusses a very important characteristic for machine learning solutions – Generalization. If you’ve ever wondered about the primary reason why machines can learn, generalization is the concept you need to understand. It is the premise underlying all statistical learning and it goes something like this. We start […]

Slidecast: Introducing StorNext5 Appliances

“For customers who need to retain and access hundreds of terabytes of unstructured data, Quantum Lattus Object Storage is a self-healing, self-protecting private cloud solution that enables more efficient primary storage usage, delivers extreme archive data resiliency and protection, and offers low latency disk access to archive data. Compared to RAID or tape storage, Lattus Object Storage provides the most effective solution on a cost/performance basis for active access, retention and protection of unstructured data in large archive environments.”

Machine Learning: A Brief History

Machine learning has resulted in the development of solutions that are getting exponentially better each year. Already, algorithms using machine learning can drive cars, grade essays, write magazine articles, and read and understand newspapers. In the video presentation below from the recent TEDxSF conference, data scientist Jeremy Howard explains what the state of machine learning […]

Book Review: Big Data Marketing by Lisa Arthur

I’ve been promoting the concept of Computational Marketing for some time here on insideBIGDATA and elsewhere because it is quite clear that modern marketing is nothing without data. So it’s great to see a new book aligned with this notion – Big Data Marketing by Lisa Arthur. The data driven movement is marching forward at […]