As 2013 draws to a close, it is time for industry analysts and pundits to present their assessments of the year in order to predict where Big Data is headed in 2014. Forbes recently came out with a useful summary of predictions from IDC and IIA, which could serve as a balanced road map for […]
Monte Carlo methods are a broad class of computational algorithms that rely on repeated random sampling to obtain numerical results. In practice, that means running a simulation many times in succession and using the recorded outcomes to estimate probabilities, much like actually playing and recording your results in a real casino, hence the name.
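To make the casino analogy concrete, here is a minimal Python sketch of the idea. It is an illustration under stated assumptions, not code from the post: the function names `estimate_probability` and `dice_sum_is_seven` are hypothetical, and the chosen event (two fair dice summing to seven) is simply a convenient case whose exact answer, 1/6, is known.

```python
import random


def estimate_probability(event, trials=100_000, seed=0):
    """Estimate P(event) as the fraction of simulated trials in which it occurs.

    `event` is any function that takes a random.Random instance, simulates
    one play, and returns True if the outcome of interest happened.
    """
    rng = random.Random(seed)  # seeded so that repeated runs are reproducible
    hits = sum(1 for _ in range(trials) if event(rng))
    return hits / trials


def dice_sum_is_seven(rng):
    """Simulate one roll of two fair six-sided dice (illustrative event)."""
    return rng.randint(1, 6) + rng.randint(1, 6) == 7


if __name__ == "__main__":
    estimate = estimate_probability(dice_sum_is_seven)
    print(f"estimated: {estimate:.4f}  exact: {1 / 6:.4f}")
```

The estimate tightens as `trials` grows, since the sampling error shrinks roughly in proportion to one over the square root of the number of trials.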
The R statistical environment is renowned for struggling with large data sets. To resolve that limitation, HP Labs and HP Vertica have developed Distributed R, a scalable and high-performance platform for the R language. It splits tasks between multiple processing nodes (cores or nodes of a cluster) to vastly reduce execution time and gives users […]
2013 was a pretty good year for data and visualizations. Visualization in particular continues to mature, focusing on the data first rather than on novel designs and magnitude of scope. Our friends over at FlowingData have put together a thought-provoking review of many of the year’s most compelling visualizations.
As the demand for data scientists continues to rise sharply, the number of educational resources is also on a distinct upward slope. Here is a freshly updated list of graduate programs in big data analytics and data science.
Music recognition apps continue to capture the fascination of music lovers around the globe, with millions of searches per day. The major players in this space are companies such as Shazam, SoundHound and musiXmatch. Shazam is one company that’s taking a significant step forward by using its ever-growing music database to predict next year’s likely movers and shakers.