As 2013 draws to a close, it is time for industry analysts and pundits to present their assessments of the year in order to predict where Big Data is headed in 2014. Forbes recently came out with a useful summary of predictions from IDC and IIA which could serve as a balanced road map for […]
Archives for 2013
“Can Big Data on OpenStack become a first-class citizen just like any other framework? How do new features such as Bare Metal support affect the use of Big Data on OpenStack?”
Monte Carlo methods are a broad class of computational algorithms that rely on repeated random sampling to obtain numerical results. A simulation is run many times in succession to estimate probabilities empirically, much like playing a game in a real casino over and over and recording the results: hence the name.
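To make the casino analogy concrete, here is a minimal sketch of repeated random sampling in Python. It assumes a European roulette wheel with 37 pockets, 18 of them red; the function name `estimate_red_probability` and the pocket encoding are hypothetical choices for illustration, not taken from the original post.

```python
import random


def estimate_red_probability(num_spins, seed=0):
    """Estimate the chance of landing on red by simulating many spins."""
    rng = random.Random(seed)  # seeded so repeated runs give the same estimate
    reds = 0
    for _ in range(num_spins):
        # Assumed encoding: pockets 0..36, where 1..18 stand in for the 18 red pockets.
        pocket = rng.randrange(37)
        if 1 <= pocket <= 18:
            reds += 1
    # The fraction of simulated spins that hit red approximates the true probability.
    return reds / num_spins


estimate = estimate_red_probability(200_000)
exact = 18 / 37  # known answer, used only to check the simulation
print(f"simulated: {estimate:.4f}, exact: {exact:.4f}")
```

The exact answer is known here, which makes it easy to see the estimate tighten as the number of spins grows. In practice the same loop is applied to problems where no closed-form answer is available.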
The R statistical environment is renowned for struggling with large data sets. To resolve that limitation, HP Labs and HP Vertica have developed Distributed R, a scalable and high-performance platform for the R language. It splits tasks between multiple processing nodes (cores or nodes of a cluster) to vastly reduce execution time and gives users […]
Really?! This sounds like a recent sales meeting I had with a prospect who didn’t have a great experience with the last “data scientist” who wasn’t really a data scientist!
In this video, the Bipartisan Policy Center, in partnership with Intel, hosts a conversation on the far-reaching implications of the big data “information revolution” for innovation, economic growth, privacy and security in our increasingly connected world.
2013 was a pretty good year for data and visualizations. Visualization in particular continues to mature, putting the data first rather than novel designs and sheer scope. Our friends over at FlowingData have put together a thought-provoking review of many of the year’s most compelling visualizations.
Kaggle is the de facto standard for data science competitions. The organization has made a huge splash in this space by providing a platform that pushes the limits of machine learning technology while helping all types of enterprises get more value from their data assets.
As the demand for data scientists continues to rise rapidly, the number of educational resources is also on a distinct upward slope. Here is a freshly updated list of graduate programs in big data analytics and data science.
Music recognition apps continue to fascinate music lovers around the globe, handling millions of searches per day. The major players in this space are companies such as Shazam, SoundHound and musiXmatch. Shazam is one company that’s taking a significant step forward by using its ever-growing music database to predict next year’s likely movers and shakers.