Archives for 2013

Panel: Why Open Stack is Game Changer for Big Data

“Can Big Data on OpenStack become a first class citizen just like any other framework? How does new features such as Bare Metal support effect the use of Big Data on OpenStack?”

Data Science 101: Probability and Monte Carlo Methods

Monte Carlo methods are a broad class of computational algorithms that rely on repeated random sampling to obtain numerical results, i.e. by running simulations many times in succession in order to calculate those same probabilities with machine learning just like actually playing and recording your results in a real casino situation: hence the name.

HP Vertica Distributed R: Advanced Analytics for Big Data

The R statistical environment is renowned for struggling with large data sets. To resolve that limitation, HP Labs and HP Vertica have developed Distributed R, a scalable and high-performance platform for the R language. It splits tasks between multiple processing nodes (cores or nodes of a cluster) to vastly reduce execution time and gives users […]

Big Data Humor: The Real Customer Segmentation

Really?! This sounds like a recent sales meeting I had with a prospect who didn’t have a great experience with the last “data scientist” who wasn’t really a data scientist!

Panel: How Big Data is Changing our Lives

In this video, the Bipartisan Policy Center, in partnership with Intel, hosts a conversation on the far-reaching implications of the big data “information revolution” for innovation, economic growth, privacy and security in our increasingly connected world.

Visualization of the Week: 2013 Year in Review

2013 was a pretty good year for data and visualizations. Visualization in particular continues to mature and focus more on the data first than on novel designs and magnitude of scope. Our friends over at FlowingData have put together a thought-provoking REVIEW of many of the year’s most compelling visualizations.

The Superstars of Kaggle Competitions

Kaggle is the de facto standard for data science competitions. The organization has made a huge splash in this space by providing a platform for pushing the limits of machine learning technology by assisting all types of enterprises in getting more value from their data assets.

80+ Graduate Programs in Analytics and Data Science

As the demand for data scientists continues to rise on an exponential scale, the number of educational resources also is on a distinct upward slope. Here is a freshly updated list of graduate programs in big data analytics and data science.

Using Big Data to Predict Music’s Next Big Artists

Music recognition apps continue to capture the fascination of music lovers around the globe with millions of searches per day. The major players in this space are companies such as Shazam, SoundHound and musiXmatch. Shazam is one company that’s taking a significant step forward by using its ever growing music database to predict next year’s likely movers-and-shakers.

Create a Christmas Tree Data Model in 30 Seconds

It’s Seasons Greetings time again and our friends over at data visualization vendor Miner3D thought you would enjoy seeing the power of their new Miner3D ONE software on something Christmas related. Here is a video showing the use of simulated data in creating a special Yuletide data model.