In this special guest feature, Supreet Oberoi, Vice President of Field Engineering at Concurrent, Inc., discusses how companies should change their perspective on their data strategies and view the process as building a data library rather than a data lake.
The pace at which the world creates data will never be this slow again. And much of this new data we’re creating is unstructured, textual data. Emails. Word documents. News articles. Blogs. Reviews. Research reports… Understanding what’s in this text – and what isn’t, and what matters – is critical to an organization’s ability to understand the environments in which it operates. Its competitors. Its customers. Its weaknesses and its opportunities.
A number of industries rely on high-performance computing (HPC) clusters to process massive amounts of data. As organizations in these industries explore Big Data analytics based on Hadoop, they are realizing the value of converging Hadoop and HPC onto the same cluster rather than scaling out an entirely new Hadoop infrastructure.