At the beginning of the new year it is interesting to learn how the important players in the big data ecosystem see the future. Our friends over at MapR Technologies asked their CEO and Co-founder, John Schroeder, to convey his views on how several major trends will occur in 2014 that will impact how companies store, process, analyze and share big data.
Companies are compelled to uncover real value in their data, and they’re turning to Hadoop as a key part of their data center strategy. In 2014, data centers will move away from trial deployments of Hadoop and into full production. To get the most out of Hadoop, it is important to understand where the technology is headed so that organizations can plan, strategize and invest appropriately.
Here’s how John sees what’s in store:
In 2013 companies increasingly came to rely on Hadoop to unlock the hidden value of their rapidly expanding data, in order to drive increased growth and profitability,” said John Schroeder. “In 2014 we’ll see organizations move en masse to deploy mainstream applications with mission-critical requirements that include high availability, better performance, data protection, security and ease of use. It will be an exciting year for those of us driving innovation to see how companies will use their data to realize measurable business advantage.”
- SQL simultaneously becomes the biggest promise and disappointment for Big Data. SQL development for Hadoop lets business analysts use their skills and SQL tools of choice for Big Data projects. But, SQL solutions that require pre-defining data structure and central manual administration, cause delays and impact scale. The goal is to enable SQL savvy analysts easy access to the structured and unstructured data in a scalable Hadoop platform; not re-create the wheel that exists in data warehouse environments.
- The 3 top security concerns in 2014: Authentication, authentication, authentication. With an onslaught of access control capabilities available in Hadoop, organizations quickly realize that wire level authentication is the required foundation.
- With 2014 comes the realization that data errors are “leads for optimization.” It’s not data quality, but data errors that occupy organizations in 2014. Do data errors indicate issues with underlying source systems? Are data errors the result of ETL issues that are introducing biases in downstream analysis? Do data errors indicate definitional differences or a lack of consistency across departments and business segments? 2014 will see the embracing of data anomalies.
- Emergence of operational Hadoop. 2014 will see a dramatic increase in production deployments of Hadoop by companies across industries. Such deployments will reveal the power of Hadoop in operations where production applications and analytics provide measureable business advantage.
- Majority of data warehouse environments will deploy an enterprise data hub. 2014 will see a majority of companies deploying Hadoop to offload ETL processing and data from enterprise data warehouses to Hadoop acting as a central enterprise hub that is ten times cheaper and can perform more analytics for additional processing or new applications.
- Every industry leader will deploy a new data centric application or they won’t be leading for long. The ability to leverage Big Data will emerge as the competitive weapon in 2014 as more companies will use Big Data and Hadoop to pinpoint individual consumers’ preferences for profitable upsell and cross-sell opportunities, better mitigate risk, and reduce production and overhead costs.
- The recognition that data is the center of the data center. In 2014 organizations will transition from developers driving the Big Data initiatives and increasingly task IT with defining the data infrastructure required to support diverse applications and focus on the infrastructure required to deploy, process and protect an organizations core asset.
- Search emerges as the Unstructured Query Language. In 2013 we saw a large number of SQL initiatives for Hadoop; 2014 will be the year that the “unstructured query language”, i.e. search, comes into full focus.
Sign up for the free insideBIGDATA newsletter.