The Cube brings us this live streaming video from Hadoop World, which wraps up today in New York.
In this special guest feature, David Kolinek, Vice President of Product, Ataccama, asks why data quality is so important? Sometimes taking a step back and reviewing the basics can help clear things up. DQ is not a single activity but a series of actions that draw upon multiple resources and functions, all focused on making data usable in a purposeful way. It falls under data management, which has an overriding mission of delivering a view of datasets from various perspectives, enabling the type and quality level of data to be assessed.
[READ MORE...]This whitepaper from Imply Data Inc. introduces Apache Druid and explains why delivering real-time analytics on a data lake is so hard, approaches companies have taken to accelerate their data lakes, and how they leveraged the same technology to create end-to-end real-time analytics architectures.
Leave a Comment