
Slidecast: Technical Overview of Cloudera Impala

In this slidecast, Justin Erickson from Cloudera presents a technical overview of Cloudera Impala, an SQL-on-Hadoop solution that lets users run real-time queries on data stored in Hadoop clusters.

To reduce latency, Impala bypasses MapReduce and accesses the data directly through a specialized distributed query engine, much like those found in commercial parallel RDBMSs. Depending on the type of query and the configuration, the result can be order-of-magnitude faster performance than Hive.
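As a rough illustration of the parallel-engine idea described above, the following Python sketch shows a scatter-gather aggregation: each node computes a partial result over its local rows, and a coordinator merges the partials. This is not Impala's code or API. The names `NODE_PARTITIONS`, `local_aggregate`, and `distributed_sum` are hypothetical, and threads stand in for separate cluster nodes.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor

# Hypothetical data: each inner list stands in for the rows stored on one node.
NODE_PARTITIONS = [
    [("web", 3), ("mobile", 5)],
    [("web", 7), ("tablet", 1)],
    [("mobile", 2), ("web", 4)],
]


def local_aggregate(rows):
    """Runs per node: partial SUM(value) GROUP BY key over local rows only."""
    partial = Counter()
    for key, value in rows:
        partial[key] += value
    return partial


def distributed_sum(partitions):
    """Coordinator: fan the query fragment out, then merge partial results."""
    # Threads are an assumption used to simulate nodes working in parallel.
    with ThreadPoolExecutor(max_workers=len(partitions)) as pool:
        partials = list(pool.map(local_aggregate, partitions))
    total = Counter()
    for partial in partials:
        total.update(partial)  # Counter.update adds counts per key
    return dict(total)


if __name__ == "__main__":
    print(distributed_sum(NODE_PARTITIONS))
```

The point of the sketch is that only small partial results travel to the coordinator, with no intermediate batch job in between, which is the general pattern parallel query engines rely on for low latency.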

