Spark 101: Online Approximate OLAP in SparkSQL

October 18, 2015 by Daniel Gutierrez 1 Comment

The Hadoop Summit 2015 talk below introduces G-OLA, a parallel approximate query engine built on top of BlinkDB and SparkSQL, that provides a radically different “online execution” paradigm to incrementally process massive amounts of data on clusters of hundreds or thousands of machine while returning approximate answers. G-OLA presents the user with a meaningful approximate result (with error bars) that is continuously refined at a speed comfortable to the user and enables them to control the query execution on the fly. The slides for this presentation are available HERE.

Sign up for the free insideBIGDATA newsletter.

Filed Under: inside SPARK, Main Feature, Uncategorized Tagged With: Weekly Newsletter Articles

Optimizing Performance and Cost Savings for Elastic on Pure Storage
[SPONSORED POST] Organizations can now confidently embrace Elastic, enhance their hot tier storage, and seamlessly manage historical data with cost-efficient capacity-optimized storage. Pure Storage not only meets the demands of the modern data landscape but also empowers organizations to simplify their Elastic architecture, reflecting the industry trend towards a more streamlined and efficient approach.

From complexity to clarity: Harnessing the power of AI/ML and risk-informed strategies to streamline clinical data management

In today’s fast-paced world, driven by demands for speed and efficiency, the field of clinical development has undergone a remarkable transformation. The way trials are being conducted has changed significantly with decentralized clinical trials (DCT) becoming mainstream and the collection of clinical data from wearables and other remote-monitoring devices becoming common practice. While these advances […]

Download

Speak Your Mind Cancel reply

Comments

Revathy Hari says

October 21, 2015 at 12:31 am

Can spark be used for generate sequential patterns from dynamic streams of big data( especially considering dna sequences as data set)?

Reply

Spark 101: Online Approximate OLAP in SparkSQL

Sponsored Guest Articles

Optimizing Performance and Cost Savings for Elastic on Pure Storage

White Papers

From complexity to clarity: Harnessing the power of AI/ML and risk-informed strategies to streamline clinical data management

Speak Your Mind Cancel reply

Comments

Featured RSS Feed

More News from insideHPC

Spark 101: Online Approximate OLAP in SparkSQL

Sponsored Guest Articles

Optimizing Performance and Cost Savings for Elastic on Pure Storage

White Papers

From complexity to clarity: Harnessing the power of AI/ML and risk-informed strategies to streamline clinical data management

Join Us On Social Media

Speak Your Mind Cancel reply

Comments

Related Posts

Featured RSS Feed

More News from insideHPC