Sign up for our newsletter and get the latest big data news and analysis.

ClearStory’s Spark-Powered Data Analysis Teams Up with Google Cloud Dataflow

ClearStory-Data-LogoClearStory Data, the company bringing business-oriented Data Intelligence to everyone, announced a new collaboration with the Google Cloud Platform to speed data analysis on large, diverse data to answer new business questions. By integrating ClearStory’s Spark-based analysis engine and its built-in data prep and data harmonization capabilities with Google Cloud Dataflow, users can analyze more data, explore and reach fast-cycle, business-ready insights. The combined solution is an intuitive, scalable solution that is language-agnostic and can be used to perform complex batch or streaming data processing and analytics. ClearStory Data announces general availability of new solution today.

The combination of Google Cloud Dataflow with ClearStory’s Spark-based analysis engine, built-in data harmonization capabilities and intuitive user interface, democratizes access to fast-cycle insights so everyone in business can make informed, timely, data-driven decisions using up-to-the-minute data analysis from disparate sources. The resulting data analysis is visualized using ClearStory’s Interactive, Collaborative Storyboards™ that let users share and collaborate on insights, ask new questions, and create fresh analysis on various data groupings.

Through our new product integration with Google Cloud Dataflow, our customers can quickly distill meaningful insights from large, disparate data sets to reach smarter answers faster based on what’s happening that impacts their businesses now,” said Tim Howes, chief technology officer at ClearStory Data. “By combining Google Cloud Dataflow for its powerful data transformation pipeline capabilities with ClearStory’s Spark-based harmonization and visual, interactive StoryBoards for collaboration, businesses can ask data-driven questions and easily collaborate with less IT dependency.”

Customer benefits of using ClearStory Data and Google Cloud Dataflow include:

  • Reduced cost and time saved when processing and analyzing large datasets. It automatically optimizes your data-centric pipeline code by collapsing multiple logical passes into a single execution pass.
  • Fast, out-of-the-box data access and data prep and inference – ClearStory provides highly scalable and Spark-powered data access in Google Cloud Dataflow and vice versa. Upon accessing data from Google Cloud Dataflow’s pipeline, ClearStory’s Data Inference Engine determines attributes in the source data to accelerate data prep and data harmonization, eliminating traditional, lengthy, complex data prep operations.
  • Fast-cycle analysis: Users can select disparate data to be blended and harmonized. ClearStory’s solution and built-in Intelligent Data Harmonization™ automates blending more sources and complex data to deliver immediate holistic, visual insights.
  • Easy business consumption of data insights: Business users are empowered to be more self-reliant in asking new questions and iterating on answers quickly with ClearStory’s Interactive, Collaborative StoryBoards™ that capture the latest insights. As data refreshes, users can more easily see, collaborate, and answer key business questions on a fast cycle. This enables consistent, faster, data-driven decisions.
  • Simplicity for technical users: Cloud Dataflow makes it easy to write data-processing pipelines that incorporate both batch and stream-processing capabilities that are language-agnostic.
  • Increased efficiencies: Full lifecycle management of required compute resources, in order to reduce burden related to resource management and cluster operations. ClearStory’s data harmonization capabilities reduce IT-dependency and frees up those technical resources to work on more strategic company initiatives.

 

Sign up for the free insideBIGDATA newsletter.

Leave a Comment

*

Resource Links: