The TIBCO Spark Connection

Print Friendly, PDF & Email

An Insider’s Guide to Apache Spark is a useful new resource directed toward enterprise thought leaders who wish to gain strategic insights into this exciting new computing framework. As one of the most exciting and widely adopted open-source projects, Apache Spark in-memory clusters are  driving new opportunities for application development as well as increased intake of IT infrastructure. This article is the fifth and final in a series that explores a high-level view of how and why many companies are deploying Apache Spark as a solution for their big data technology requirements. The complete An Insider’s Guide to Apache Spark is available for download from the insideBIGDATA White Paper Library.

insideBIGDATA_Guide_SparkThe TIBCO Spark Connection

TIBCO is committed to the Spark framework and its continued innovation. For example, TIBCO Spotfire® Cloud provides a data discovery and advanced analytics connector to Apache Spark™ SQL, along with the industry’s first commercial integration with SparkR. Certified by Databricks, the Apache Spark project’s steward, the TIBCO Spotfire Connector for Apache Spark SQL provides data scientists and business users a flexible, seamless,  and easy way to query, analyze, and visualize Apache Hadoop® data. This brings the power of Spark SQL to the Spotfire in-memory data engine and  also its in-database connectivity.

To complement the visualization of big data via Spark SQL, Spotfire customers can now leverage the analytic power of Spark combined with deeper insights available through predictive, geospatial, and event-based analytics. Spotfire customers can now use SparkR through the high-performance TIBCO® Enterprise Runtime for R (TERR) engine that is also embedded in Spotfire and other TIBCO products. This means that business analysts  and users everywhere can run high-value analytics on big data via Spark from simple-to-use Spotfire interfaces and templates. And because SparkR  now runs with TERR, customers can rapidly solve bigger, more complex analytical problems on their existing Spark clusters, driving extreme value in  many core business applications.

The company also offers an existing Databrickscertified TIBCO Jaspersoft® Spark integration.

Summary

As one of the most exciting and widely adopted open-source projects, Apache Spark in-memory clusters are driving new opportunities for application development as well as increased intake of IT infrastructure. Apache Spark is now the most active Apache project, with more than 600 contributions being made in the last 12 months by more than 200 organizations. A new survey conducted by Databricks—of 1,417 IT professionals working with Apache Spark finds that high-performance analytics applications that can work with big data are driving a large proportion of that demand. Apache  Spark is now being used to aggregate multiple types of data in-memory versus only pulling data from Hadoop. For solution providers, the Apache  Spark technology stack is a significant player because it’s one of the core technologies used to modernize data warehouses, a huge segment of the IT  industry that accounts for multiple billions in revenue.

Spark holds much promise for the future—with data lakes—a storage repository that holds a vast amount of raw data in its native format until it is needed. With Spark’s speed and scalability, data lakes can offer the enterprise a framework for virtually unlimited capacity.

If you prefer the complete An Insider’s Guide to Apache Spark is available for download in PDF from the insideBIGDATA White Paper Library, courtesy of TIBCO. Click HERE to take in a webinar event recorded on November 17, 2015.

 

Speak Your Mind

*

Comments

  1. Malli Karjuna says

    Thanks for sharing this valuable information.This is very useful to learners. TIBCO Online Training