The inside Spark channel is a resource for professionals looking to learn about the benefits of Apache Spark

MapR Releases New Ecosystem Pack with Optimized Security and Performance for Apache Spark

May 23, 2017 by Editorial Team Leave a Comment

MapR Technologies, Inc., the provider of the Converged Data Platform that converges the essential data management and application processing technologies on a single, horizontally scalable platform, announced its next major release of the MapR Ecosystem Pack (MEP) program. MEP is a broad set of open source ecosystem projects that enable big data applications running on the MapR Converged Data Platform with inter-project compatibility.

Filed Under: Big Data, Google News Feed, inside SPARK, News / Analysis, Uncategorized Tagged With: Apache Spark, Weekly Newsletter Articles

Databricks Launches New Edition of Its Spark-Based Cloud Platform for Data Engineers

May 22, 2017 by Editorial Team Leave a Comment

Databricks, the company founded by the creators of the popular Apache Spark project and providers of the leading Spark-based cloud platform for data science, announced an edition of its cloud platform optimized specifically for data engineering workloads called Databricks for Data Engineering.

Filed Under: Big Data, Databricks, Google News Feed, inside SPARK, News / Analysis, Uncategorized Tagged With: Apache Spark, Weekly Newsletter Articles

5 Common Myths Around Virtualizing Big Data (Number 3 is SANdalous!)

May 12, 2017 by Editorial Team Leave a Comment

In this contributed article, Justin Murray, Technical Marketing Manager at VMware, discusses 5 common myths around virtualizing big data. Big data burst on to the scene a little over a decade ago. Today it is not an obscure term confined to just a handful of bleeding edge companies. It is a mainstream trend that every enterprise undergoing a digital transformation journey has adopted. The technology landscape around big data has broadened dramatically.

Filed Under: Big Data, Featured, Google News Feed, inside Hadoop, inside SPARK, News / Analysis, Opinion, Uncategorized Tagged With: Big Data, data virtualization, Weekly Newsletter Articles

Information Builders Offers iWay Big Data Integrator on Microsoft Azure Marketplace Cloud

April 30, 2017 by Editorial Team Leave a Comment

Information Builders, a leader in business intelligence (BI) and analytics, data integrity, and integration solutions, announced that it is offering its iWay Big Data Integrator (iBDI) product via the cloud on the Microsoft Azure Marketplace.

Filed Under: Big Data, Hadoop, inside Hadoop, inside SPARK, News / Analysis, Uncategorized Tagged With: data integration, Weekly Newsletter Articles

Data as a Critical Element in the Discovery and Delivery of Smart Energy

April 21, 2017 by Editorial Team Leave a Comment

In this contributed article, Jules S. Damji, an Apache Spark Community Evangelist with Databricks, shows how as the value of data continues to grow, the next-generation smart grid should become a reality, benefiting utility companies and consumers alike.

Filed Under: Big Data, Databricks, Energy, Featured, Google News Feed, inside SPARK, News / Analysis, Opinion, Uncategorized Tagged With: Apache Spark, Machine Learning, Weekly Featured Newsletter Post

Pepperdata Integrates Performance into DevOps for Big Data

April 4, 2017 by Editorial Team Leave a Comment

Pepperdata, the Big Data performance company, announced it is expanding its product portfolio with Pepperdata Application Profiler, providing Hadoop and Spark developers with easy to understand recommendations for improving job performance. Application Profiler is currently available in early access and will be generally available in the second quarter of 2017.

Filed Under: Big Data, Google News Feed, Hadoop, inside Hadoop, inside SPARK, News / Analysis, Uncategorized Tagged With: Apache Spark, Big Data, Hadoop, Weekly Newsletter Articles

EnterpriseDB Announces New Apache Spark Connecter to Speed Postgres Big Data Processing

March 3, 2017 by Editorial Team Leave a Comment

EnterpriseDB® (EDB™), the database platform company for digital business, announced the general availability of a new version of the EDB Postgres Data Adapter for Hadoop with compatibility for the Apache Spark cluster computing framework. The new version gives organizations the ability to combine analytic workloads based on the Hadoop Distributed File System (HDFS) with operational data in Postgres, using an Apache Spark interface.

Filed Under: Big Data, Data Storage, Google News Feed, inside SPARK, News / Analysis, Uncategorized Tagged With: Apache Spark, Weekly Newsletter Articles

The Leaky Pipeline Problem -  Making your Mark as a Woman in Big Data

February 15, 2017 by Editorial Team Leave a Comment

insideBIGDATA was on hand for the recent Spark Summit East 2017 conference in Boston, and one of the more compelling presentations was by Kavitha Mariappan, VP Marketing at Databricks. The talk focused on the premise that despite the tremendous growth and opportunities in big data today, women still play a small role in this arena.

Filed Under: Big Data, Databricks, Google News Feed, inside SPARK, Main Feature, News / Analysis, Uncategorized Tagged With: Big Data, Weekly Newsletter Articles

Percipient Launches SparkPLUS to Solve Apache Spark’s Out-of-memory Problems

January 24, 2017 by Editorial Team Leave a Comment

Percipient, a Singapore-based startup, is launching a revolutionary solution to address the memory issues incurred by users of open source platform, Apache Spark. By delivering unified data a priori to the Spark platform, Percipient’s SparkPLUS solution is able to multiply the platform’s computing space, thereby greatly enhancing its utility for real time and analytical applications.

Filed Under: Big Data, Google News Feed, inside SPARK, News / Analysis, Uncategorized Tagged With: Apache Spark, Weekly Newsletter Articles

Monte Carlo Simulations in Ad-Lift Measurement Using Spark

January 7, 2017 by Editorial Team Leave a Comment

In this talk from Spark Summit East 2016, Prasad Chalasani explores some of the challenges that arise in setting up scalable simulations in a specific application, and share some solutions and lessons learned along the way, in the realms of mathematics and programming.

Filed Under: Big Data, inside SPARK, Main Feature, Uncategorized, Video Tagged With: Apache Spark

Optimizing Performance and Cost Savings for Elastic on Pure Storage
[SPONSORED POST] Organizations can now confidently embrace Elastic, enhance their hot tier storage, and seamlessly manage historical data with cost-efficient capacity-optimized storage. Pure Storage not only meets the demands of the modern data landscape but also empowers organizations to simplify their Elastic architecture, reflecting the industry trend towards a more streamlined and efficient approach.

From complexity to clarity: Harnessing the power of AI/ML and risk-informed strategies to streamline clinical data management

In today’s fast-paced world, driven by demands for speed and efficiency, the field of clinical development has undergone a remarkable transformation. The way trials are being conducted has changed significantly with decentralized clinical trials (DCT) becoming mainstream and the collection of clinical data from wearables and other remote-monitoring devices becoming common practice. While these advances […]

Download