Apache Spark Archives - Page 3 of 6

Databricks Launches New Edition of Its Spark-Based Cloud Platform for Data Engineers

May 22, 2017 by Editorial Team Leave a Comment

Databricks, the company founded by the creators of the popular Apache Spark project and providers of the leading Spark-based cloud platform for data science, announced an edition of its cloud platform optimized specifically for data engineering workloads called Databricks for Data Engineering.

Filed Under: Big Data, Databricks, Google News Feed, inside SPARK, News / Analysis, Uncategorized Tagged With: Apache Spark, Weekly Newsletter Articles

Impetus Technologies Announces StreamAnalytix 3.0 Featuring Support for Apache Spark-Based Batch Processing

April 25, 2017 by Editorial Team Leave a Comment

Impetus Technologies, a big data thought leader and software solutions company, announced StreamAnalytix™ 3.0 featuring support for Apache Spark-based batch processing and enriched online and offline machine learning features, helping enterprises maximize the performance of their analytical models and achieve the most favorable business outcomes. The newest version adds to the stream processing capabilities driven […]

Filed Under: Analytics, Big Data, News / Analysis, Uncategorized Tagged With: Apache Spark, streaming analytics, Weekly Newsletter Articles

Data as a Critical Element in the Discovery and Delivery of Smart Energy

April 21, 2017 by Editorial Team Leave a Comment

In this contributed article, Jules S. Damji, an Apache Spark Community Evangelist with Databricks, shows how as the value of data continues to grow, the next-generation smart grid should become a reality, benefiting utility companies and consumers alike.

Filed Under: Big Data, Databricks, Energy, Featured, Google News Feed, inside SPARK, News / Analysis, Opinion, Uncategorized Tagged With: Apache Spark, Machine Learning, Weekly Featured Newsletter Post

Pepperdata Integrates Performance into DevOps for Big Data

April 4, 2017 by Editorial Team Leave a Comment

Pepperdata, the Big Data performance company, announced it is expanding its product portfolio with Pepperdata Application Profiler, providing Hadoop and Spark developers with easy to understand recommendations for improving job performance. Application Profiler is currently available in early access and will be generally available in the second quarter of 2017.

Filed Under: Big Data, Google News Feed, Hadoop, inside Hadoop, inside SPARK, News / Analysis, Uncategorized Tagged With: Apache Spark, Big Data, Hadoop, Weekly Newsletter Articles

EnterpriseDB Announces New Apache Spark Connecter to Speed Postgres Big Data Processing

March 3, 2017 by Editorial Team Leave a Comment

EnterpriseDB® (EDB™), the database platform company for digital business, announced the general availability of a new version of the EDB Postgres Data Adapter for Hadoop with compatibility for the Apache Spark cluster computing framework. The new version gives organizations the ability to combine analytic workloads based on the Hadoop Distributed File System (HDFS) with operational data in Postgres, using an Apache Spark interface.

Filed Under: Big Data, Data Storage, Google News Feed, inside SPARK, News / Analysis, Uncategorized Tagged With: Apache Spark, Weekly Newsletter Articles

Percipient Launches SparkPLUS to Solve Apache Spark’s Out-of-memory Problems

January 24, 2017 by Editorial Team Leave a Comment

Percipient, a Singapore-based startup, is launching a revolutionary solution to address the memory issues incurred by users of open source platform, Apache Spark. By delivering unified data a priori to the Spark platform, Percipient’s SparkPLUS solution is able to multiply the platform’s computing space, thereby greatly enhancing its utility for real time and analytical applications.

Filed Under: Big Data, Google News Feed, inside SPARK, News / Analysis, Uncategorized Tagged With: Apache Spark, Weekly Newsletter Articles

Monte Carlo Simulations in Ad-Lift Measurement Using Spark

January 7, 2017 by Editorial Team Leave a Comment

In this talk from Spark Summit East 2016, Prasad Chalasani explores some of the challenges that arise in setting up scalable simulations in a specific application, and share some solutions and lessons learned along the way, in the realms of mathematics and programming.

Filed Under: Big Data, inside SPARK, Main Feature, Uncategorized, Video Tagged With: Apache Spark

Structuring Apache Spark 2.0: SQL, DataFrames, Datasets And Streaming

December 2, 2016 by Editorial Team Leave a Comment

In the talk below, Michael Armbrust, gives an overview of some of the exciting new API’s available in Spark 2.0, namely Datasets and Structured Streaming. Together, these APIs are bringing the power of Catalyst, Spark SQL’s query optimizer, to all users of Spark.

Filed Under: Big Data, Big Data Software, Databricks, Google News Feed, inside SPARK, Main Feature, News / Analysis, Spark 101, Uncategorized Tagged With: Apache Spark, Weekly Newsletter Articles

IBM Unleashes the Power of Machine Learning with Watson-enabled Data Platform

November 26, 2016 by Editorial Team Leave a Comment

IBM (NYSE:IBM) announced IBM Watson Data Platform to help companies gain more valuable insights from data. The platform delivers the world’s fastest data ingestion engine and cognitive-powered decision-making to data professionals, allowing them to collaborate in the IBM Cloud, with the services they prefer. IBM is also making IBM Watson Machine Learning Service available – making machine learning simple with an intuitive, self-service interface.

Filed Under: Big Data, Featured, Google News Feed, IBM, inside SPARK, Machine Learning, News / Analysis Tagged With: Apache Spark, Machine Learning, Weekly Newsletter Articles

Databricks Sets New World Record for CloudSort Benchmark Using Apache Spark at $1.44 Per Terabyte

November 19, 2016 by Editorial Team Leave a Comment

Databricks®, the company founded by the the team that created the popular Apache® Spark™ project, announced that in collaboration with industry partners, it has broken the world record in the CloudSort Benchmark, a third-party industry benchmarking competition for processing large datasets.

Filed Under: Big Data, Big Data Software, Databricks, Featured, Google News Feed, inside SPARK, News / Analysis, Uncategorized Tagged With: Apache Spark, Weekly Newsletter Articles

Databricks Launches New Edition of Its Spark-Based Cloud Platform for Data Engineers

Impetus Technologies Announces StreamAnalytix 3.0 Featuring Support for Apache Spark-Based Batch Processing

Data as a Critical Element in the Discovery and Delivery of Smart Energy

Pepperdata Integrates Performance into DevOps for Big Data

EnterpriseDB Announces New Apache Spark Connecter to Speed Postgres Big Data Processing

Percipient Launches SparkPLUS to Solve Apache Spark’s Out-of-memory Problems

Monte Carlo Simulations in Ad-Lift Measurement Using Spark

Structuring Apache Spark 2.0: SQL, DataFrames, Datasets And Streaming

IBM Unleashes the Power of Machine Learning with Watson-enabled Data Platform

Databricks Sets New World Record for CloudSort Benchmark Using Apache Spark at $1.44 Per Terabyte

Sponsored Guest Articles

Optimizing Performance and Cost Savings for Elastic on Pure Storage

White Papers

From complexity to clarity: Harnessing the power of AI/ML and risk-informed strategies to streamline clinical data management

Featured RSS Feed

More News from insideHPC