The inside Spark channel is a resource for professionals looking to learn about the benefits of Apache Spark

ODPi Publishes Operations Specification Providing Developers Consistency Across Application Management Tools

December 18, 2016 by Editorial Team Leave a Comment

ODPi, a nonprofit organization accelerating the open ecosystem of big data solutions, announced the availability of ODPi 2.0, which includes the first release of the ODPi Operations Specification and the Runtime Specification 2.0, to standardize the development model for big data solution and application providers and help enterprises improve installation and management of Hadoop-based applications.

Filed Under: Big Data, Hadoop, inside Hadoop, inside SPARK, Main Feature, News / Analysis, Uncategorized Tagged With: Hadoop, Weekly Newsletter Articles

The Touchy-Feely Side of Spark

December 2, 2016 by Daniel Gutierrez Leave a Comment

In this special guest feature, Alex Bordei, head of product management at Bigstep, offers 5 examples of how Apache Spark has maximized its user experience – its feel.

Filed Under: Big Data, Big Data Software, Google News Feed, Industry Perspectives, inside SPARK, News / Analysis, Opinion, Uncategorized

Structuring Apache Spark 2.0: SQL, DataFrames, Datasets And Streaming

December 2, 2016 by Editorial Team Leave a Comment

In the talk below, Michael Armbrust, gives an overview of some of the exciting new API’s available in Spark 2.0, namely Datasets and Structured Streaming. Together, these APIs are bringing the power of Catalyst, Spark SQL’s query optimizer, to all users of Spark.

Filed Under: Big Data, Big Data Software, Databricks, Google News Feed, inside SPARK, Main Feature, News / Analysis, Spark 101, Uncategorized Tagged With: Apache Spark, Weekly Newsletter Articles

IBM Unleashes the Power of Machine Learning with Watson-enabled Data Platform

November 26, 2016 by Editorial Team Leave a Comment

IBM (NYSE:IBM) announced IBM Watson Data Platform to help companies gain more valuable insights from data. The platform delivers the world’s fastest data ingestion engine and cognitive-powered decision-making to data professionals, allowing them to collaborate in the IBM Cloud, with the services they prefer. IBM is also making IBM Watson Machine Learning Service available – making machine learning simple with an intuitive, self-service interface.

Filed Under: Big Data, Featured, Google News Feed, IBM, inside SPARK, Machine Learning, News / Analysis Tagged With: Apache Spark, Machine Learning, Weekly Newsletter Articles

Databricks Sets New World Record for CloudSort Benchmark Using Apache Spark at $1.44 Per Terabyte

November 19, 2016 by Editorial Team Leave a Comment

Databricks®, the company founded by the the team that created the popular Apache® Spark™ project, announced that in collaboration with industry partners, it has broken the world record in the CloudSort Benchmark, a third-party industry benchmarking competition for processing large datasets.

Filed Under: Big Data, Big Data Software, Databricks, Featured, Google News Feed, inside SPARK, News / Analysis, Uncategorized Tagged With: Apache Spark, Weekly Newsletter Articles

Apache Spark Survey Reveals Increased Growth in Users and New Workloads Including Exploratory Data Science and Machine Learning

November 8, 2016 by Editorial Team 1 Comment

In order to better understand Apache Spark’s growing role in big data, Taneja Group conducted a major market research project, surveying approximately 7,000 people. The sample was made up of technical and managerial job roles from around the world directly involved in big data.

Filed Under: Big Data, Big Data Software, Cloudera, Featured, Google News Feed, inside SPARK, News / Analysis, Research / Reports, Uncategorized Tagged With: Apache Spark, Weekly Newsletter Articles

Splice Machine Announces Native PL/SQL Support to Accelerate Migrations from Oracle to Hadoop

November 3, 2016 by Editorial Team Leave a Comment

Splice Machine, provider of the open-source SQL RDBMS powered by Hadoop and Spark, announced that it now supports native PL/SQL on Splice Machine.

Filed Under: Big Data, Featured, Google News Feed, inside Hadoop, inside SPARK, News / Analysis, Uncategorized Tagged With: Apache Spark, Hadoop, Weekly Newsletter Articles

Bigstep Launches High-Performance, Low-Latency Spark-as-a-Service for Real-Time Streaming Applications

October 28, 2016 by Editorial Team Leave a Comment

Bigstep, the big data cloud provider, today launched a bare-metal Spark-as-a-Service offering.

Filed Under: Big Data, Big Data Services, Big Data Software, Cloud, Featured, Google News Feed, inside SPARK, News / Analysis Tagged With: Apache Spark, Weekly Newsletter Articles

Databricks Adds Deep Learning Support to Cloud-Based Apache Spark Platform

October 27, 2016 by Daniel Gutierrez Leave a Comment

Databricks®, the company founded by the creators of the Apache® Spark™ project, today announced the addition of deep learning support to its cloud-based Apache Spark platform.

Filed Under: Big Data, Big Data Software, Databricks, Featured, Google News Feed, inside SPARK, News / Analysis, Uncategorized Tagged With: Apache Spark, Deep Learning, Weekly Newsletter Articles

Distributed System Architectures for Healthcare and Life Sciences

October 27, 2016 by Editorial Team Leave a Comment

The insideBIGDATA Guide to Healthcare & Life Sciences is a useful new resource directed toward enterprise thought leaders who wish to gain strategic insights into this exciting new area of technology. This segment focuses on the use of distributed system architectures – Hadoop and Spark.

Filed Under: Big Data, Big Data Software, Google News Feed, Hadoop, Healthcare, inside Hadoop, inside SPARK, Life Sciences, Main Feature, News / Analysis, Uncategorized, Use Cases, White Papers Tagged With: Apache Spark, Hadoop, Weekly Featured Newsletter Post

Optimizing Performance and Cost Savings for Elastic on Pure Storage
[SPONSORED POST] Organizations can now confidently embrace Elastic, enhance their hot tier storage, and seamlessly manage historical data with cost-efficient capacity-optimized storage. Pure Storage not only meets the demands of the modern data landscape but also empowers organizations to simplify their Elastic architecture, reflecting the industry trend towards a more streamlined and efficient approach.

From complexity to clarity: Harnessing the power of AI/ML and risk-informed strategies to streamline clinical data management

In today’s fast-paced world, driven by demands for speed and efficiency, the field of clinical development has undergone a remarkable transformation. The way trials are being conducted has changed significantly with decentralized clinical trials (DCT) becoming mainstream and the collection of clinical data from wearables and other remote-monitoring devices becoming common practice. While these advances […]

Download