Sign up for our newsletter and get the latest big data news and analysis.

Impetus Technologies Unveils New, TensorFlow-Based Deep Learning Feature on Apache Spark for StreamAnalytix

Impetus Technologies, a big data software products and services company, announced integration of a new, deep learning capability for its StreamAnalytix™ platform. Based on the TensorFlow™ open source software library for machine learning, this new capability demonstration showcases an image recognition application running on an Apache Spark Streaming pipeline on StreamAnalytix.

Interview: Patrick Moakley, Director of Marketing for HPC & AI at Lenovo

I recently caught up with Patrick Moakley, Director of Marketing for High Performance Computing (HPC) & Artificial Intelligence (AI) at Lenovo, to get his insights about this fast paced industry. Please view the video interview below to hear what Pat has to say.

Databricks Simplifies and Scales Deep Learning with New Apache Spark Library

Databricks, the company founded by the creators of the popular Apache Spark project, announced Deep Learning Pipelines, a new library to integrate and scale out deep learning in Apache Spark.

Interview: Matt Winkler, Group Program Manager for Machine Learning at Microsoft

In this podcast interview, we caught up with Matt Winkler, Group Program Manager for Machine Learning at Microsoft, to get his take on the upward trajectory of data science, machine learning and the cloud – specifically Azure. Matt leads a team crafting tools and services to enable data scientists and developers to do more with their data. Originally from St. Louis, Matt has been at Microsoft for 11 years working on developer tools and cloud services such as the .NET Framework, Visual Studio, Azure Websites, Data Lake and HDInsight.

Cloudera Launches Altus to Simplify Big Data Workloads in the Cloud

Cloudera, Inc, (NYSE:CLDR) the provider of a leading modern platform for machine learning and advanced analytics, announced the release of Cloudera Altus, a Platform-as-a-Service (PaaS) offering that makes it easier to run large-scale data processing applications on public cloud.

Hadoop, Spark or Both?

In this contributed article, tech writer Blake Davies asks the question: Spark or Hadoop? This question has recently sparked various discussions throughout the online communities. Even though these two work on different principles, they can be applied in a same way for various uses. While Hadoop is a household name in the world of big data processing, Spark is still building a name for itself and it’s doing so with “style”.

Pepperdata® Code Analyzer for Apache Spark Highlights Performance Bottlenecks for Developers

Pepperdata, the DevOps for Big Data company, announced Pepperdata Code Analyzer for Apache Spark, which provides Spark application developers the ability to identify performance issues and connect them to particular blocks of code within an application. Code Analyzer is a new product that follows on the heels of Pepperdata Application Profiler, which provides Hadoop and Spark developers with actionable recommendations for improving job performance.

Field Report: GPU Technology Conference 2017

I was very pleased to attend the GPU Technology Conference 2017 as the guest of host company NVIDIA on May 8-11 in Silicon Valley. This was my second GTC as I became acquainted with the GPU (graphics processing unit) universe last year while attending the conference. This Field Report chronicles what I saw and I’m delighted to share my experience with all of you!

MapR Releases New Ecosystem Pack with Optimized Security and Performance for Apache Spark

MapR Technologies, Inc., the provider of the Converged Data Platform that converges the essential data management and application processing technologies on a single, horizontally scalable platform, announced its next major release of the MapR Ecosystem Pack (MEP) program. MEP is a broad set of open source ecosystem projects that enable big data applications running on the MapR Converged Data Platform with inter-project compatibility.

Databricks Launches New Edition of Its Spark-Based Cloud Platform for Data Engineers

Databricks, the company founded by the creators of the popular Apache Spark project and providers of the leading Spark-based cloud platform for data science, announced an edition of its cloud platform optimized specifically for data engineering workloads called Databricks for Data Engineering.