Sign up for our newsletter and get the latest big data news and analysis.

Healthy Hives: Cloud Analytics Helps Save the World’s Bee Population

In this machine learning cast study, we describe how cloud analytics technology is being applied to the Global Hive Network, an initiative to collect billions of individual data points from around the world and analyze them to understand the honeybee population’s overall health and its relationship with environments, weather patterns, forage, diseases, parasites, predator species, and pesticides.

How Astera Labs is Revolutionizing Semiconductor Product Development—100% in the Cloud

For any established semiconductor product developer, designing a next-generation PCIe 5.0 chipset in less than a year is no small feat. For a brand-new startup with no compute infrastructure other than laptops, however, it is a huge ask. That’s why, with time being of the essence, Astera Labs decided to take a chance on the efficiencies it would gain from a 100% cloud-based approach.

Interview: Nancy Duarte, Author and CEO of Duarte, Inc.

I recently caught up with Nancy Duarte, CEO of Duarte, Inc. and author of DataStory: Explain Data and Inspire Action Through Story to discuss her views on a topic that’s critical to successful data science projects – data storytelling. Nancy has contributed her expertise to MIT and Forbes and is a regular contributor to Harvard Business Review and Linkedin’s Influencer Program and can be heard on Lewis Howes, Art of Charm and Entrepreneur on Fire. She has an engaged following of 235,000+.

DAOS Delivers Exascale Performance Using HPC Storage So Fast It Requires New Units of Measurement

Forget what you previously knew about high-performance storage and file systems. New I/O models for HPC such as Distributed Asynchronous Object Storage (DAOS) have been architected from the ground up to make use of new NVM technologies such as Intel® Optane™ DC Persistent Memory Modules (Intel Optane DCPMMs). With latencies measured in nanoseconds and bandwidth measured in tens of GB/s, new storage devices such as Intel DCPMMs redefine the measures used to describe high-performance nonvolatile storage.

StreamSets Launches StreamSets Transformer

StreamSets, Inc., provider of the DataOps platform for modern data integration, released StreamSets® Transformer, a simple-to-use, drag-and-drop UI tool to create native Apache Spark applications. Designed for a wide range of users — even those without specialized skills — StreamSets Transformer enables the creation of pipelines for performing ETL, stream processing and machine-learning operations. Now, data engineers, scientists, architects and operators gain deep visibility into the execution of Apache Spark while broadening usage across the business.

Interview: KDD2019 Co-General Chairs Ankur Teredesai & Vipin Kumar

During my trip to KDD2019 in August, I had the pleasure of sitting down to chat with the co-chairs of the conference, Ankur Teredesai and Vipin Kumar. In the interview that follows, we discuss the growth of the KDD conference over the years, and also it’s changing focus. KDD is touted as being “the premier interdisciplinary conference bringing together researchers and practitioners from data science, data mining, knowledge discovery, large-scale data analytics, and big data.”

Interview: Terry Deem and David Liu at Intel

I recently caught up with Terry Deem, Product Marketing Manager for Data Science, Machine Learning and Intel® Distribution for Python, and David Liu, Software Technical Consultant Engineer for the Intel® Distribution for Python*, both from Intel, to discuss the Intel® Distribution for Python (IDP): targeted classes of developers, use with commonly used Python packages for data science, benchmark comparisons, the solution’s use in scientific computing, and a look to the future with respect to IPD.

Develop Multiplatform Computer Vision Solutions with Intel® Distribution of OpenVINO™ Toolkit

Realize your computer vision deployment needs on Intel® platforms—from smart cameras and video surveillance to robotics, transportation, and much more. The Intel® Distribution of OpenVINO™ Toolkit (includes the Intel® Deep Learning Deployment Toolkit) allows for the development of deep learning inference solutions for multiple platforms.

Field Report: KDD 2019

As a very long time member of the ACM and their SIGKDD group, I’d always wanted to attend a KDD conference (first one occurred in 1995). This year I received a gracious invitation to attend KDD2019 in Anchorage, Alaska, August 4-8. It satisfied two of my bucket list items: witnessing a KDD first-hand and also […]

The AI Opportunity

The tremendous growth in compute power and explosion of data is leading every industry to seek AI-based solutions. In this Tech.Decoded video, “The AI Opportunity – Episode 1: The Compute Power Difference,” Vice President of Intel Architecture and AI expert Wei Li shares his views on the opportunities and challenges in AI for software developers, how Intel is supporting their efforts, and where we’re heading next.