insideBIGDATA Guide to Optimized Storage for AI and Deep Learning Workloads

Artificial Intelligence (AI) and Deep Learning (DL) represent some of the most demanding workloads in modern computing history, as they present unique challenges to compute, storage and network resources. In this technology guide, insideBIGDATA Guide to Optimized Storage for AI and Deep Learning Workloads, we’ll see how traditional file storage technologies and protocols like NFS restrict the flow of data to AI workloads, thus reducing the performance of applications and impeding business innovation. A state-of-the-art AI-enabled data center should work to concurrently and efficiently service the entire spectrum of activities involved in DL workflows, including data ingest, data transformation, training, inference, and model evaluation.

insideBIGDATA Guide to Optimized Storage for AI and Deep Learning Workloads

This new technology guide from DDN shows how optimized storage has a unique opportunity to become not merely a siloed repository for the deluge of data constantly generated in today’s hyper-connected world, but a platform that shares and delivers data to create competitive business value. The intended audience for this technology guide includes enterprise thought leaders (CIOs, director-level IT, etc.), along with data scientists and data engineers who are seeking guidance on infrastructure for AI and DL, particularly specialized hardware. The emphasis of the guide is on “real world” applications, workloads, and present-day challenges.

When Data-Driven Meets Data Silos: Let the Fun Really Begin

In this special guest feature, Ed Thompson, CTO and co-founder at Matillion, believes that on balance, the systems that lead to having many data silos are a good thing; they indicate a business has the autonomy to choose the best systems in each department. This should make the business more efficient overall. However, the business needs data from all these systems.

Interview: Terry Deem and David Liu at Intel

I recently caught up with Terry Deem, Product Marketing Manager for Data Science, Machine Learning and Intel® Distribution for Python, and David Liu, Software Technical Consultant Engineer for the Intel® Distribution for Python, both from Intel, to discuss the Intel® Distribution for Python (IDP): targeted classes of developers, use with commonly used Python packages for data science, benchmark comparisons, the solution’s use in scientific computing, and a look to the future with respect to IDP.

NuoDB 4.0 Expands Cloud-native and Cloud-agnostic Capabilities of Distributed SQL Database

NuoDB, the distributed SQL database company, unveiled NuoDB 4.0, featuring expanded cloud-native and cloud-agnostic capabilities with support for Kubernetes Operators and Google Cloud and Azure public clouds. This includes the recently announced Kubernetes Operator to simplify and automate database deployments in Red Hat OpenShift.

Data Lakes: The Future of Data Warehousing?

In this special guest feature, Adwait Joshi, CEO of DataSeers, sees data lakes as a modern take on big data. When you think of a lake, you cannot define its shape and size, nor can you define what lives in it and how. Lakes just form—even if they are man-made, there is still an element of randomness to them and it’s this randomness that helps us in situations where the future is, well, sort of unpredictable.

AI’s Increasing Role in Data Backups

In this special guest feature, Steve Blow, Technology Evangelist at Zerto, takes a look at four ways in which the combination of machine learning and IT resilience can have a profound impact on the way the technology and IT industry operates.

YugaByte Commits to 100 Percent Open Source with Apache 2.0 License

YugaByte, a leader in open source distributed SQL databases, announced that YugaByte DB is now 100 percent open source under the Apache 2.0 license, bringing previously commercial features into the open source core. The move, in addition to other updates available now through YugaByte DB 1.3, allows users to more openly collaborate across what is now the world’s most powerful open source distributed SQL database.

3 Considerations for Working with Data Gravity

In this special guest feature, Matthew Wallace, Faction Chief Technology Officer, addresses three questions enterprise companies should consider in order to understand data gravity so that they can access and use data efficiently and effectively.

Best of arXiv.org for AI, Machine Learning, and Deep Learning – June 2019

In this recurring monthly feature, we filter the recent research papers appearing on the arXiv.org preprint server for subjects relating to AI, machine learning and deep learning – from disciplines including statistics, mathematics and computer science – and provide you with a useful “best of” list for the month.