Sign up for our newsletter and get the latest big data news and analysis.

Book Review: Data Lake Architecture

A new book “Data Lake Architecture – Designing the Data Lake and Avoiding the Garbage Dump” by the father of the data warehouse Bill Inmon is a simple, high-level introduction to this popular data organization. Written for enterprise thought-leaders and decision makers, the book offers a one-stop resource that explains how to build a useful data lake where data scientists and data analysts can solve business challenges and identify new business opportunities. Readers will learn how to structure data lakes as well as analog, application, and text-based data ponds to provide maximum business value.

GridGain Professional Edition 1.6 Release Adds Native Support for Apache® Cassandra™

GridGain Systems, provider of enterprise-grade In-Memory Data Fabric solutions based on Apache® Ignite™, announced the availability of GridGain Professional Edition 1.6, an in-memory computing platform enabling high-performance transactions that run 1,000x faster than disk-based approaches.

Datos IO Introduces RecoverX, Scale-Out Data Protection Software for Cloud Native and Big Data Environments

Datos IO, a provider of next-generation data protection solutions, announced the general availability of RecoverX, scale-out data protection software for third platform applications and distributed and cloud databases.

GridGain Helps e-Therapeutics Find Treatments for Biocomplex Diseases

GridGain Systems, provider of enterprise-grade In-Memory Data Fabric solutions based on Apache® Ignite™, announced that e-Therapeutics plc (LSE ETX), a U.K.-based drug discovery and development group, is using the GridGain In-Memory Data Fabric to run hundreds of thousands of computational analyses in minutes.

It’s Time for Reinventing Data Services

In the contributed article blow, Yaron Haviv, CTO and founder of iguaz.io, observes that during the last decades, the IT industry has used and cultivated the same storage and data management stack. The problem is, everything around those stacks changed from the ground up — including new storage media, distributed computing, NoSQL, and the cloud. Learn about a better way.

A Call to IT Operations – Get out of the Bunker and Lead the IT Innovation Charge

In this contributed article, Russ Elsner, Architect – Office of the of the CTO at ScienceLogic, discusses the challenges of running a modern IT operations infrastructure where innovation is key.

GridGain Announces Support Offering for Apache® Ignite™

GridGain Systems, provider of enterprise-grade In-Memory Data Fabric solutions based on Apache® Ignite™, announced the availability of its Standard Professional Support subscription, which includes a license for the new GridGain In-Memory Data Fabric – Professional Edition 1.5, a fully supported version of Apache Ignite.

The Lambda Architecture Simplified

In this special technology white paper, The Lambda Architecture Simplified, you’ll learn about how the Lambda Architecture aims to satisfy the needs for a robust system that is fault-tolerant, both against hardware failures and human mistakes, being able to serve a wide range of workloads and use cases, and in which low-latency reads and updates are required.

Galactic Exchange Launches Into Big Data Space With 5 Minute Set-Up Spark/Hadoop Powered Clusters

Galactic Exchange, Inc. officially came out of stealth mode this week to announce initial beta availability of ClusterGX™, an open source clustering solution which provides unprecedented simplicity of deployment and management of Spark/Hadoop clusters.

Headed for the Cloud? Watch Out: Legacy Data Integration Can Bring You Down

In this contributed article, Darren Cunningham, Vice President of Marketing at SnapLogic, discusses how most organizations will have at least some portion of their business reliant on some kind of cloud platform. But whether an organization has its various components completely in the cloud or on a mixture of cloud and on-prem platforms, there’s still going to be the issue of how to organize and integrate all the data pulled from each source.