Sign up for our newsletter and get the latest big data news and analysis.

DataStax Acquires Machine Learning Company Kaskada to Unlock Real-Time AI

DataStax, the real-time AI company, announced it has acquired Kaskada, a machine learning (ML) company that first solved managing, storing and accessing time-based data to train behavioral ML models and deliver the instant, actionable insights that fuel artificial intelligence (AI). Both DataStax and Kaskada have a track record of contributing to open source communities. Datastax will open source the core Kaskada technology initially, and it plans to offer a new machine learning cloud service later this year.

Snowflake vs. Databricks – Who has the Edge?

Fivetran recently unveiled the results of a new data warehouse benchmark report that revealed just how close the competition is among five of the most popular data warehouses. The report is the passion project of Fivetran CEO and data management expert George Fraser, who has a front row seat in the cloud data warehouse race. The report explains the cost vs. performance tradeoffs of each one of the warehouses, the ins and outs of the modern data stack, and provides a perspective on how it’s all going to shake out.

How to Ensure an Effective Data Pipeline Process

In this contributed article, Rajkumar Sen, Founder and CTO at Arcion, discusses how the business data in a modern enterprise is spread across various platforms and formats. Data could belong to an operational database, cloud warehouses, data lakes and lakehouses, or even external public sources. Data pipelines connecting this variety of sources need to establish some best practices so that the data consumers get high-quality data delivered to where the data apps are being built.

eBook: Unlock Complex and Streaming Data with Declarative Data Pipelines 

Our friend, Ori Rafael, CEO of Upsolver and advocate for engineers everywhere, released his new book “Unlock Complex and Streaming Data with Declarative Data Pipelines.” Ori discusses why declarative pipelines are necessary for data-driven businesses and how they help with engineering productivity, and the ability for businesses to unlock more potential from their raw data. Data pipelines are essential to unleashing the potential of data and can successfully pull from multiple sources.

The Right Way to Get Started with PostgreSQL

In this contributed article, Igor Levshin, Director of Content of Postgres Professional, suggests that as with all database systems, anyone just starting to learn about PostgreSQL can benefit from a clear, incremental approach to developing a strong skillset. This article outlines such an approach, which is also developed in far more detail – including step- by-step instructions and code samples – in “Postgres. The First Experience,” a free, downloadable book by Pavel Luzanov, Egor Rogov, and Igor Levshin.

Instaclustr Including PostgreSQL in Managed Data Platform – Now in Public Preview

Instaclustr, delivering reliability at scale through its fully managed platform for open source data technologies, announced the addition of PostgreSQL to its Managed Platform, now available in public preview for Instaclustr customers. Managed PostgreSQL offers complete database management and optimization, along with comprehensive support and monitoring backed by Instaclustr’s team of PostgreSQL experts.

SingleStore Research Highlights Spike in Data Demands Amid COVID-19 Pandemic

Many aspects of life and work stopped or slowed down significantly during the pandemic. But new research from SingleStore, the unified database for fast analytics, indicates that data requirements in the age of COVID-19 have been greater than ever. This research is based on a 500-person survey of IT professionals that Propeller Insights conducted in January 2021 on behalf of SingleStore.

Data Evolution in the Cloud: The lynchpin of competitive advantage

This report from our friends over at Snowflake reveals the extent to which the data sharing economy is powering business growth and how organizations are leveraging data from a range of sources to drive innovation, create better customer experiences, and meet regulatory requirements.

NoSQL vs SQL: Key Differences

In this contributed article, Alex Williams, Writer/Researcher at Hosting Data UK, indicates that NoSQL and SQL databases greatly differ on many points. One is not better than the other, but just like any technology, eventually, developers will have their preferences. Luckily, there are numerous options for database selection for both SQL and NoSQL databases.

The 6 Types of Data Everybody Should Know to Avoid Confusion

Everybody tosses the word “data,” but few actually know what it actually means and does. MountainTop Data CEO Sky Cassidy explains the 6 different kinds of data everyone should know something about in order to avoid confusion.