How to Overcome Obstacles in Data Lake and Warehouse Strategies: 3 Best Practices for Enterprise Architects

In this special guest feature, Kimberly Read, enterprise architect at Faction, suggests that enterprise architects can strengthen the business case for multi-cloud by addressing three primary considerations. Multi-cloud initiatives, which draw on services from public and private clouds, can help organizations stay ahead of the curve.

Interview: Prat Moghe, CEO of Cazena

I recently caught up with Prat Moghe, CEO of cloud data lake leader Cazena, to get his take on why getting cloud data lakes off the ground continues to be a major frustration for enterprises. We’re seeing such deployments take at least six months and cost millions of dollars in annual spend for in-house development and management. There’s got to be a better way. Gartner has estimated the failure rate of big data projects to be as high as 80%. What can be done about companies that stubbornly hang on to legacy data strategies, using analytics/BI approaches that put them ever further behind competitors who are modernizing their data stacks with AI, ML, and more? In this interview, we’ll get some valuable perspectives you can follow to accelerate your time-to-analytics.

Protecting Your Data Lake Requires a New Mindset

In this contributed article, technologist Bernard Brode discusses how protecting your data lake requires a new mindset. Some have questioned whether they actually need a data lake. While the answer is often yes, this doesn’t mean you’re doomed to suffer successful cyber attacks, as long as you are not relying on hope as a strategy.

Open Source Innovations to Be Unveiled at Subsurface LIVE Winter 2021 Cloud Data Lake Conference

Dremio, the innovation leader in data lake transformation, announces the speaker lineup and full agenda for Subsurface LIVE Winter 2021, a two-day, live conference about the future of the cloud data lake industry. The virtual event takes place January 27-28, 2021, and features keynotes from senior executives of AWS, Tableau and Dremio, as well as 30+ technical sessions on the open source innovations, trends and strategies driving cloud data lake transformation and architectures.

Databricks Launches SQL Analytics to Enable Cloud Data Warehousing on Data Lakes

Databricks, the data and AI company, announced the launch of SQL Analytics, which for the first time enables data analysts to run workloads on a data lake that were previously meant only for a data warehouse. This expands the traditional scope of the data lake from data science and machine learning to include all data workloads, including Business Intelligence (BI) and SQL.

The Value of Data Now vs. Data Later

In this contributed article, Fluency CEO and Founder Chris Jordan discusses the inevitable extinction of Moore’s law. 90% of the world’s data has been produced over the last two years, yet companies only analyze 12% of it. With Big Data only continuing to grow, how can more innovative data storage solutions, such as the cloud, effectively respond to this level of growth?

Real-Time Analytics from Your Data Lake: Teaching the Elephant to Dance

This whitepaper from Imply Data Inc. explains why delivering real-time analytics on a data lake is so hard, approaches companies have taken to accelerate their data lakes, and how they leveraged the same technology to create end-to-end real-time analytics architectures.

Real-Time Analytics from Your Data Lake: Teaching the Elephant to Dance

This whitepaper from Imply Data Inc. introduces Apache Druid and explains why delivering real-time analytics on a data lake is so hard, approaches companies have taken to accelerate their data lakes, and how they leveraged the same technology to create end-to-end real-time analytics architectures.

Introducing Apache Druid

Sponsored Post: Apache Druid was invented to address the lack of a data store optimized for real-time analytics. Druid combines the best of real-time streaming analytics and multidimensional OLAP with the scale-out storage and computing principles of Hadoop to deliver ad hoc, search and time-based analytics against live data with sub-second end-to-end response times. Today, […]

Introducing Apache Druid

This whitepaper provides an introduction to Apache Druid, including its evolution, core architecture and features, and common use cases. Founded by the authors of the Apache Druid database, Imply provides a cloud-native solution that delivers real-time ingestion, interactive ad-hoc queries, and intuitive visualizations for many types of event-driven and streaming data flows.