From Data Lake Curator to Data Lake Master: Best Practices for Making the Most of Your Data Lake

I recently caught up with Jennifer Cheplick, Sr. Director of Product Marketing at Syncsort, to discuss how the concept of a data lake is now commonplace, as businesses realize the value of having a single repository to house all enterprise data in its original, unaltered format. While this is a vital step in mastering a single view of all data assets, left unchecked, it can turn into a data swamp.

Data as a Critical Element in the Discovery and Delivery of Smart Energy

In this contributed article, Jules S. Damji, an Apache Spark Community Evangelist with Databricks, shows how, as the value of data continues to grow, the next-generation smart grid should become a reality, benefiting utility companies and consumers alike.

Harte Hanks Brings Next Generation Data and Analytics to Marketers with Opera Solutions’ Signal Hub Platform

Harte Hanks (NYSE: HHS), a leader in customer relationships, experiences, and interaction-led marketing, and Opera Solutions, a leader in Data Analytics, announced the market availability of their data and analytic solution delivered through Opera Solutions’ Artificial Intelligence (AI) and Machine Learning platform, Signal Hub™.

Interview: Anusua Trivedi, Data Scientist on Microsoft’s Advanced Data Science & Strategic Initiatives Team

In this podcast interview, I caught up with Anusua Trivedi, a Data Scientist on Microsoft’s ADS team, to get her take on the upward trajectory of AI and deep learning that we’re seeing in the industry today.

Data-Driven Healthcare: A Proactive Revolution

In this special guest feature, Richard Proctor, GM of Global Healthcare at Hortonworks, discusses how big data is enabling key healthcare organizations including UNOS, MD Anderson and Arizona State University to manage chronic diseases, improve overall member health, reduce costs, and manage clinical and financial risk.

Arcadia Data Accelerates Era of Data-Native Applications with Visual Analytics

Arcadia Data, provider of visual analytics software that solves the most complex big data problems, announced the launch of Arcadia Enterprise 4.0. The platform enhancements enable enterprises to build, brand, share and embed data-centric applications, ultimately making Apache Hadoop and cloud-based data lakes more accessible and valuable to all users within and outside an organization.

Interview: Bernd Harzog, CEO and Founder of OpsDataStore

I recently caught up with Bernd Harzog, CEO and Founder of OpsDataStore, to discuss why data-driven IT operations are a prevailing theme in big data management. Bernd is responsible for the strategy, execution and financing activities of the company. He founded OpsDataStore because every customer he spoke to still had horrible service quality and capacity utilization problems despite a massive investment in either purchased or homegrown tools.

Pepperdata Integrates Performance into DevOps for Big Data

Pepperdata, the Big Data performance company, announced it is expanding its product portfolio with Pepperdata Application Profiler, providing Hadoop and Spark developers with easy-to-understand recommendations for improving job performance. Application Profiler is currently available in early access and will be generally available in the second quarter of 2017.

BlueData Announces Bare-Metal Performance for Hadoop on Docker Containers

BlueData®, provider of the leading Big-Data-as-a-Service (BDaaS) software platform, announced breakthrough performance results. The results from a new Intel® benchmarking study show comparable performance for Hadoop when running in a bare-metal environment or in a containerized environment using the BlueData EPIC™ software platform.

Cloudera to Accelerate Data Science and Machine Learning for the Enterprise with New Data Science Workbench

Cloudera, the provider of a leading platform for machine learning and advanced analytics built on the latest open source technologies, today unveiled Cloudera Data Science Workbench, a new self-service tool for data science on Cloudera Enterprise, which is currently in beta.