Sign up for our newsletter and get the latest big data news and analysis.

Actian Survey Shows Legacy Data Management Platforms Breaking Under the Demands of Modern Analytic Workloads and Data Volumes


The ability of legacy data warehousing platforms and BI tools to glean insights from big data is decreasing exponentially as more sources of data are poured into them, according to a new survey involving more than 250 senior executives. The survey, sponsored by Actian Corporation, found analytic workloads are failing in traditional technology environments that are pervasive across enterprise networks.

Physician Payments Transparency Data – a Powerful Source of Business Intel


Published spend transparency data requirements were introduced via the Open Payments program, as mandated in the Affordable Care Act[1], to raise awareness of the financial influence that drug and device manufacturers have on the so-called covered recipients (physicians and a defined set of teaching hospitals).

WPI Receives U.S. Dept. of Education Funding to Address Shortage in Big Data Computing Professionals


Addressing a critical need for enhancing the nation’s capacity for computer science research and teaching, the U.S. Department of Education has granted Worcester Polytechnic Institute (WPI) $885,834 through its Graduate Assistance in Areas of National Need (GAANN) program. The funding will provide six needs-based fellowships for computer science PhD students who will study big data computing.

Data-Centric Security – An Effective Line of Defense against Data Breaches

Eric Tilenius - BlueTalon

In this special guest feature, Eric Tilenius, CEO of BlueTalon, highlights the four main reasons why companies should adopt data-centric security.

Looking at Spark from a Hadoop Lens


This article is the third in a series that explores a high-level view of how and why many companies are deploying Apache Spark as a solution for their big data technology requirements.

IBM’s Machine Learning Technology Accepted as Apache Open Source Project


IBM (NYSE: IBM) today announced that its machine learning technology –SystemML –has been accepted as a project by the Apache Incubator open source project. Originally developed by IBM Research, and now used in IBM’s BigInsights data analytics platform, SystemML is a machine learning algorithm translator.

Spark MLlib: Making Practical Machine Learning Easy and Scalable


In this talk, Xiangrui Meng of Databricks shares his experience in developing MLlib. The talk covers both higher-level APIs, ML pipelines, that make MLlib easy to use, as well as lower-level optimizations that make MLlib scale to massive data sets.

Why Data Quality without Data Integrity is No Match for Today’s Business Demands

Bobby Koritala

In this special guest feature, Bobby Koritala, Chief Product Officer of Infogix, discusses data management best practices and why data quality without data integrity is no match for today’s business demands.

Book Review: Doing Math with Python


When one of my favorite independent tech book publishers, No Starch Press, notified me about their new title “Doing Math with Python,” I was energized to review what potentially could be a good new resource for budding data scientists.

OpsDataStore Unveils Solution to Ensure Online Service Quality for Dynamic IT Environments


OpsDataStore, the company delivering a solution to improve online service quality across heterogeneous and rapidly changing applications and IT infrastructures, announced the general availability of OpsDataStore 1.0 — the industry’s first big data back end for all IT management data.