Building a Successful Predictive Analytics Program

In this special guest feature, Jane Hendricks, WW Portfolio Marketing Lead at IBM Predictive Analytics, describes a methodology for realizing business value from predictive analytics: start by understanding the business, then the data that is available and obtainable, and then develop and apply models while considering how they can be put into practice. The article includes three short case studies that illustrate the successful application of these principles.

Data Science 101: Clustering Approaches & Techniques

The presentation below by Derek Kane provides an overview of clustering techniques, including K-Means, Hierarchical Clustering, and Gaussian Mixture Models.

GridGain Announces Support Offering for Apache® Ignite™

GridGain Systems, provider of enterprise-grade In-Memory Data Fabric solutions based on Apache® Ignite™, announced the availability of its Standard Professional Support subscription, which includes a license for the new GridGain In-Memory Data Fabric – Professional Edition 1.5, a fully supported version of Apache Ignite.

Want to Get More Out of Hadoop? Here Are 5 Ways

In this special guest feature, Ashley Stirrup, CMO at Talend, provides a useful list of five ways to get more out of Hadoop as organizations increasingly look to speed time to market, anticipate and respond to customers’ needs, and introduce new products and services.

Book Review: Why – A Guide to Finding and Using Causes

A new book, “Why: A Guide to Finding and Using Causes,” by Stevens Institute of Technology assistant professor of computer science Samantha Kleinberg, is a necessary addition to any data scientist’s bookshelf, as it helps bring focus to the dreaded “correlation does not imply causation” conundrum that affects our understanding of data-centric problems.

Unleash the Power of Data with Dataguise DgSecure for Amazon Web Services

Dataguise, a technology leader in secure business execution, announced Dataguise DgSecure for the detection, protection and monitoring of sensitive data across Amazon Simple Storage Service (Amazon S3) via the Amazon Elastic MapReduce (Amazon EMR) platform.

Nalanda Technology Unleashes the Power of Precision Data Search

Nalanda Technology, established in 2013, has developed a next-generation precision search and discovery platform. Its solutions enable users to carry out a detailed search of their data regardless of where it is stored.

Scaling Home Energy Simulation to New Markets

In this special guest feature, Mark Gately, Senior Manager, Decision Science at Tendril, describes a future when homeowners can use data analytics to accurately model energy consumption and corresponding costs.

Yandex Data Factory Launches Automatic Image Moderation for Better Control of User-Generated Content

Machine learning and data analytics experts Yandex Data Factory today announced the launch of Automatic Image Moderation – a new service that uses machine learning and computer vision to automate image analysis and classification.

AI System Predicts 85 Percent of Cyberattacks Using Input from Human Experts

In a new paper, researchers from MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL) and the machine-learning startup PatternEx demonstrate an artificial intelligence platform called “AI2” that predicts cyberattacks significantly better than existing systems by continuously incorporating input from human experts.