Sign up for our newsletter and get the latest big data news and analysis.

Ask a Data Scientist: Ensemble Methods

datascientist2_featured

Welcome back to our series of articles sponsored by Intel – “Ask a Data Scientist.” This week’s question is from a reader who asks about ensemble methods and how you use them.

The Hidden Costs of Open Source

big-data-pic

While Linux clusters dominate HPC, there are many issues related to cost and complexity that can make open-source solutions challenging. In addition, determining real costs can be complex because every environment is different, and organizations will assess costs using their own methodologies and based on their own requirements and capabilities.

What’s Hot & What’s Not in Data Science 2015

Big-Data-Trends-2015

In addition to analyzing data, our friends over at CrowdFlower — a people-powered data enrichment platform — has an affinity for lists. This year, they asked their team of data scientists to come up with what they think will be “hot” or “not” in their world for 2015.

New Research Shows Businesses are Investing Heavily in Big Data Analytics

BigData_2015

Big data is more than a buzzword, as proven by how fast organizations are adopting new analytics technologies to obtain business value from it. That is the key takeaway from a Luth Research survey of large organizations currently using big data analytics software or planning to use it in the next 12 months.

FoundationDB Extends Performance & Scalability with Version 3.0

foundationdb

FoundationDB®, the company behind database software that combines scalability and consistency, has announced version 3.0 of its flagship product, the FoundationDB Key-Value Store. Version 3.0 offers enhanced performance and monitoring capabilities while maintaining transactional integrity, scalability and fault tolerance for operational workloads in the cloud or on premise.

Ask a Data Scientist: Confounding Variables

datascientist2_featured

Welcome back to our series of articles sponsored by Intel – “Ask a Data Scientist.” This week’s question is from a reader who asks for an explanation of confounding variables and why they’re important in data science projects.

Top 5 Challenges for Hadoop MapReduce in the Enterprise

big-data-pic

Now that MapReduce is becoming accepted as a ‘working’ model, the next goal is to turn it into an enterprise-class solution. IBM has written a technical white paper for overcoming common challenges when deploying Hadoop MapReduce – “Top 5 Challenges for Hadoop MapReduce in the Enterprise.”

Posiba – Big Data for Greater Social Impact in Giving

Posiba_logo

The landscape of charitable giving is about to receive a makeover with the launch of Posiba. Poised to revolutionize the way giving improves the world by bringing people and information together for greater impact, Posiba is a big data and analytics information service supporting foundations, governments, charities and donors using the power of aggregated information.

Ask a Data Scientist: Storytelling With Data

datascientist2_featured

Welcome back to our series of articles sponsored by Intel – “Ask a Data Scientist.” This week’s question is from a reader who asks about the importance of “storytelling” in data science.

Deploying a Big Data Solution Using IBM GPFS-FPO

big-data-pic

Download this whitepaper today to learn best practices for deploying GPFS-FPO as a file system platform for big data analytics. The goal of this paper is to guide the administrator through various decision points to ensure optimal configuration based on the Hadoop application components being deployed.