Book Review: R for Everyone – Advanced Analytics and Graphics

I’m frequently asked about educational resources for those entering the data science and machine learning professions. There are plenty of good advanced books, including theoretical masterpieces. “R for Everyone: Advanced Analytics and Graphics” by Jared P. Lander covers the intermediate ground very well.

Talend Releases Free, Easy-to-Use Desktop App for Quickly Preparing Data for Analysis

Talend, a global leader in big data integration software, today introduced Talend Data Preparation, a self-service application that enables business users to simplify and expedite the often laborious and time-consuming process of data wrangling, that is, the data manipulation and analysis tasks that are often performed using spreadsheets.

Key Challenges for Commercial Text Miners

In biomedical research and development, researchers use text mining tools to extract and interpret facts, assertions, and relationships from vast amounts of published information. Mining accelerates the research process, increases discovery of novel findings, and helps companies identify potential safety issues in the drug development process. However, despite the many benefits of text mining, researchers face a number of obstacles before they even get a chance to run queries against the body of biomedical literature.

Top 12 Explanations You’ll Hear in 2016 for Why Big Data Isn’t Paying Off

In this special guest feature, David Fishman, Vice President of Marketing for Arcadia Data, puts a new and interesting twist on the hot topic of big data adoption.

“Above the Trend Line” – Your Industry Rumor Central for 1/25/2016

“Above the Trend Line,” machine learning industry rumor central, is a new recurring feature of insideBIGDATA. In this column, we’ll present a variety of short news items, such as people movements, funding news, rumors and general scuttlebutt floating around the big data, data science and machine learning industries, including behind-the-scenes anecdotes and curious buzz.

Using Excel Versus Using R

To show that the data analysis package R is no scarier than Excel, John Mount of the Win-Vector blog walks through a simple analysis in both Excel and R.

Yahoo Releases the Largest-ever Machine Learning Data Set for Researchers

Yahoo Inc. (NASDAQ: YHOO) announced the public release of the largest-ever machine learning dataset to the academic research community. With this release, the company aims to advance the field of large-scale machine learning and recommender systems, and to help level the playing field between industrial and academic research.

Organizing Data Science Teams For Strong ROI

In this special guest feature, Miles Johnson and Sam Hochgraf of IBB Consulting Group discuss how to build small, highly specialized teams of experts that can work collaboratively to support the data science pipeline.

Impetus Technologies Reveals Plans to Hire 150 Senior Technologists and Data Scientists in the U.S.

Impetus Technologies, a big data thought leader and software, services and solutions company, announced plans to hire 150 big data architects, senior developers and data scientists to support the rapid growth of its business.

The State of Data Science in 2016

More and more people are entering the data science field every day, but how have methodologies changed over the past few years? The infographic below, courtesy of Platfora, provides several key points.