I’m frequently asked about educational resources for those entering the data science and machine learning professions. There are plenty of good advanced books, including some theoretical masterpieces. The book “R for Everyone: Advanced Analytics and Graphics” by Jared P. Lander covers the intermediate ground very well.
Talend, a global leader in big data integration software, today introduced Talend Data Preparation, a self-service application that enables business users to simplify and expedite the often laborious and time-consuming process of data wrangling, that is, the data manipulation and analysis tasks that are often performed using spreadsheets.
In biomedical research and development, researchers use text mining tools to extract and interpret facts, assertions, and relationships from vast amounts of published information. Text mining accelerates the research process, increases discovery of novel findings, and helps companies identify potential safety issues in the drug development process. However, despite these benefits, researchers face a number of obstacles before they even get a chance to run queries against the body of biomedical literature.
In this special guest feature, David Fishman, Vice President of Marketing for Arcadia Data, puts a new and interesting twist on the hot topic of big data adoption.
Above the Trend Line: Machine Learning Industry Rumor Central is a new recurring feature of insideBIGDATA. In this column, we’ll present a variety of short news items such as people movements, funding news, rumors, and general scuttlebutt floating around the big data, data science, and machine learning industries, including behind-the-scenes anecdotes and curious buzz.
To show that the data analysis package R is no scarier than Excel, John Mount of the Win-Vector blog walks through a simple analysis in both Excel and R.
Yahoo Inc. (NASDAQ: YHOO) announced the public release of the largest-ever machine learning dataset to the academic research community. With this release, the company aims to advance the field of large-scale machine learning and recommender systems, and to help level the playing field between industrial and academic research.
In this special guest feature, Miles Johnson and Sam Hochgraf of IBB Consulting Group discuss how to build small, highly specialized teams of experts that can work collaboratively to support the data science pipeline.
Impetus Technologies, a big data thought leader and software, services, and solutions company, announced plans to hire 150 big data architects, senior developers, and data scientists to support the rapid growth of its business.