Sign up for our newsletter and get the latest big data news and analysis.

Data Science 101: An Introduction to scikit-learn – Machine Learning in Python

datascientist2_featured

The tutorial presentation below offers an introduction to the scikit-learn package and to the central concepts of Machine Learning.

Book Review: The Manga Guide to Linear Algebra

Manga_Linear_Algebra_feature

I was happy to receive a review copy of book employing a very unique approach for teaching mathematics, “The Manga Guide to Linear Algebra,” published by No Starch Press. This is a comic book, perfect for new data scientists! The book is great for newbies because it clearly spells out each minute step in performing calculations involving vectors, matrices, determinants, linear transformations, kernels, eigenvalues and eigenvectors.

Hadoop 101: Machine Learning in Big Data – Look Forward or Be Left Behind

hadoop-101

In the Hadoop Summit 2015 presentation below, Bill Porto, senior analytics engineer at RedPoint Global, will discuss why continual, adaptive optimization is key to maintaining a leadership position in the market.

Insilico Medicine to Utilize Deep Learning for Drug Repurposing and Discovery in Cancer and Age-Related Disease

Insilico

Insilico Medicine, a bioinformatics company dedicated to drug discovery for cancer and aging, has launched its proprietary DeepPharma (TM) platform. DeepPharma utilizes the latest advances in deep learning to improve computer analysis of massive structured multi-omics data banks and millions of tissue-specific pathway activation profiles.

DataSift Makes Machine Learning Accessible to All with VEDO Intent

datasift-logo

DataSift announced the launch of VEDO Intent, a new technology that significantly simplifies machine learning, helping developers to place it into the hands of people across the business.

Linux Foundation Announces R Consortium to Support Millions of Users Around the World

rconsort_logo

The Linux Foundation, the nonprofit organization dedicated to accelerating the growth of Linux and collaborative development, announced the R Consortium. This new organization will strengthen both the technical and user communities as a Collaborative Projects hosted at Linux Foundation.

NVIDIA Doubles Performance for Deep Learning Training

Nvidia_logo

NVIDIA announced updates to its GPU-accelerated deep learning software that will double deep learning training performance. The new software will empower data scientists and researchers to supercharge their deep learning projects and product development work.

Strategic Big Data Pivot Leads Webtrends to Success

Peter_Crossley_Webtrends

The interview that follows is with Peter Crossley, Director of Product Architecture at Webtrends to discuss his company’s data platform centered around Hadoop and Spark.

Big Data Technology for Scientific Research

BigData_science

This article is the third in an editorial series with a goal to provide a road map for scientific researchers wishing to capitalize on the rapid growth of big data technology for collecting, transforming, analyzing, and visualizing large scientific data sets.

At TrueCar It’s Time to Invest and Be Innovative with Big Data

Russell_Foltz-Smith

At the recent Hadoop Summit 2015 in San Jose, I had the opportunity to sit down with Russell Foltz-Smith, VP of Data Platform with TrueCar, Inc. to discuss his company’s use of data in general and Hadoop specifically.