Sign up for our newsletter and get the latest big data news and analysis.

Linguistic Tools Usage in Managing Corporate Data

In this special guest feature, Yana Yelina from EffectiveSoft, discusses the problems and potential solutions in enterprises to address a rising volume of documents and the ability to effectively utilize data stored in document silos.

Benefits of Having a Data Scientist Career

With the continued upward trajectory of interest in getting on board with a data science career, our friends over at Simplilearn Solutions put together the compelling infographic below.

Book Review: The Manga Guide to Regression Analysis

Last year, I wrote a review of a useful book that got students up to speed with a key mathematical ingredient of machine learning – linear algebra: The Manga Guide to Linear Algebra. No Starch Press (an excellent source of technical books) just came out with a follow-up title: The Manga Guide to Regression Analysis.

DataScience Inc. Unveils the DataScience Cloud, a Platform That Makes Data Scientists Central to Every Business Function

DataScience, Inc., announced the DataScience Cloud, a platform that enables data scientists to explore varied data sources, build models and algorithms, and seamlessly deploy work throughout their entire organization, regardless of their tech stack and level of engineering support.

Guide to Data Science Interviews

Our friends over at Springboard just released a compelling new infographic that highlights some salary metrics with different data professional roles.

Book Review: The Book of R by Tilman Davies

A fantastic new book just landed on my desk, “The Book of R: A First Course in Programming and Statistics” by Tilman M. Davies from No Starch Press. I’ve been looking for a book like this for some time – to use with the introductory data science and machine learning course I teach.

How Viacom Built a Just-in-Time Data Warehouse

In the video presentation below from Spark Summit East 2016 conference, Viacom, the global media company, explains how they are using Apache Spark and Databricks to quickly adapt to their audience by building a just-in-time data warehouse.

StreamSets Launches Embed Program to Help Technology Innovators Integrate Big Data Flows Into Their Products

StreamSets, the dataflow performance management company, announced the StreamSets Embed Program, which provides support and services to technology companies needing to embed world-class data ingestion capabilities into their products and services.

Redis Labs and Intel Achieve Record Breaking Performance of 3 Million Database Operations/Second

Redis Labs, home of Redis, and Intel announced that they have collaboratively benchmarked a throughput of 3 million database operations/second at under 1 millisecond of latency, while generating over 1GB NVMe throughput, on a single server with Redis on Flash and Intel NVMe-based SSDs.

Data Science at Ticketmaster

In the video presentation below, Jenn Webb, Managing Editor at Radar, interviews John Carnahan who serves as Executive Vice President of Data Science at Ticketmaster.