Content Raven Launches “Marketing Raven”


Content Raven, a leading cloud-based file distribution toolkit that adds content control, security and deep analytics to files, has announced the addition of Marketing Raven™ to its suite of products.

The Different Types of Programmers

In this special guest feature, Jesse Anderson from Cloudera shares his perspective on becoming a computer programmer, including education, aptitude and other musings. As a bonus, check out the tutorial video at the end of the article.

Data Science 101: An Interview with Hadley Wickham


RStudio’s Chief Scientist Hadley Wickham was interviewed by DataScience.LA’s Eduardo Arino de la Rubia during the useR! 2014 conference at UCLA this past July.

John Chambers: Interfaces, Efficiency and Big Data

In the video below, industry luminary John Chambers delivers his keynote, “Interfaces, Efficiency and Big Data,” at the recent useR! 2014 conference.

Data Science, Big Data and Statistics – can we all live together?

Here is a topic that receives much debate these days – as diverse fields like statistics, computer science and applied mathematics converge with newly named fields such as data science and big data. Can’t we all get along?

Predictive Analytics for Big Data Using EmcienPatterns


EmcienPatterns is Emcien’s Data Analysis Platform, providing complete and automated data analysis by revealing the patterns in data, analyzing those connections, and delivering answers to the user or to downstream systems through APIs.

In-Memory Computing: Three Myths That Could Put Your Business at Risk

In this special guest feature, Eric Frenkiel, Co-founder and CEO of MemSQL, writes about three myths surrounding in-memory computing (IMC) and how companies that don’t take advantage of IMC risk being left behind.

Data Science 101: SparkR – Interactive R Programs at Scale

R + RDD = R2D2

R is a widely used statistical programming language but its interactive use is typically limited to a single machine. To enable large scale data analysis from R, SparkR was announced earlier this year in a blog post. SparkR is an open source R package developed at U.C. Berkeley AMPLab that allows data scientists to analyze large data sets and interactively run jobs on them from the R shell.

Data Science 101: Real-time Analytics using Cassandra, Spark and Shark

In the video below, Evan Chan (Software Engineer at Ooyala) describes his experience using the Spark and Shark frameworks for running real-time queries on top of Cassandra data.

Project Adam: a New Deep-Learning System


Project Adam is a new deep-learning system modeled after the human brain that has greater image classification accuracy and is 50 times faster than other systems in the industry. Project Adam is an initiative by Microsoft researchers and engineers that aims to demonstrate that large-scale, commodity distributed systems can train huge deep neural networks effectively.