Sign up for our newsletter and get the latest big data news and analysis.

Cardinal Health: Improving Sales Performance with Data Blending & Advanced Analytics


In the presentation below, the Nuclear Pharmaceutical Services division of Cardinal Health uses Alteryx to combine data from, an Access Database, an Excel spreadsheet, and Teradata; then performs time series forecasting before writing the data back to a Teradata Datalab.

Spark 101: Running Spark and MapReduce together in Production


Clusters must be tuned properly to run memory-intensive systems like Spark, H2O, and Impala alongside traditional MapReduce jobs. This Hadoop Summit 2015 talk describes Altiscale’s experience running the new memory-intensive systems in production for its customers.

Data Science in Today’s Marketplace

Data Science

In the video presentation below, Charles Martin, Chief Scientist at Calculation Consulting, spoke to a class at Cal Berkeley’s Haas Business School. Given the business oriented audience, the discussion is high-level and not too technical but rather very practical in nature.

Data Science 101: Introduction to Deep Learning with Python


In the presentation below, Alec Radford, Head of Research at indico Data Solutions, talks about deep learning with Python and the Theano library.

Video: Cray Powered Analytics for Major League Baseball


In this video, Barry Bolding from Cray joins MLB Now to discuss how technology can benefit and enhance baseball. According to recent reports, at least one team in Major League Baseball is using a Cray Urika system in a bid to gain competitive advantage.

Data Science 101: How Deep Learning Powers Flickr


In recent years, deep learning is making tremendous strides in the field of machine learning. To provide insights into how businesses are using this technology, the video presentation below looks behind the scenes at a company with a very recognizable name – Flickr. The presenter is Dr. Pierre Garrigues, Researcher in Machine Perception & Learning […]

Data Science 101: Deep Learning for Language Understanding

Data Science

The presentation below, “Deep Learning for Language Understanding,” took place at the Deep Learning Summit in San Francisco on 29-30 January 2015. The featured speaker is Quoc Le, Research Scientist at Google.

Metanautix Launches Personal Quest Data Compute Engine for Individual Users


Metanautix, a big data analytics company focused on simplifying the data supply chain, today announced the availability of Metanautix Personal Quest. Personal Quest allows individual users to make rapid decisions on data assets of different format, shape and location using preferred tools like Tableau and the high-level functionality of standard SQL.

Altiscale Announces Apache Spark on the Altiscale Data Cloud


Altiscale, Inc., a leading provider of Hadoop-as-a-Service, today announced that Apache Spark is now available on the Altiscale Data Cloud. Altiscale customers can now leverage Apache Spark on Apache Hadoop in order to achieve their critical analytical and business objectives.

Data Science 101: What’s Coming for Spark in 2015


Apache Spark took the data science world by storm in 2014 as a technology foundation for big data applications. In the talk below from the Bay Area Spark User Meetup, Patrick Wendell from Databricks speaks about new developments in Spark and identifies areas of focus in the coming year.