Sign up for our newsletter and get the latest big data news and analysis.

Spark 101: Running Spark and MapReduce together in Production

Spark_logo_feature

Clusters must be tuned properly to run memory-intensive systems like Spark, H2O, and Impala alongside traditional MapReduce jobs. This Hadoop Summit 2015 talk describes Altiscale’s experience running the new memory-intensive systems in production for its customers.

Data Science in Today’s Marketplace

Data Science

In the video presentation below, Charles Martin, Chief Scientist at Calculation Consulting, spoke to a class at Cal Berkeley’s Haas Business School. Given the business oriented audience, the discussion is high-level and not too technical but rather very practical in nature.

Data Science 101: Introduction to Deep Learning with Python

deeplearning

In the presentation below, Alec Radford, Head of Research at indico Data Solutions, talks about deep learning with Python and the Theano library.

Video: Cray Powered Analytics for Major League Baseball

mlb

In this video, Barry Bolding from Cray joins MLB Now to discuss how technology can benefit and enhance baseball. According to recent reports, at least one team in Major League Baseball is using a Cray Urika system in a bid to gain competitive advantage.

Data Science 101: How Deep Learning Powers Flickr

deeplearning

In recent years, deep learning is making tremendous strides in the field of machine learning. To provide insights into how businesses are using this technology, the video presentation below looks behind the scenes at a company with a very recognizable name – Flickr. The presenter is Dr. Pierre Garrigues, Researcher in Machine Perception & Learning […]

Data Science 101: Deep Learning for Language Understanding

Data Science

The presentation below, “Deep Learning for Language Understanding,” took place at the Deep Learning Summit in San Francisco on 29-30 January 2015. The featured speaker is Quoc Le, Research Scientist at Google.

Metanautix Launches Personal Quest Data Compute Engine for Individual Users

metanautix-logo_feature

Metanautix, a big data analytics company focused on simplifying the data supply chain, today announced the availability of Metanautix Personal Quest. Personal Quest allows individual users to make rapid decisions on data assets of different format, shape and location using preferred tools like Tableau and the high-level functionality of standard SQL.

Altiscale Announces Apache Spark on the Altiscale Data Cloud

altiscale_logo

Altiscale, Inc., a leading provider of Hadoop-as-a-Service, today announced that Apache Spark is now available on the Altiscale Data Cloud. Altiscale customers can now leverage Apache Spark on Apache Hadoop in order to achieve their critical analytical and business objectives.

Data Science 101: What’s Coming for Spark in 2015

Spark_logo_feature

Apache Spark took the data science world by storm in 2014 as a technology foundation for big data applications. In the talk below from the Bay Area Spark User Meetup, Patrick Wendell from Databricks speaks about new developments in Spark and identifies areas of focus in the coming year.

GPU Accelerated Platforms for Deep Learning

sumit

“NVIDIA will present an update on accelerated computing, in particular, the latest de- velopments in the platform. They will touch upon NVLink, OpenPOWER, ARM64, and new software updates and also cover the broad-sweeping impact that a new field of machine learning, called Deep Learning, is having on applications and domains.”