Datameer’s Stefan Groschupf on the Role of the CDO

In another episode of the Big Data & Brews industry perspectives series, Stefan Groschupf, CEO of our friends over at Datameer, shares his thoughts on the role of the Chief Data Officer in today’s enterprise environment.

Data Science 101: Using the RForcecom R Package for Salesforce

In the short instructional video below, you’ll learn how to set up RForcecom, read an opportunity list from Salesforce, and apply a decision tree machine learning algorithm.

Datameer’s Stefan Groschupf on the Future of Spark

In another episode of the Big Data & Brews industry perspectives series, Stefan Groschupf, CEO of our friends over at Datameer, shares his thoughts on the future of Spark and its place in the evolution of the Hadoop environment.

Video: Fast, Beautiful and Easy Bayesian Modeling

“There are a number of Bayesian modelling packages available, but how do you know which one to use? This talk will take you through the positives and negatives of the major packages, focusing on the specifics of my work in health statistics, as well as providing a general overview of what these packages can do.”

Hadoop 101: Machine Learning in Big Data – Look Forward or Be Left Behind

In the Hadoop Summit 2015 presentation below, Bill Porto, senior analytics engineer at RedPoint Global, discusses why continual, adaptive optimization is key to maintaining a leadership position in the market.

Video: Data-driven Education and the Quantified Student

In this video from the PyData Seattle Conference, Lorena Barba from George Washington University presents: Data-driven Education and the Quantified Student. “Education has seen the rise of a new trend in the last few years: Learning Analytics. This talk will weave through the complex interacting issues and concerns involving learning analytics, at a high level. The goal is to whet the appetite and motivate reflection on how data scientists can work with educators and learning scientists in this swelling field.”

Slidecast: Introducing the Seagate 1200.2 SAS SSD

“The Seagate 1200.2 SSD family includes the next-generation of high-capacity, high-performance SAS SSDs designed with multiple endurance offerings optimized for demanding enterprise applications and maximum TCO savings. The 1200.2 SAS SSD family delivers ultra-fast, consistent and easily scalable performance that exceeds 12Gb/s SAS single port bandwidth. By removing the storage bottleneck, it closes the gap between processor and data storage performance and significantly improves overall system and application responsiveness.”

Analytics at the Speed of Business with Alteryx and Tableau

In the presentation below, Wendy Gradek, Sr. Manager EOS BI and Analytics at EMC, describes the benefits EMC is seeing in resource optimization, usability, and boot camps, and shares direct feedback from the business teams who are using Alteryx and Tableau.

Cardinal Health: Improving Sales Performance with Data Blending & Advanced Analytics

In the presentation below, the Nuclear Pharmaceutical Services division of Cardinal Health uses Alteryx to combine data from Salesforce.com, an Access database, an Excel spreadsheet, and Teradata, then performs time series forecasting before writing the results back to a Teradata Datalab.

Spark 101: Running Spark and MapReduce together in Production

Clusters must be tuned properly to run memory-intensive systems like Spark, H2O, and Impala alongside traditional MapReduce jobs. This Hadoop Summit 2015 talk describes Altiscale’s experience running the new memory-intensive systems in production for its customers.