Sign up for our newsletter and get the latest big data news and analysis.

Interview: The Marketing Disruption Brought About by Machine Learning: Making Marketers the Heroes

I recently caught up with Dr. Olly Downs, Chief Scientist and CTO at Amplero to talk about the excitement surrounding his company’s launch of Amplero as well as some thoughts on the future of machine learning coupled with marketing.

Interview: Michael O’Connell, Chief Data Scientist at TIBCO


I recently caught up with Michael O’Connell who is Chief Data Scientist at TIBCO, a global leader in infrastructure and business intelligence software, to talk about the flourishing career outlook, demand and job security that a career in data science and business analytics are bringing to the higher education community.

Datameer’s Stefan Groschupf on the Future of Spark


As another episode of the Big Data & Brews industry perspectives series, Stefan Groschupf, CEO of our friends over at Datameer, shares his thoughts on the future of Spark and how it is part of an evolution in the Hadoop environment.

Hadoop 101: Machine Learning in Big Data – Look Forward or Be Left Behind


In the Hadoop Summit 2015 presentation below, Bill Porto, senior analytics engineer at RedPoint Global, will discuss why continual, adaptive optimization is key to maintaining a leadership position in the market.

Insilico Medicine to Utilize Deep Learning for Drug Repurposing and Discovery in Cancer and Age-Related Disease


Insilico Medicine, a bioinformatics company dedicated to drug discovery for cancer and aging, has launched its proprietary DeepPharma (TM) platform. DeepPharma utilizes the latest advances in deep learning to improve computer analysis of massive structured multi-omics data banks and millions of tissue-specific pathway activation profiles.

Spark 101: Anatomy of RDD – Deep Dive Into Spark RDD Abstraction


As Apache Spark continues its exponential rise in popularity as a big data platform, the presentation included below dives deeper into architecture -a detailed discussion about how RDD is constructed, transformed and executed over the cluster.

How Cox Auto Became a Data-driven Organization

BigData use case

In the presentation below, Cox Auto, a leading provider of automotive products and services, shares how it has transformed their business by simplifying data analysis and making data easily accessible to business decision makers.

Cardinal Health: Improving Sales Performance with Data Blending & Advanced Analytics


In the presentation below, the Nuclear Pharmaceutical Services division of Cardinal Health uses Alteryx to combine data from, an Access Database, an Excel spreadsheet, and Teradata; then performs time series forecasting before writing the data back to a Teradata Datalab.

Spark 101: Spark Streaming and GraphX at Netflix


The Bay Area Spark Meetup recently was hosted at Netflix to feature talks by Netflix engineers about their use of Spark Streaming and GraphX, as well as a Q&A session with the Netflix folks plus the lead engineer of Spark Streaming. The presentation is provided here with the abstracts of the two talks below.

Spark 101: Running Spark and MapReduce together in Production


Clusters must be tuned properly to run memory-intensive systems like Spark, H2O, and Impala alongside traditional MapReduce jobs. This Hadoop Summit 2015 talk describes Altiscale’s experience running the new memory-intensive systems in production for its customers.