The Future of Data Science

Here at insideBIGDATA, we’re very serious about data science and machine learning. Data science holds the potential to dramatically impact our lives and how we work. Despite its promise, many questions about data science remain.

Syncsort’s Latest Innovations Simplify Integration of Streaming Data in Spark, Kafka and Hadoop for Real-Time Analytics

Syncsort, a global leader in Big Data software, announced new capabilities, including native integration with Apache Spark and Apache Kafka, allowing organizations to access and integrate enterprise-wide data with streams from real-time sources.

Data Science 101: GPU Programming for Beginners

The presentation below is an educational resource that sets the stage for parallel programming with GPUs (graphics processing units) and was sponsored by the Center for Astrophysics and Supercomputing at Swinburne University of Technology. GPUs are becoming quite popular for the implementation of deep learning solutions.

Dell Further Democratizes Advanced Analytics With Latest Release of Statistica

Dell announced a major new release of its award-winning Statistica advanced analytics platform, Dell Statistica version 13.1. This latest version delivers a host of capabilities designed to empower citizen data scientists, help organizations better address growing IoT analytics requirements, and better leverage increasingly heterogeneous data environments.

Data Science 101: Clustering Approaches & Techniques

The presentation below by Derek Kane provides an overview of clustering techniques, including K-Means, Hierarchical Clustering, and Gaussian Mixture Models.

Slack Your Data: Looker Releases Slack Integration that Lets Your Data Join the Conversation

Looker, the company that is powering data-driven businesses, announced Lookerbot, which makes Looker the first enterprise data platform to integrate with Slack.

Can Spark Data Tools Stamp Out Cyber Crime?

The video presentation below discusses how big data engines like Apache Spark are being deployed to help detect and put an end to ad fraud. Spark allows enterprises across various sectors, including security firms, to analyze data in real time, catching suspicious patterns and helping halt fraudulent activity and breaches earlier.

Video: Machine Learning Overview from NERSC

In this video from the HPC User Forum in Tucson, Prabhat from NERSC presents: Machine Learning. “Prabhat leads the Data and Analytics Services team at NERSC. His current research interests include scientific data management, parallel I/O, high performance computing and scientific visualization.”

NVIDIA Delivers Massive Performance Leap for Deep Learning with NVIDIA Tesla P100 Accelerators

The latest addition to the NVIDIA Tesla Accelerated Computing Platform, the Tesla P100 enables a new class of servers that can deliver the performance of hundreds of CPU server nodes. Today’s data centers — vast network infrastructures with numerous interconnected commodity CPU servers — process large numbers of transactional workloads, such as web services.

Data Science and Statistics: Different Worlds?

The video presentation below, courtesy of the Royal Statistical Society, features a panel of distinguished practitioners who bring their own perspectives on important issues surrounding the growing field of data science.