Here at insideBIGDATA, we’re very serious about data science and machine learning. Data science holds the potential to dramatically impact our lives and how we work. Despite its promise, many questions about data science remain.
Syncsort’s Latest Innovations Simplify Integration of Streaming Data in Spark, Kafka and Hadoop for Real-Time Analytics
Syncsort, a global leader in Big Data software, announced new capabilities, including native integration with Apache Spark and Apache Kafka, allowing organizations to access and integrate enterprise-wide data with streams from real-time sources.
The presentation below is an educational resource that sets the stage for parallel programming with GPUs (graphics processing units) and was sponsored by the Center for Astrophysics and Supercomputing at Swinburne University of Technology. GPUs are becoming quite popular for the implementation of deep learning solutions.
Dell announced a major new release of its award-winning Statistica advanced analytics platform, Dell Statistica version 13.1. This latest version delivers a host of capabilities designed to empower citizen data scientists, help organizations better address growing IoT analytics requirements, and better leverage increasingly heterogeneous data environments.
The presentation below by Derek Kane provides an overview of clustering techniques, including K-Means, Hierarchical Clustering, and Gaussian Mixed Models.
Looker, the company that is powering data-driven businesses, announced Lookerbot, the first enterprise data platform to integrate with Slack.
The video presentation below discusses how big data engines like Apache Spark are being deployed to help detect and put an end to ad fraudulence. Spark allows for enterprises across various sectors, including security firms, to extract data in real time to catch patterns and help halt fraudulent activities and breaches earlier.
In this video from the HPC User Forum in Tucson, Prabhat from NERSC presents: Machine Learning. “Prabhat leads the Data and Analytics Services team at NERSC. His current research interests include scientific data management, parallel I/O, high performance computing and scientific visualization.”
The latest addition to the NVIDIA Tesla Accelerated Computing Platform, the Tesla P100 enables a new class of servers that can deliver the performance of hundreds of CPU server nodes. Today’s data centers — vast network infrastructures with numerous interconnected commodity CPU servers — process large numbers of transactional workloads, such as web services.