Search Results for: parallel file systems

A Contrast of Paradigms – HPCC Systems & Hadoop

Flavio Villanustre writes about the differences between two powerful open source Big Data platforms, HPCC and Hadoop. Both are open source projects released under the Apache 2.0 license and are free to use; both leverage commodity hardware and local storage interconnected through IP networks, allowing for parallel data processing and/or querying […]

Video: Overview of OrangeFS, Open Source File System for Data-Intensive Workloads

In this video, Clemson’s Dr. Walt Ligon provides an overview of OrangeFS, an open source parallel file system tailor-made for Big Data. Developed from PVFS, OrangeFS scales well on large HPC systems and can also be used with emerging data-intensive workloads. Perhaps more importantly, OrangeFS is very […]

Best Practices – Big Data Acceleration

“This talk will provide an overview of challenges in accelerating Hadoop, Spark and Memcached on modern HPC clusters. An overview of RDMA-based designs for multiple components of Hadoop (HDFS, MapReduce, RPC and HBase), Spark, and Memcached will be presented. Enhanced designs for these components to exploit in-memory technology and parallel file systems (such as Lustre) will be presented. Benefits of these designs on various cluster configurations using the publicly available RDMA-enabled packages from the OSU HiBD project (http://hibd.cse.ohio-state.edu) will be shown.”

Efficiency: Big Data Meets HPC in Financial Services

Converging High Performance Computing (HPC) and Lustre* parallel file systems with Hadoop’s MapReduce for Big Data analytics can eliminate the need for a separate Hadoop storage infrastructure and speed up the entire analysis. Convergence is of particular interest to organizations that already have HPC in their infrastructure, such as the financial services industry and other industries adopting high performance data analytics.
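To make the convergence idea concrete, here is a minimal sketch, assuming Hadoop client libraries and a hypothetical Lustre mount at /mnt/lustre visible to every compute node: pointing Hadoop’s default file system at the POSIX mount lets MapReduce jobs read and write the parallel file system directly, with no HDFS layer in between. In practice this setting would normally live in core-site.xml; the programmatic form below is only for illustration.

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class LustreDefaultFsSketch {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();

        // Assumption: /mnt/lustre is a Lustre mount shared by all nodes.
        // Using file:/// as the default file system routes MapReduce I/O
        // through the local POSIX interface onto the parallel file system,
        // so no HDFS NameNode/DataNodes need to be deployed.
        conf.set("fs.defaultFS", "file:///");

        FileSystem fs = FileSystem.get(conf);
        Path input = new Path("/mnt/lustre/analytics/input"); // hypothetical job input path

        System.out.println("Default file system: " + fs.getUri());
        System.out.println("Input directory present: " + fs.exists(input));
    }
}
```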

insideBIGDATA Latest News – 8/14/2020

In this regular column, we’ll bring you all the latest industry news centered around our main topics of focus: big data, data science, machine learning, AI, and deep learning. Our industry is constantly accelerating, with new products and services being announced every day. Fortunately, we’re in close touch with vendors from this vast ecosystem, so we’re in a unique position to inform you about all that’s new and exciting. Our massive industry database is growing all the time, so stay tuned for the latest news items describing technology that may make you and your organization more competitive.

Why You Need a Modern Infrastructure to Accelerate AI and ML Workloads

Recent years have seen a boom in the generation of data from a variety of sources: connected devices, IoT, analytics, healthcare, smartphones, and much more. The resulting data management problem is particularly acute for Artificial Intelligence (AI) and Machine Learning (ML) workloads. This guest article from WekaIO highlights why focusing on optimizing infrastructure can accelerate machine learning workloads and drive AI success.

insideBIGDATA Guide to Data Platforms for Artificial Intelligence and Deep Learning – Part 4

With AI and DL, storage is the cornerstone of handling the deluge of data constantly generated in today’s hyperconnected world. It is the vehicle that captures and shares data to create business value. In this technology guide, insideBIGDATA Guide to Data Platforms for Artificial Intelligence and Deep Learning, we’ll see how current implementations for AI and DL applications can be deployed using new storage architectures and protocols specifically designed to deliver data with high throughput, low latency, and maximum concurrency.

Big Data Meets HPC – Exploiting HPC Technologies for Accelerating Big Data Processing

DK Panda from Ohio State University gave this talk at the Stanford HPC Conference. “This talk will provide an overview of challenges in accelerating Hadoop, Spark and Memcached on modern HPC clusters. An overview of RDMA-based designs for Hadoop (HDFS, MapReduce, RPC and HBase), Spark, Memcached, Swift, and Kafka using native RDMA support for InfiniBand and RoCE will be presented.”
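As a rough illustration of running Big Data frameworks directly on HPC storage (not of the OSU HiBD RDMA packages themselves, which ship as drop-in libraries), the sketch below assumes a Spark installation whose executors all see a hypothetical parallel file system mount at /mnt/lustre, and reads input from it with file:// URIs instead of hdfs://, avoiding a separate HDFS ingest step.

```java
import org.apache.spark.sql.Dataset;
import org.apache.spark.sql.Row;
import org.apache.spark.sql.SparkSession;

public class ParallelFsSparkRead {
    public static void main(String[] args) {
        SparkSession spark = SparkSession.builder()
                .appName("parallel-fs-read-sketch")
                .getOrCreate();

        // Assumption: the dataset has been staged on a shared Lustre/OrangeFS
        // mount; every executor reads it through the POSIX interface, so the
        // data never needs to be copied into HDFS before analysis.
        Dataset<Row> lines = spark.read().text("file:///mnt/lustre/datasets/logs/*.txt");

        System.out.println("Line count: " + lines.count());
        spark.stop();
    }
}
```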

DDN

For more than 15 years, DDN has designed, developed, deployed, and optimized systems, software, and solutions that enable enterprises, service providers, universities, and government agencies to generate more value and accelerate time to insight from their data and information, on premises and in the cloud. Why do data-intensive environments prefer DDN? DDN’s sustained vision and […]

Enabling Value for Converged Commercial HPC and Big Data Infrastructures through Lustre*

A number of industries rely on high-performance computing (HPC) clusters to process massive amounts of data. As these same organizations explore Big Data analytics based on Hadoop, they are realizing the benefits of converging Hadoop and HPC onto the same cluster rather than scaling out an entirely new Hadoop infrastructure.