AWS and NVIDIA Extend Collaboration to Advance Generative AI Innovation

GTC—Amazon Web Services (AWS), an Amazon.com company (NASDAQ: AMZN), and NVIDIA (NASDAQ: NVDA) today announced that the new NVIDIA Blackwell GPU platform—unveiled by NVIDIA at GTC 2024—is coming to AWS. AWS will offer the NVIDIA GB200 Grace Blackwell Superchip and B100 Tensor Core GPUs, extending the companies’ longstanding strategic collaboration to deliver the most secure and advanced infrastructure, software, and services to help customers unlock new generative artificial intelligence (AI) capabilities.

Kinetica Delivers Real-Time Vector Similarity Search

Kinetica, the real-time GPU-accelerated database for analytics and generative AI, unveiled at NVIDIA GTC its real-time vector similarity search engine that can ingest vector embeddings 5X faster than the previous market leader, based on the popular VectorDBBench benchmark.

Heard on the Street – 3/21/2024

Welcome to insideBIGDATA’s “Heard on the Street” round-up column! In this regular feature, we highlight thought-leadership commentaries from members of the big data ecosystem. Each edition covers the trends of the day with compelling perspectives that can provide important insights to give you a competitive advantage in the marketplace.

DDN AI400X2 Turbo Appliance Accelerates Gen AI and Inference for Data Center and Cloud by 10x

DDN®, a global leader in artificial intelligence (AI) and multi-cloud data management solutions, announced the latest addition to its powerful A3I® solutions, the DDN AI400X2 Turbo. The AI400X2 Turbo is 30% more powerful than the AI400X2, the previous industry performance leader, and offers faster performance and expanded connectivity options.

Taking ITSM To The Next Level, Faster—By Bringing AI Along For the Ride

In this contributed article, Krishna Sai from SolarWinds discusses how IT service management (ITSM) is at a crossroads. In recent years, IT teams have drastically changed how they communicate and collaborate. The evolution of IT infrastructure with the advent of cloud computing and big data has led to larger fleets of servers, more storage systems, and more complicated networks. The result is not just an increase in the number of devices and services to manage, but a qualitative change in the complexity of those systems.

NVIDIA Blackwell Platform Arrives to Power a New Era of Computing

GTC 2024—Powering a new era of computing, NVIDIA announced that the NVIDIA Blackwell platform has arrived — enabling organizations everywhere to build and run real-time generative AI on trillion-parameter large language models at up to 25x less cost and energy consumption than its predecessor.

Hitachi Vantara Announces Collaboration with NVIDIA to Create New Portfolio of Industrial AI Solutions

Hitachi Vantara, the data storage, infrastructure, and hybrid cloud management subsidiary of Hitachi, Ltd. (TSE: 6501), today announced a collaboration with NVIDIA to create a new generation of transformational artificial intelligence (AI) solutions. Hitachi Vantara will develop a portfolio of solutions, Hitachi iQ, to drive targeted AI outcomes by layering industry-specific capabilities on top of its AI solution stack, so outcomes can be more specific and relevant to an organization’s business. 

Pure Storage Accelerates Enterprise AI Adoption to Meet Growing Demands with NVIDIA AI

Pure Storage® (NYSE: PSTG), the IT pioneer that delivers advanced data storage technology and services, today announced new validated reference architectures for running generative AI use cases, including a new NVIDIA OVX-ready validated reference architecture. As a leader in AI, Pure Storage, in collaboration with NVIDIA, is arming global customers with a proven framework to manage the high-performance data and compute requirements they need to drive successful AI deployments. 

The Five Step Playbook to Move GenAI into Production

In this contributed article, Josh Reini, Developer Relations Data Scientist at TruEra, discusses why gaining the confidence to deploy GenAI apps at scale can be challenging and how structured evaluation has gained recognition as a key requirement on the path from science experiment to customer value. Evaluation frameworks can play a critical role in this journey by allowing developers to run experiments faster and validate production readiness systematically. Connecting such an evaluation framework with a scaled observability platform brings confidence in production. This article explores five practical steps to move LLM applications from early prototypes to scaled production applications.

Video Highlights: NumPy, SciPy and the Economics of Open-Source — with Dr. Travis Oliphant

In this video presentation, our good friend Jon Krohn, Co-Founder and Chief Data Scientist at the machine learning company Nebula, explores the origins of NumPy and SciPy with their creator, Dr. Travis Oliphant. Dr. Oliphant shares his journey from personal need to global impact, the challenges overcome, and the future of these essential Python libraries in scientific computing and data science.