Deci delivers breakthrough inference performance on Intel’s 4th Gen Sapphire Rapids CPU

Deci, the deep learning company building the next generation of AI, announced a breakthrough performance on Intel’s newly released 4th Gen Intel® Xeon® Scalable processors, code-named Sapphire Rapids. By optimizing the AI models which run on Intel’s new hardware, Deci enables AI developers to achieve GPU-like inference performance on CPUs in production for both Computer Vision and Natural Language Processing (NLP) tasks.

CoreWeave Among First Cloud Providers to Offer NVIDIA HGX H100 Supercomputers Set to Transform AI Landscape

CoreWeave, a specialized cloud provider built for large-scale GPU-accelerated workloads, announced it is among the first to offer cloud instances with NVIDIA HGX H100 supercomputing. CoreWeave, Amazon, Google, Microsoft and Oracle are the first cloud providers included in the launch of this groundbreaking AI platform.

Oracle and NVIDIA Partner to Speed AI Adoption for Enterprises

Expanding their longstanding alliance, Oracle and NVIDIA today announced a multi-year partnership to help customers solve business challenges with accelerated computing and AI. The collaboration aims to bring the full NVIDIA accelerated computing stack — from GPUs to systems to software — to Oracle Cloud Infrastructure (OCI).

NVIDIA Hopper in Full Production

NVIDIA today announced that the NVIDIA H100 Tensor Core GPU is in full production, with global tech partners planning in October to roll out the first wave of products and services based on the groundbreaking NVIDIA Hopper™ architecture.

Exxact Partners with Run:ai to Offer Maximal Utilization in GPU Clusters for AI Workloads

Exxact Corporation; a leading provider of high-performance computing (HPC), artificial intelligence (AI), and data center solutions; now offers Run:ai in their solutions. This groundbreaking Kubernetes-based orchestration tool incorporates an AI-dedicated, high-performant super-scheduler tailored for managing GPU resources in AI clusters.

AMAX Launches GPU Servers Powered by Intel’s Newest Data Center GPU Flex Series for AI, Gaming, & Media Streaming

AMAX, a leading provider of turnkey rack-scale High Performance Computing (HPC) solutions, Deep Learning/AI applications and server appliance manufacturing, announces the new AceleMax X-122-Flex server solution featuring Intel’s next-generation Data Center GPU Flex Series, (formerly code-named Arctic Sound-M), providing the capability of a graphics processing (GPU) solution handling high density and complex workloads targeted towards media delivery, cloud gaming, AI, metaverse, and other emerging visual cloud use cases.

Introducing Vultr Talon with NVIDIA GPUs — Cloud Platform Breakthrough Makes Accelerated Computing Efficient and Affordable

Vultr®, a leading independent provider of cloud infrastructure, announced that Vultr Talon, powered by NVIDIA GPUs and NVIDIA AI Enterprise software, is now available in beta. A breakthrough cloud-based platform, Vultr Talon offers affordable accelerated computing by enabling GPU sharing, so multiple workloads can efficiently run on a single NVIDIA GPU.

The Looming Datacenter Paradigm Shift

In this contributed article, Jonathan Friedmann, CEO & Co-Founder of Speedata, discusses a paradigm shift that is already underway. CPU’s leading role in datacenters is no longer a given, and as database workloads grow increasingly varied, market forces will point more and more in a clear direction: the development of new, dedicated, accelerated solutions. Data workloads are only going to grow more diverse and intricate from here. The same holds true for the chips that will process them.

AWS Announces General Availability of Amazon EC2 DL1 Instances

Amazon Web Services, Inc. (AWS), an Amazon.com, Inc. company (NASDAQ: AMZN), announced general availability of Amazon Elastic Compute Cloud (Amazon EC2) DL1 instances, a new instance type designed for training machine learning models. DL1 instances are powered by Gaudi accelerators from Habana Labs (an Intel company) to provide up to 40% better price performance for training machine learning models than the latest GPU-powered Amazon EC2 instances.

Building the GPU-accelerated Datacenter

[Sponsored Post] To learn about GPU-accelerated datacenters, join the upcoming live webinar from Thinkmate and PNY. You’ll hear about the future of the datacenter, why the GPU is crucial, the technology behind GPU acceleration, and what sort of options exist for different industries or types of organizations.