NVIDIA Launches Inference Platforms for Large Language Models and Generative AI Workloads

NVIDIA launched four inference platforms optimized for rapidly emerging generative AI workloads, helping developers quickly build specialized, AI-powered applications that deliver new services and insights. The platforms combine NVIDIA’s full stack of inference software with the latest NVIDIA Ada, NVIDIA Hopper™ and NVIDIA Grace Hopper™ processors, including the NVIDIA L4 Tensor Core GPU and the NVIDIA H100 NVL GPU, both launched at GTC.

Intel’s Habana Labs Launches Second-Generation AI Processors for Training and Inferencing

Intel announced that Habana Labs, its data center team focused on AI deep learning processor technologies, launched its second-generation deep learning processors for training and inference: Habana® Gaudi®2 and Habana® Greco™. These processors fill an industry gap by giving data center customers high-performance, high-efficiency deep learning compute for both training workloads and inference deployments, while lowering the barrier to AI entry for companies of all sizes.

TensorRT 8 Provides Leading Enterprises Fast AI Inference Performance

NVIDIA launched TensorRT™ 8, the eighth generation of the company’s AI inference software, which cuts inference time in half for language queries, enabling developers to build the world’s best-performing search engines, ad recommendation systems and chatbots and offer them from the cloud to the edge.