IBM Cloud speeds AI workloads with Intel Gaudi 3 accelerators

For businesses that need more control over their AI development, IBM says they can deploy IBM watsonx.ai software with the Intel Gaudi 3-based virtual server on IBM Cloud VPC in Q2 2025. IBM watsonx.ai includes an end-to-end AI development studio, AI developer toolkit and full AI lifecycle management for developing AI services and deploying them into customers applications. 

“As AI is increasingly moving from an experimental trend to the backbone of real world applications, IT organizations are challenged with balancing the necessary performance with economic considerations of AI hardware, and doing so at scale,” wrote Mitch Lewis, a performance analyst with Signal65, which offers tech industry testing, performance validation, and data-based consulting.

“Previous analysis by Signal65 demonstrated that Intel Gaudi 3 accelerators were capable of offering highly competitive performance for AI inferencing workloads, while offering substantial economic advantages. The availability of Gaudi 3 accelerators on IBM Cloud looks to build upon these advantages while providing IT organizations with an easily accessible and scalable cloud-based approach to deploying AI applications,” Lewis wrote in a blog post about the Intel Gaudi 3 AI accelerator implementation on IBM Cloud.

“This preliminary performance testing conducted by Signal65 found Intel Gaudi 3 to offer highly competitive performance when compared to alternative Nvidia-based offerings on IBM Cloud. Gaudi 3 on IBM Cloud provides a flexible platform capable of achieving high performance across various models and technical configurations,” Lewis wrote. “In addition, the pricing of Gaudi 3 instances on IBM Cloud builds an appealing economic advantage over both Nvidia instance types” – the Nvidia H100 and H200 – that IBM Cloud also supports, Lewis stated.



Source link

Leave a Comment