- Anker's newest USB-C cables are a smart way to future-proof your tech
- Deal alert: Our favorite noise-canceling headphones of 2024 are at their lowest price ever for Black Friday
- Enhancing Container Security with Docker Scout and Secure Repositories | Docker
- Where to find the best Linux support, no matter your skill level: 5 options
- Why the Meta Quest 3S is the ultimate 2024 holiday present
VMware and NVIDIA Unlock Generative AI for Enterprises
New VMware Private AI Foundation With NVIDIA Enables Enterprises to Ready Their Businesses for Generative AI; Platform to Further Support Data Privacy, Security and Control LAS VEGAS–(BUSINESS WIRE)– VMware Explore—VMware, Inc. (NYSE: VMW) and NVIDIA (NASDAQ: NVDA) today announced the expansion of their strategic partnership to ready the hundreds of thousands of enterprises that run on …
Tue, 22 Aug 2023 00:00:00
New VMware Private AI Foundation With
VMware Explore—VMware, Inc. (NYSE: VMW) and NVIDIA (NASDAQ: NVDA) today announced the expansion of their strategic partnership to ready the hundreds of thousands of enterprises that run on VMware’s cloud infrastructure for the era of generative AI.
“Generative AI and multi-cloud are the perfect match,” said
“Enterprises everywhere are racing to integrate generative AI into their businesses,” said
Full-Stack Computing to Supercharge Generative AI
To achieve business benefits faster, enterprises are seeking to streamline development, testing and deployment of generative AI applications. McKinsey estimates that generative AI could add up to
The platform is expected to include integrated AI tools to empower enterprises to run proven models trained on their private data in a cost-efficient manner. To be built on
- Privacy — Will enable customers to easily run AI services adjacent to wherever they have data with an architecture that preserves data privacy and enables secure access.
- Choice — Enterprises will have a wide choice in where to build and run their models — from NVIDIA NeMo™ to Llama 2 and beyond — including leading OEM hardware configurations and, in the future, on public cloud and service provider offerings.
- Performance — Running on NVIDIA accelerated infrastructure will deliver performance equal to and even exceeding bare metal in some use cases, as proven in recent industry benchmarks.
- Data-Center Scale — GPU scaling optimizations in virtualized environments will enable AI workloads to scale across up to 16 vGPUs/GPUs in a single virtual machine and across multiple nodes to speed generative AI model fine-tuning and deployment.
-
Lower Cost — Will maximize usage of all compute resources across GPUs, DPUs and CPUs to lower overall costs, and create a pooled resource environment that can be shared efficiently across teams. -
Accelerated Storage —
VMware vSAN Express Storage Architecture will provide performance-optimized NVMe storage and supports GPUDirect® storage over RDMA, allowing for direct I/O transfer from storage to GPUs without CPU involvement. - Accelerated Networking — Deep integration between vSphere and NVIDIA NVSwitch™ technology will further enable multi-GPU models to execute without inter-GPU bottlenecks.
- Rapid Deployment and Time to Value — vSphere Deep Learning VM images and image repository will enable fast prototyping capabilities by offering a stable turnkey solution image that includes frameworks and performance-optimized libraries pre-installed.
The platform will feature NVIDIA NeMo, an end-to-end, cloud-native framework included in NVIDIA AI Enterprise — the operating system of the NVIDIA AI platform — that allows enterprises to build, customize and deploy generative AI models virtually anywhere. NeMo combines customization frameworks, guardrail toolkits, data curation tools and pretrained models to offer enterprises an easy, cost-effective and fast way to adopt generative AI.
For deploying generative AI in production, NeMo uses TensorRT for Large Language Models (TRT-LLM), which accelerates and optimizes inference performance on the latest LLMs on NVIDIA GPUs. With NeMo,
At VMware Explore 2023, NVIDIA and
Broad Ecosystem Support for VMware Private AI Foundation With NVIDIA
The NVIDIA L40S GPU enables up to 1.2x more generative AI inference performance and up to 1.7x more training performance compared with the NVIDIA A100 Tensor Core GPU.
NVIDIA BlueField-3 DPUs accelerate, offload and isolate the tremendous compute load of virtualization, networking, storage, security and other cloud-native AI services from the GPU or CPU.
NVIDIA ConnectX-7 SmartNICs deliver smart, accelerated networking for data center infrastructure to boost some of the world’s most demanding AI workloads.
Availability
Citations
1-“The economic potential of generative AI: The next productivity frontier,” McKinsey, 2023
About NVIDIA
Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling industrial digitalization across markets. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry. More information at https://nvidianews.nvidia.com/.
About
Certain statements in this press release including, but not limited to, statements as to: the benefits, impact, performance, features and availability of our products and technologies, including NVIDIA AI Enterprise, NVIDIA NeMo, Llama 2, TensorRT, the NVIDIA L40S GPU, NVIDIA BlueField-3 DPUs, and NVIDIA ConnectX-7 SmartNICs; NVIDIA’s partnership with
Many of the products and features described herein remain in various stages and will be offered on a when-and-if-available basis. The statements above are not intended to be, and should not be interpreted as a commitment, promise, or legal obligation, and the development, release, and timing of any features or functionalities described for our products is subject to change and remains at the sole discretion of NVIDIA. NVIDIA will have no liability for failure to deliver or delay in the delivery of any of the products, features or functions set forth herein.
© 2023
View source version on businesswire.com: https://www.businesswire.com/news/home/20230822023375/en/
+1 650 427 6145
eontiveros@vmware.com
+1-310-920-9642
smcphee@nvidia.com
Source: