Exciting News from Microsoft Ignite 2023: 7 Big Updates from NVIDIA!
This year, Microsoft and NVIDIA joined forces for a ten-year partnership in the world of generative AI. With NVIDIA's hardware expertise and Microsoft's collaboration with OpenAI, the two companies are making waves in the AI landscape. At the ongoing Ignite conference, Microsoft shared seven noteworthy announcements about NVIDIA. Let's break it down:
1. New Azure Virtual Machines for AI Power:
Microsoft introduced the NC H100 v5 VM series for Azure, featuring the industry's first cloud instances with NVIDIA H100 NVL GPUs. These virtual machines are a game-changer for mid-range AI workloads, offering significant performance boosts, especially for models like GPT-3 175B.
2. Confidential Virtual Machines for Enhanced Security:
Microsoft is expanding its NVIDIA-powered services with the NCC H100 v5 VMs. These confidential virtual machines use NVIDIA H100 Tensor Core GPUs, ensuring data and application security while providing accelerated performance. They will soon be available for private preview on Azure.
3. AI Foundry Service for Custom Generative AI:
NVIDIA launched an AI foundry service to supercharge the development of custom generative AI applications. This service, available on Microsoft Azure, provides enterprises and startups with an end-to-end solution for creating and deploying custom generative AI models, including partnerships with companies like Amdocs.
4. Democratizing Access to AI Foundation Models:
Microsoft and NVIDIA are making AI Foundation Models more accessible to developers. Users can experience these models through a user-friendly interface or API directly from a browser. Models like Llama 2, Stable Diffusion XL, and Mistral can be customized with proprietary data, optimized for NVIDIA GPU-accelerated stacks.
5. Simulation Engines for Omniverse Cloud:
NVIDIA launched two new simulation engines on Omniverse Cloud hosted on Microsoft Azure: the virtual factory simulation engine and the autonomous vehicle simulation engine. These engines aim to save costs and reduce lead times for automotive companies transitioning to AI-enhanced digital systems.
6. Upcoming TensorRT-LLM Update:
An upcoming update to TensorRT-LLM will add support for new large language models. This open-source software enhances AI inference performance and will soon be compatible with OpenAI's Chat API. The release promises improved inference performance, up to 5x faster, and support for additional popular language models.
In summary, Microsoft and NVIDIA's collaboration is bringing some groundbreaking advancements in the world of AI, making powerful tools more accessible and secure for developers and businesses alike.