What’s a NIM? Nvidia Inference Microservices is a new approach to gen AI model deployment that could change the industry
Nvidia's new NIM packages inference engines, industry-standard APIs and support for AI models into containers for easy model deployment.
NIM marks a major milestone for gen AI deployment: it is the foundation of Nvidia’s next-generation inference strategy, which will affect almost every model developer and data platform in the space. In response to a question from VentureBeat during the press briefing, Kari Briski, VP for gen AI software product management, emphasized that Nvidia is a platform company. “What we have found is that putting all these pieces together for a production environment to run gen AI at scale requires a lot of know-how and expertise, so that’s why we’ve packaged it together,” said Briski.
Or read this on Venture Beat