Get the latest tech news

AMD GPU Operator Announced For Automated Driver Installation & Kubernetes Support


AMD today announced two new software projects to better enhance their software support for Instinct accelerators / graphics deployments within the data center: AMD GPU Operator and AMD Metrics Exporter.

AMD GPU Operator allows for the automated driver installation and management for the AMD driver / ROCm compute stack, easy deployment of AMD GPU device plug-ins, simplified GPU resource allocation for containers, automatic worker node labeling, and support for the upstream/vanilla Kubernetes. AMD GPU Operator aims to deliver a "zero-touch GPU setup" with its automatic ROCm driver management while being paired with enterprise-minded features to make the initial deployment and ongoing maintenance much easier for AMD hardware within varying sizes of AI and HPC deployments. AMD GPU Operator quietly saw its v1.0 release this past November and the AMD Device Metrics Exporter celebrated its v1.0 release in December but the software was only "announced" today via the ROCm blog.

Get the Android app

Or read this on Phoronix

Read more on:

Photo of Kubernetes

Kubernetes

Photo of amd gpu operator

amd gpu operator

Photo of gpu operator

gpu operator

Related news:

News photo

So you wanna write Kubernetes controllers?

News photo

Kubernetes horizontal pod autoscaling powered by an OpenTelemetry-native tool

News photo

Proton worldwide outage caused by Kubernetes migration, software change