Matrix Core Programming on AMD CDNA3 and CDNA4 Architecture


In this blog post, we walk through how to use Matrix Cores in HIP kernels, with a focus on low-precision data types such as FP16, FP8, and FP4, as well as the new family of Matrix Core instructions with exponent block scaling introduced in the AMD CDNA™4 architecture. Through code examples and illustrations, we provide the necessary knowledge to start programming Matrix Cores, covering modern low-precision floating-point types, the Matrix Core compiler intrinsics, and the data layouts required by the Matrix Core instructions.
