Get the latest tech news
AVX-512 Performance With 256-bit vs. 512-bit Data Path For AMD EPYC 9005 CPUs
Now past the launch day for the AMD EPYC 9005 series server processors and having delivered initial AMD EPYC Zen 5 benchmarks for the EPYC 9575F / EPYC 9755 / EPYC 9965 SKUs, it's onto one of my favorite areas of testing and that is the more focused benchmarks looking at different specific changes/features of new processors.
With the 5th Gen AMD EPYC "Turin" processors they all now enjoy a 512-bit data path for faster Advanced Vector Extensions 512 usage. Zen 4 with AMD's original AVX-512 implementation relied on a 256-bit "double pumped" approach that worked well and proved to be very efficient. Page 1 - IntroductionPage 2 - miniBUDE + NAMD + GROMACSPage 3 - simdjson + Embree + OpenVKL + OSPRayPage 4 - OSPRay Studio + Y-Cruncher + Xmrig + SMHasher + oneDNNPage 5 - PyTorch + TensorFlow + OpenVINO AIPage 6 - libxsmm + ONNX + Numpy + SVT-AV1Page 7 - Overall AVX-512 Metrics
Or read this on Phoronix