Get the latest tech news
OpenBLAS 0.3.29 Brings Auto-Detection For Intel Granite Rapids, Apple M4 & AMD Zen 5
y as a big update for this widely-used, open-source implementation for Basic Linear Algebra Subprograms and LAPACK APIs. OpenBLAS 0.3.29 brings improved thread scaling for multi-threaded SBGEMV and TRTRI, various multi-threaded fixes, improved documentation, and other general fixes.
OpenBLAS 0.3.29 is out today as a big update for this widely-used, open-source implementation for Basic Linear Algebra Subprograms and LAPACK APIs. OpenBLAS 0.3.29 brings improved thread scaling for multi-threaded SBGEMV and TRTRI, various multi-threaded fixes, improved documentation, and other general fixes. On the x86_64 side for OpenBLAS 0.3.29 there is CPU auto-detection for Intel Granite Rapids processors, auto-detection for AMD Zen 5 series processors, optimized SOMATCOPY_CT for AVX-capable targets, and a variety of other fixes/optimizations.
Or read this on Phoronix