AMD's CDNA 4 Architecture Announcement
CDNA 4 is AMD’s latest compute-oriented GPU architecture, and represents a modest update over CDNA 3.
Thanks to a mature software ecosystem and a heavy focus on matrix multiplication throughput (tensor cores), Nvidia could often get close (https://chipsandcheese.com/p/testing-amds-giant-mi300x) to the nominally far larger MI300X. Nvidia’s focus on machine learning and matrix operations keeps them very competitive in that category, despite having fewer SMs running at lower clocks.
Or, AMD could be doing something special to let the L2 transition a line to clean state if written data is likely to be read by other threads across the system, but isn’t expected to be modified again anytime soon.