What it takes to transpose a matrix
Introduction

Classical CPU architectures are poorly suited to matrix-oriented computations. Developers must invest considerable effort to create efficient algorithms, even for problems that appear trivial on the surface.
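To make the point concrete, here is a minimal sketch of two ways to transpose a matrix. It is not taken from the article. It assumes a row-major matrix stored in a flat list, and the function names and the `block` parameter are hypothetical. The naive version writes to the destination with a stride of `rows` elements, which tends to be unfriendly to CPU caches on large matrices. The blocked version works on small tiles so that both reads and writes stay within a limited memory region.

```python
def transpose_naive(src, rows, cols):
    """Transpose a row-major rows x cols matrix stored in a flat list."""
    dst = [0] * (rows * cols)
    for i in range(rows):
        for j in range(cols):
            # Reads walk src sequentially; writes jump by `rows` elements.
            dst[j * rows + i] = src[i * cols + j]
    return dst


def transpose_blocked(src, rows, cols, block=4):
    """Same result as transpose_naive, computed tile by tile.

    `block` is an illustrative tile size, not a tuned value.
    """
    dst = [0] * (rows * cols)
    for i0 in range(0, rows, block):
        for j0 in range(0, cols, block):
            # Handle one tile; min() clips tiles at the matrix edges.
            for i in range(i0, min(i0 + block, rows)):
                for j in range(j0, min(j0 + block, cols)):
                    dst[j * rows + i] = src[i * cols + j]
    return dst
```

Python is used here only to show the access pattern. Any speedup from tiling would be expected in a compiled language, where memory access cost dominates, and the best tile size depends on the hardware.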