Get the latest tech news
CubeCL: GPU Kernels in Rust for CUDA, ROCm, and WGPU
Multi-platform high-performance compute language extension for Rust. - tracel-ai/cubecl
With CubeCL, you can program your GPU using Rust, taking advantage of zero-cost abstractions to develop maintainable, flexible, and efficient compute kernels. We currently have an highly optimized matrix multiplication module, leveraging Tensor Cores on NVIDIA hardware where available, while gracefully falling back to basic instructions on other platforms. However, since the algorithmic behavior may depend on the vectorization factor, CubeCL allows you to access it directly in the kernel when needed, without any performance loss, using the comptime system!
Or read this on Hacker News