TorchCodec 0.14: HDR Video Decoding for CPU and CUDA, and Fast Wav Decoder
Tiny hackable CUDA language model implementation
When does fragmentation occur in the CUDA caching allocator?
Show HN: Tiny-vLLM – high performance LLM inference engine in C++ and CUDA
CUDA Proves Nvidia Is a Software Company
NVIDIA Releases CUDA-Oxide 0.1 For Experimental Rust-To-CUDA Compiler
Taking on CUDA with ROCm: 'One Step After Another'
NVIDIA Adds Official Support For RHEL-Compatible Distributions Like AlmaLinux With CUDA 13.2
ZLUDA Boasts Full Llama.cpp Support, Better Windows Handling For CUDA On Non-NVIDIA GPUs
ZLUDA Adds CUDA 13.1 Compatibility For Running CUDA Apps On Non-NVIDIA Hardware
OpenCV 4.13 Brings More AVX-512 Usage, CUDA 13 Support, Many Other New Features
ZLUDA For CUDA On Non-NVIDIA GPUs Enables AMD ROCm 7 Support
VectorWare – from creators of `rust-GPU` and `rust-CUDA`
How to tile matrix multiplication (2023)
ZLUDA 5 Released With An Offline Compiler For CUDA On Non-NVIDIA GPUs
Were RNNs all we needed? A GPU programming perspective
AMD tries to catch CUDA with performance-boosting ROCm 7 software
You can get Nvidia's CUDA on three popular enterprise Linux distros now - why it matters
Accelerated Game of Life with CUDA / Triton