CUDA

TorchCodec 0.14: HDR Video Decoding for CPU and CUDA, and Fast Wav Decoder

Tiny hackable CUDA language model implementation

When does fragmentation occur in the CUDA caching allocator?

Show HN: Tiny-vLLM – high performance LLM inference engine in C++ and CUDA

CUDA Proves Nvidia Is a Software Company

NVIDIA Releases CUDA-Oxide 0.1 For Experimental Rust-To-CUDA Compiler

Taking on CUDA with ROCm: 'One Step After Another'

NVIDIA Adds Official Support For RHEL-Compatible Distributions Like AlmaLinux With CUDA 13.2

ZLUDA Boasts Full Llama.cpp Support, Better Windows Handling For CUDA On Non-NVIDIA GPUs

ZLUDA Adds CUDA 13.1 Compatibility For Running CUDA Apps On Non-NVIDIA Hardware

OpenCV 4.13 Brings More AVX-512 Usage, CUDA 13 Support, Many Other New Features

ZLUDA For CUDA On Non-NVIDIA GPUs Enables AMD ROCm 7 Support

VectorWare – from creators of `rust-GPU` and `rust-CUDA`

How to tile matrix multiplication (2023)

ZLUDA 5 Released With An Offline Compiler For CUDA On Non-NVIDIA GPUs

Were RNNs all we needed? A GPU programming perspective

AMD tries to catch CUDA with performance-boosting ROCm 7 software

You can get Nvidia's CUDA on three popular enterprise Linux distros now - why it matters

Accelerated Game of Life with CUDA / Triton