GPU Prefix Sums: A nearly complete collection


A nearly complete collection of prefix sum algorithms implemented in CUDA, D3D12, Unity and WGPU. Theoretically portable to all wave/warp/subgroup sizes. Source: b0nes164/GPUPrefixSums on GitHub.

The prefix sum is one of the most important algorithmic primitives in parallel computing, underpinning everything from sorting to compression to graph traversal. GPUPrefixSums aims to bring state-of-the-art GPU prefix sum techniques from CUDA to portable compute shaders. In Decoupled Fallback, one of the techniques in the collection, a threadblock spins for a set number of cycles while waiting for the reduction of a preceding partition tile.
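To make the vocabulary concrete, here is a minimal sequential sketch in Python of what a prefix sum computes and of how an input can be split into partition tiles whose reductions feed later tiles. It is not the repository's code: the function names and the `tile_size` parameter are hypothetical, it runs on the CPU, and it does not model the spinning or fallback behaviour, only the data dependency between a tile and the reductions of the tiles before it.

```python
from itertools import accumulate


def exclusive_scan(values):
    """Sequential reference: out[i] is the sum of values[0..i-1]."""
    out = []
    running = 0
    for v in values:
        out.append(running)
        running += v
    return out


def tiled_inclusive_scan(values, tile_size=4):
    """Inclusive prefix sum computed tile by tile (hypothetical helper).

    On a GPU each tile would be handled by one threadblock; here the
    tiles are simply processed in a loop.
    """
    tiles = [values[i:i + tile_size] for i in range(0, len(values), tile_size)]

    # Step 1: every tile computes its own reduction (independent work).
    tile_sums = [sum(tile) for tile in tiles]

    # Step 2: a tile's carry-in is the sum of all preceding tile
    # reductions. This is the value a threadblock has to wait for.
    carry_in = exclusive_scan(tile_sums)

    # Step 3: scan each tile locally and add its carry-in.
    out = []
    for tile, offset in zip(tiles, carry_in):
        out.extend(offset + x for x in accumulate(tile))
    return out


print(tiled_inclusive_scan([3, 1, 4, 1, 5, 9, 2, 6, 5]))
# → [3, 4, 8, 9, 14, 23, 25, 31, 36]
```

Step 2 is where the waiting described above comes from: a tile cannot finish until the reductions of the tiles before it are known, which is why the strategy for obtaining them matters on real hardware.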
