Get the latest tech news

Parallel Reduce and Scan on the GPU


Programming, GPU, Optimizations, Algorithms

They are basic building blocks for more complex algorithms, e.g. solving linear equations or stream compaction. Since workgroups can’t be synchronized between each other, we’ll need to use atomicAdd or simply run the entire algorith in multiple passes. Similarily to reduce, each invocation in the subgroup will receive the partial sum corresponding to its index (in increasing order).

Get the Android app

Or read this on Hacker News

Read more on:

Photo of GPU

GPU

Photo of parallel reduce

parallel reduce

Related news:

News photo

Show HN: Luminal – Open-source, search-based GPU compiler

News photo

Capcom asks PC Monster Hunter Wilds players to wait until Title Update 4 this winter for "CPU and GPU related optimizations"

News photo

Steam's new Performance Overlay is more accurate than Task Manager, says Valve — GPU utilization monitoring comes directly to Steam in latest client beta