Get the latest tech news

UCCL-EP: DeepEP-style expert parallelism on any NIC, no GPU-initiated comms


How UCCL reimplements the DeepEP kernels for arbitrary hardware by swapping the transport out from under them.

None

Get the Android app

Or read this on Hacker News

Read more on:

Photo of GPU

GPU

Photo of NIC

NIC

Photo of initiated comms

initiated comms

Related news:

News photo

Building a Korean ambiguity solver fast enough to skip the GPU: 7,300 words/SEC

News photo

NVIDIA challenger D-Matrix claims AI chip 10 times faster than GPU using 5 times less energy, skips DRAM for SRAM

News photo

Intel's mysterious new datacenter GPU is what Nvidia's Rubin CPX nearly was