Get the latest tech news
Executing programs inside transformers with exponentially faster inference
We build a computer inside a transformer — executing arbitrary C programs for millions of steps with exponentially faster inference via 2D attention heads.
None
Or read this on Hacker News