Get the latest tech news

Show HN: NanoEuler – GPT-2 scale model in pure C/CUDA from scratch


GPT-2-style LLM built from scratch in C/CUDA with hand-written backprop, BPE tokenizer, FlashAttention, pretraining, and SFT. - JustVugg/nanoeuler

None

Get the Android app

Or read this on Hacker News

Read more on:

Photo of Scratch

Scratch

Photo of CUDA

CUDA

Photo of pure C

pure C

Related news:

News photo

Robotics Teams Are Rebuilding the Data Stack from Scratch

News photo

Show HN: I wrote a C++ ray tracer from scratch without AI

News photo

TorchCodec 0.14: HDR Video Decoding for CPU and CUDA, and Fast Wav Decoder