Making a vintage LLM from scratch


In this blog post, I share my adventures creating my own LLM from (almost) scratch, trained only on old texts. I wrote my own base-training and fine-tuning scripts, data-processing pipelines, and custom datasets.
