Get the latest tech news
Making a vintage LLM from scratch
In this blog post, I will share the adventures I had creating my own LLM, from (almost) scratch, trained only on old texts. I made my own base-training and fine-tuning scripts, data processing pipelines and custom datasets.
None
Or read this on Hacker News