Show HN: Penny-1.7B Irish Penny Journal style transfer
Penny-1.7B is a 1.7-billion-parameter causal language model fine-tuned with Group Relative Policy Optimization (GRPO) to emulate the 19th-century prose of the Irish Penny Journal (IPJ, 1840). The RL stage ran for 6,800 policy steps, using a reward model trained to classify sentences as either original IPJ text or modern translation. Maximizing this score nudges generations toward authentic Victorian-era diction while retaining the general reasoning ability of the base SmolLM2 model.
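To make the "group relative" part of GRPO concrete, here is a minimal sketch of how rewards for several completions of one prompt might be turned into advantages. It is an illustration under assumptions, not the training code used for Penny-1.7B: the function name `group_relative_advantages`, the `eps` constant, and the example scores are hypothetical, and the scores stand in for the style classifier's probability that a completion reads like original IPJ text.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize rewards within one group of completions for the same prompt.

    Each completion's advantage is its reward minus the group mean, divided by
    the group standard deviation, so no separate value network is needed.
    The population standard deviation is an assumption made for this sketch.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    # eps avoids division by zero when every completion gets the same reward.
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical classifier scores for four sampled completions of one prompt.
scores = [0.9, 0.4, 0.7, 0.2]
advantages = group_relative_advantages(scores)

for score, adv in zip(scores, advantages):
    print(f"reward={score:.2f} advantage={adv:+.3f}")
```

Completions that score above the group average receive a positive advantage and are reinforced; those below average are discouraged, which is how a classifier score of this kind could steer generations toward the target style.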