Get the latest tech news

Implementation of Google's Griffin Architecture – RNN LLM

Open weights language model from Google DeepMind, based on Griffin. - google-deepmind/recurrentgemma

RecurrentGemma is a family of open-weights Language Models by Google DeepMind, based on the novel Griffin architecture. To run these notebooks you will need to have a Kaggle account and first read and accept the Gemma license terms and conditions from the RecurrentGemma page. The code has been optimized for running on TPU using the Flax implementation, which contains a low level Pallas kernel to perform the linear scan in the recurrent layers.

Get the Android app

Or read this on Hacker News