Get the latest tech news

Implementation of Google's Griffin Architecture – RNN LLM


Open weights language model from Google DeepMind, based on Griffin. - google-deepmind/recurrentgemma

RecurrentGemma is a family of open-weights Language Models by Google DeepMind, based on the novel Griffin architecture. To run these notebooks you will need to have a Kaggle account and first read and accept the Gemma license terms and conditions from the RecurrentGemma page. The code has been optimized for running on TPU using the Flax implementation, which contains a low level Pallas kernel to perform the linear scan in the recurrent layers.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of Google

Google

Photo of implementation

implementation

Photo of griffin architecture

griffin architecture

Related news:

News photo

Google brings AI-powered editing tools, like Magic Editor, to all Google Photos users for free

News photo

TechCrunch Minute: Google’s Gemini Code Assist wants to use AI to help developers

News photo

How does Google's new Find My Device network actually work, and why should you care?