Get the latest tech news
Maxtext: A simple, performant and scalable Jax LLM
A simple, performant and scalable Jax LLM! Contribute to google/maxtext development by creating an account on GitHub.
MaxText achieves and scales from single host to very large clusters while staying simple and "optimization-free" thanks to the power of Jax and the XLA compiler. Ultimately MaxText has an MFU more than three times the 17% reported most recently with that codebase, is massively scalable and implements a key-value cache for efficient auto-regressive decoding. By contrast, MaxText is a simple, concrete implementation of various LLMs that encourages users to extend by forking and directly editing the source code.
Or read this on Hacker News