Read news on n-gram with our app.
Read more in the app
Lossless Acceleration of LLM via Adaptive N-Gram Parallel Decoding