Read news on scale llms with our app.
Read more in the app
Serving 70B-scale LLMs efficiently on low-resource edge devices [pdf]