Get the latest tech news

MobileLLM: Optimizing Sub-Billion Parameter Language Models for On-Device Use


MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024. - facebookresearch/MobileLLM

In this work, we comprehensively consider multiple design factors to obtain high-quality LLMs with fewer than a billion parameters. We integrated (1) SwiGLU activation function, (2) deep and thin architectures, (3) embedding sharing, (4) grouped-query attention to build MobileLLM. This script can be modified to adjust the--nnodes parameter and other settings to suit different multi-node configurations, such as those using slurm or torchx.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of use

use

Photo of device

device

Photo of mobilellm

mobilellm

Related news:

News photo

Pioneering Code of Practice released for use of stem cell-based embryo models in research

News photo

Pestle’s app can now save recipes from Reels using on-device AI

News photo

Microsoft demands China staff use iPhones not Android phones – report