Get the latest tech news

AMD Unveils Its First Small Language Model AMD-135M


In the ever-evolving landscape of artificial intelligence, large language models (LLMs) like GPT-4 and Llama have garnered significant attention for their impressive capabilities in natural language processing and generation. However, small language models (SLMs) are emerging as an essential counter...

This work demonstrates the commitment to an open approach to AI which will lead to more inclusive, ethical, and innovative technological progress, helping ensure that its benefits are more widely shared, and its challenges more collaboratively addressed. However, a major limitation of this approach is that each forward pass can only generate a single token, resulting in low memory access efficiency and affecting overall inference speed. This approach allows each forward pass to generate multiple tokens without compromising performance, thereby significantly reducing memory access consumption, and enabling several orders of magnitude speed improvements.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of AMD

AMD

Photo of small language model

small language model

Photo of amd-135

amd-135

Related news:

News photo

AMD Releases AMD-135M: An Open-Source Small Language Model

News photo

Early Linux 6.12 Kernel Benchmarks Showing Some Nice Gains On AMD Zen 5

News photo

Intel Xeon 6980P vs. AMD EPYC Power Efficiency / Performance-Per-Watt Benchmarks