Get the latest tech news

3 ways Meta's Llama 3.1 is an advance for Gen AI


Three key design decisions by Meta scientists represent a tour de force in the engineering of increasingly large neural networks.

Meta tested different combinations of the compute intensity and the amount of data to find sweet spots where the mixture reached optimal performance on "downstream" benchmark tasks. Meta PropertiesThe important part is that the iterative process of validating each successive data and compute combination is what leads to the selection of the 405 billion parameters as the sweet spot. Although reasonable parties can disagree on how strictly they recognize the open-source nature of Llama 3.1, the fact that so much detail is offered about the training process of the model is itself a welcome trove of disclosure.

Get the Android app

Or read this on ZDNet

Read more on:

Photo of Meta

Meta

Photo of Ways

Ways

Photo of Gen AI

Gen AI

Related news:

News photo

Backed by Microsoft, AWS and Meta, the Overture Maps Foundation launches its first open map data sets

News photo

Meta Removes 63,000 Accounts Linked to Sextortion Scammers

News photo

Llama 3.1 in C