Get the latest tech news

Meta's AI memorised books verbatim – that could cost it billions


Many AI models were trained on the text of books, but a new test found at least one model has directly memorised nearly the entirety of some books, including Harry Potter and the Philosopher’s Stone, which could complicate ongoing legal battles over copyright infringement

Billions of dollars are at stake as courts in the US and UK decide whether tech companies can legally train their artificial intelligence models on copyrighted books. Authors and publishers have filed multiple lawsuits over this issue, and in a new twist, researchers have shown that at least one AI model has not only used popular books in its training data, but also memorised their contents verbatim. Such testing revealed that Meta’s Llama 3.1 70B model has memorised most of the first book in J. K. Rowling’s Harry Potter series, as well as The Great Gatsby and George Orwell’s dystopian novel 1984.

Get the Android app

Or read this on r/technology

Read more on:

Photo of Meta

Meta

Photo of billions

billions

Photo of memorised books

memorised books

Related news:

News photo

Meta sues 'nudify' app-maker that it claims ran 87k+ Facebook, Instagram ads

News photo

The ‘death of creativity’? AI job fears stalk advertising industry | WPP and others roll out AI-generated campaigns as Facebook owner Meta plans to let firms create their own ads

News photo

Scale AI Picks New CEO as Wang Set to Join Meta After Investment