Get the latest tech news
Meta's AI memorised books verbatim – that could cost it billions
Many AI models were trained on the text of books, but a new test found at least one model has directly memorised nearly the entirety of some books, including Harry Potter and the Philosopher’s Stone, which could complicate ongoing legal battles over copyright infringement
Billions of dollars are at stake as courts in the US and UK decide whether tech companies can legally train their artificial intelligence models on copyrighted books. Authors and publishers have filed multiple lawsuits over this issue, and in a new twist, researchers have shown that at least one AI model has not only used popular books in its training data, but also memorised their contents verbatim. Such testing revealed that Meta’s Llama 3.1 70B model has memorised most of the first book in J. K. Rowling’s Harry Potter series, as well as The Great Gatsby and George Orwell’s dystopian novel 1984.
Or read this on r/technology