AI models collapse when trained on recursively generated data


Analysis shows that indiscriminately training generative artificial intelligence on real and generated content, usually done by scraping data from the Internet, can lead to a collapse in the ability of the models to generate diverse high-quality output.
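The effect described above can be illustrated with a toy simulation. This is a minimal sketch under stated assumptions, not the experiment from the analysis: the "model" is assumed to be a one-dimensional Gaussian, "training" is assumed to mean estimating its mean and standard deviation from a small sample, and the function name `simulate_collapse`, the sample size and the generation count are all hypothetical choices made for illustration. Each generation is fitted only to data drawn from the previous generation's fit.

```python
import random
import statistics


def simulate_collapse(generations=500, sample_size=10, seed=0):
    """Repeatedly fit a Gaussian to samples drawn from the previous fit.

    Toy illustration only: a 1-D Gaussian stands in for a generative model.
    Returns the standard deviation recorded at every generation.
    """
    rng = random.Random(seed)
    mu, sigma = 0.0, 1.0  # generation 0: the "real" data distribution
    history = [sigma]
    for _ in range(generations):
        # Draw a finite training set from the current model only.
        samples = [rng.gauss(mu, sigma) for _ in range(sample_size)]
        # "Train" the next model by re-estimating mean and spread.
        mu = statistics.fmean(samples)
        sigma = statistics.stdev(samples)
        history.append(sigma)
    return history


if __name__ == "__main__":
    history = simulate_collapse()
    for g in (0, 10, 100, 500):
        print(f"generation {g}: std = {history[g]:.3e}")
```

In this sketch the estimated spread tends to shrink over generations because each finite sample loses a little information about the tails and nothing restores it, which loosely mirrors the loss of diversity described above. Mixing fresh real data into each generation's sample would be one way to experiment with slowing the shrinkage.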

We also briefly mention two concepts from the existing literature that are close to model collapse: catastrophic forgetting, which arises in the framework of task-free continual learning [7], and data poisoning [8, 9], which maliciously leads to unintended behaviour.

Gen 5: ism, which had been translated into more than 100 languages including English, French, German, Italian, Spanish, Portuguese, Dutch, Swedish, Norwegian, Polish, Hungarian, Slovak, Lithuanian, Estonian, Finnish, Romanian, Bulgarian, Turkish, Croatian, Serbian, Ukrainian, Russian, Kazakh, Kyrgyz.

For example, we saw the creation of click, content and troll farms, a form of human ‘language models’, whose job is to misguide social networks and search algorithms.

Read more on:

AI models

generated data

Related news:

MIT researchers advance automated interpretability in AI models

Qualcomm makes its AI models available to app developers

OpenAI used a game to help AI models explain themselves better