Leaked Documents Show Nvidia Scraping ‘A Human Lifetime’ of Videos Per Day to Train AI


Internal emails, Slack conversations and documents obtained by 404 Media show how Nvidia created a yet-to-be-released video foundational model.

404 Media is an independent website whose work is written, reported, and owned by human journalists and whose intended audience is real people, not AI scrapers, bots, or a search algorithm.

Internal Slack chats, emails, and documents obtained by 404 Media show that Nvidia scraped videos from YouTube and several other sources to compile training data for its AI products. When asked about legal and ethical aspects of using copyrighted content to train an AI model, Nvidia defended its practice as being “in full compliance with the letter and the spirit of copyright law.” Internal conversations at Nvidia viewed by 404 Media show that when employees working on the project raised questions about potential legal issues surrounding the use of YouTube videos and of datasets compiled by academics for research purposes, managers told them they had clearance to use that content from the highest levels of the company.

Related news:

Nvidia driver update causes BSOD on older Windows PCs

Nvidia Allegedly Scraped YouTube, Netflix Videos for AI Training Data

Nvidia Shares Tumble After Reports of a Chip Delay