OpenAI and Google accused of using YouTube transcripts for AI


Tech giants OpenAI and Google have been accused of using YouTube video transcripts to train their AI models, risking copyright infringement.

The report also suggests that OpenAI had exhausted its supply of useful data in 2021 and, as a result, discussed transcribing podcasts, audiobooks, and YouTube videos to train its next-generation model. By then, the company had reportedly mined the code repository GitHub and used up databases of chess moves, along with data describing high school tests and homework assignments from the website Quizlet. According to the Times, Meta is also facing a shortage of available training data; in recordings reviewed by the publication, its AI team was heard discussing the unauthorized use of copyrighted materials in an effort to keep pace with OpenAI.

Source: ReadWrite
