Get the latest tech news
OpenAI and Google accused of using YouTube transcripts for AI
Tech giants OpenAI and Google accused of using YouTube video transcripts to train AI, risking copyright infringement.
The report also suggests that OpenAI had depleted its supplies of useful data in 2021, and as a result, discussed transcribing podcasts, audiobooks and YouTube videos to train its next-generation model. By then, it is said that they had mined the computer code repository GitHub, and used up databases of chess moves and data describing high school tests and homework assignments from the website Quizlet. According to the Times, Meta is also facing a shortage of available training data, and in recordings reviewed by the publication, its AI team was heard discussing the unauthorized use of copyrighted materials in an effort to keep pace with OpenAI.
Or read this on ReadWrite