Get the latest tech news

Did OpenAI, Google and Meta 'Cut Corners' to Harvest AI Training Data?


What happened when OpenAI ran out of English-language training data in 2021? They just created a speech recognition tool that could transcribe the audio from YouTube videos, reports The New York Times, as part of an investigation arguing that tech companies "including OpenAI, Google and Meta have ...

What happened when OpenAI ran out of English-language training data in 2021?They just created a speech recognition tool that could transcribe the audio from YouTube videos, reports The New York Times, as part of an investigation arguing that tech companies "including OpenAI, Google and Meta have cut corners, ignored corporate policies and debated bending the law" in their search for AI training data. At Meta, which owns Facebook and Instagram, managers, lawyers and engineers last year discussed buying the publishing house Simon & Schuster to procure long works, according to recordings of internal meetings obtained by the Times. "This is not organic data created by humans, but text, images and code that AI models produce — in other words, the systems learn from what they themselves generate."

Get the Android app

Or read this on Slashdot

Read more on:

Photo of Google

Google

Photo of OpenAI

OpenAI

Photo of AI training data

AI training data

Related news:

News photo

Only $499 for the Pixel 8a is crazy. How can Google do it?

News photo

Google I/O 2024 will be all about AI again

News photo

OpenAI to unveil AI-powered search engine - one day before Google's crucial annual developer conference