Get the latest tech news

Did OpenAI, Google and Meta 'Cut Corners' to Harvest AI Training Data?

What happened when OpenAI ran out of English-language training data in 2021?They just created a speech recognition tool that could transcribe the audio from YouTube videos, reports The New York Times, as part of an investigation arguing that tech companies "including OpenAI, Google and Meta have cut corners, ignored corporate policies and debated bending the law" in their search for AI training data. At Meta, which owns Facebook and Instagram, managers, lawyers and engineers last year discussed buying the publishing house Simon & Schuster to procure long works, according to recordings of internal meetings obtained by the Times. "This is not organic data created by humans, but text, images and code that AI models produce — in other words, the systems learn from what they themselves generate."

Get the Android app

Or read this on Slashdot