Get the latest tech news

This Week in AI: Tech giants embrace synthetic data

In this edition of This Week in AI, TechCrunch's AI newsletter, we take a look at the big tech companies using synthetic data to train flagship models.

“We used novel synthetic data generation techniques, such as distilling outputs from OpenAI’s o1-preview, to fine-tune the GPT-4o to open canvas, make targeted edits, and leave high-quality comments inline,” ChatGPT head of product Nick Turley wrote in a post on X. In developing Movie Gen, a suite of AI-powered tools for creating and editing video clips, Meta partially relied on synthetic captions generated by an offshoot of its Llama 3 models. Aleksei Bochkovskii et al. share a method for zero-shot monocular depth estimation with high detail, meaning it uses a single camera, doesn’t need to be trained on specific things (like it works on a camel despite never seeing one), and catches even difficult aspects like tufts of hair.

Get the Android app

Or read this on TechCrunch