Get the latest tech news

Getty Images drops ‘cleanest’ visual dataset for training foundation models


With the open dataset, Getty Images wants to address enterprises’ ML training woes and position itself as a credible data partner.

The creative company, known for enabling the sharing, discovery and purchase of visual content from global photographers and videographers, today announced it is releasing images from its library as a sample open dataset on Hugging Face. There’s also no hassle of cleaning or enrichment as the whole thing has been specifically curated for machine learning (ML) training with high-resolution images, supported by rich structured metadata, and no unwanted elements like NSFW content. Eventually, Getty hopes the move will engage the developer community, helping them understand the depth and breadth of content the company can offer, and raise awareness that it can be a “trusted partner” for providing licensed, high-quality data for responsible AI training.

Get the Android app

Or read this on Venture Beat

Read more on:

Photo of getty images

getty images

Photo of foundation models

foundation models

Photo of visual dataset

visual dataset

Related news:

News photo

Shutterstock releases generative 3D, Getty Images upgrades service powered by Nvidia

News photo

Former Velodyne CEO’s delivery robot startup is ditching LiDAR for foundation models

News photo

Picsart partners with Getty Images to develop a custom AI model