Get the latest tech news
Getty Images drops ‘cleanest’ visual dataset for training foundation models
With the open dataset, Getty Images wants to address enterprises’ ML training woes and position itself as a credible data partner.
The creative company, known for enabling the sharing, discovery and purchase of visual content from global photographers and videographers, today announced it is releasing images from its library as a sample open dataset on Hugging Face. There’s also no hassle of cleaning or enrichment as the whole thing has been specifically curated for machine learning (ML) training with high-resolution images, supported by rich structured metadata, and no unwanted elements like NSFW content. Eventually, Getty hopes the move will engage the developer community, helping them understand the depth and breadth of content the company can offer, and raise awareness that it can be a “trusted partner” for providing licensed, high-quality data for responsible AI training.
Or read this on Venture Beat