Get the latest tech news

One million public Bluesky posts scraped for AI training


The data was posted to an AI company, then later removed after an outcry.

Reported by 404Media on Nov. 26, one million public Bluesky posts — complete with identifying user information — were crawled and then uploaded to AI company Hugging Face. The platform's firehose API is an "aggregated, chronological stream of all the public data updates as they happen in the network, including posts, likes, follows, handle changes, and more." This could be a major warning sign to many of the site's millions of new users, many of whom left competitor X in the wake of an alarming new AI training policy.

Get the Android app

Or read this on Mashable

Read more on:

Photo of AI training

AI training

Photo of public Bluesky posts

public Bluesky posts

Related news:

News photo

Bluesky’s open API means anyone can scrape your data for AI training

News photo

Indian news agency ANI sues OpenAI for unsanctioned content use in AI training

News photo

HarperCollins is asking authors to license their books for AI training