Get the latest tech news

Bluesky users debate plans around user data and AI training


Social network Bluesky recently published a proposal on GitHub outlining new options it could give users to indicate whether they want their posts and

Social network Bluesky recently published a proposal on GitHub outlining new options it could give users to indicate whether they want their posts and data to be scraped for things like generative AI training and public archiving. Debates about AI training and copyright have dragged robots.txt into the spotlight, among other things highlighting the fact that it’s not legally enforceable. Under the proposal, users of the Bluesky app, or other apps that use the underlying ATProtocol, could go into their settings and allow or disallow the usage of their Bluesky data across four categories: generative AI, protocol bridging (i.e., connecting different social ecosystems), bulk datasets, and web archiving (such as the Internet Archive’s Wayback Machine).

Get the Android app

Or read this on TechCrunch

Read more on:

Photo of AI training

AI training

Photo of user data

user data

Photo of Bluesky users

Bluesky users

Related news:

News photo

Google joins OpenAI in pushing feds to codify AI training as fair use | Google says it just wants "balanced" copyright rules.

News photo

OpenAI calls for US government to codify ‘fair use’ for AI training

News photo

S. Korean top court rejects Meta's appeal over user data sharing fine