Get the latest tech news

Training AI

From John Gruber today: It’s fair for public data to be excluded on an opt-out basis, rather than included on an opt-in one [...] No, no it’s not. This is a critical thing about ownership and copyright in the world.

Publishing text or images on the web does not make it fair game to train AI on. Besides that, I’d also add what I’ve seen no one else mention so far: People post content on web that they don’t own all the time. Whether reposting my content elsewhere is in good faith or not, it is now up someone other than me to decide whether or not to disallow AI training webcrawlers in their robots.txt file.

Get the Android app

Or read this on Hacker News