Get the latest tech news
Training AI
From John Gruber today: It’s fair for public data to be excluded on an opt-out basis, rather than included on an opt-in one [...] No, no it’s not. This is a critical thing about ownership and copyright in the world.
Publishing text or images on the web does not make it fair game to train AI on. Besides that, I’d also add what I’ve seen no one else mention so far: People post content on web that they don’t own all the time. Whether reposting my content elsewhere is in good faith or not, it is now up someone other than me to decide whether or not to disallow AI training webcrawlers in their robots.txt file.
Or read this on Hacker News