Get the latest tech news

Reddit Will Block the Internet Archive


Reddit says that it has caught AI companies scraping its data from the Internet Archive's Wayback Machine, so it's going to start blocking the Internet Archive from indexing the vast majority of Reddit. From a report: The Wayback Machine will no longer be able to crawl post detail pages, comments, o...

Reddit says that it has caught AI companies scraping its data from the Internet Archive's Wayback Machine, so it's going to start blocking the Internet Archive from indexing the vast majority of Reddit. From a report: The Wayback Machine will no longer be able to crawl post detail pages, comments, or profiles; instead, it will only be able to index the Reddit.com homepage, which effectively means Internet Archive will only be able to archive insights into which news headlines and posts were most popular on a given day. "Internet Archive provides a service to the open web, but we've been made aware of instances where AI companies violate platform policies, including ours, and scrape data from the Wayback Machine," spokesperson Tim Rathschmidt tells The Verge.

Get the Android app

Or read this on Slashdot

Read more on:

Photo of Reddit

Reddit

Photo of internet archive

internet archive

Related news:

News photo

Reddit is restricting its availability to the Internet Archive's Wayback Machine

News photo

Reddit is people! Which means its search might not be so damaged by AI slop

News photo

Belgium Bans Internet Archive's 'Open Library'