Perplexity accused of scraping websites that explicitly blocked AI scraping


Internet giant Cloudflare says it detected Perplexity crawling and scraping websites, even after customers had added technical blocks telling Perplexity not to scrape their pages.

The network infrastructure giant accused Perplexity of obscuring its identity when trying to scrape web pages “in an attempt to circumvent the website’s preferences,” Cloudflare’s researchers wrote. Websites have tried to fight back against AI scraping by using the web-standard robots.txt file, which tells search engines and AI companies which pages can be indexed and which shouldn’t. Those efforts have seen mixed results so far. Cloudflare’s chief executive Matthew Prince has sounded the alarm, saying AI is breaking the business model of the internet, particularly for publishers.
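To illustrate how a robots.txt check works in practice, here is a minimal sketch using Python's standard `urllib.robotparser` module. The crawler name `ExampleAIBot`, the rules, and the `example.com` URLs are hypothetical and chosen only for illustration; they are not taken from Cloudflare's report or from any real site's configuration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block one named AI crawler entirely,
# and keep every other crawler out of /private/ only.
ROBOTS_TXT = """\
User-agent: ExampleAIBot
Disallow: /

User-agent: *
Disallow: /private/
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, user_agent: str, url: str) -> bool:
    """Return True if the rules permit this user agent to fetch the URL."""
    return parser.can_fetch(user_agent, url)


parser = build_parser(ROBOTS_TXT)

# The named crawler is blocked from the whole site.
print(is_allowed(parser, "ExampleAIBot", "https://example.com/article"))

# Other crawlers fall under the wildcard rules.
print(is_allowed(parser, "SomeSearchBot", "https://example.com/article"))
print(is_allowed(parser, "SomeSearchBot", "https://example.com/private/x"))

# The same request sent under a generic browser-style user agent
# no longer matches the ExampleAIBot rule.
print(is_allowed(parser, "Mozilla/5.0", "https://example.com/article"))
```

The last call shows why these efforts depend on cooperation: the rules are matched against whatever user agent the crawler declares, and nothing in robots.txt itself enforces them. A crawler that presents a different identity is simply evaluated under the wildcard rules.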

Read this on TechCrunch

Related news:

Perplexity is Using Stealth, Undeclared Crawlers To Evade Website No-Crawl Directives, Cloudflare Says

Perplexity is using stealth, undeclared crawlers to evade no-crawl directives

Google tool misused to scrub tech CEO’s shady past from search | Google has fixed the bug, which it says affected only "a tiny fraction of websites."