Web-Scraping AI Bots Cause Disruption For Scientific Databases and Journals


Automated web-scraping bots seeking training data for AI models are flooding scientific databases and academic journals with traffic volumes that render many sites unusable. The online image repository DiscoverLife, which contains nearly 3 million species photographs, began receiving millions of daily hits in February this year, slowing the site to the point that it no longer loaded, Nature reported Monday. The surge has intensified since the release of DeepSeek, a Chinese large language model that demonstrated effective AI could be built with fewer computational resources than previously thought. The Confederation of Open Access Repositories reported that more than 90% of 66 surveyed members experienced AI bot scraping, with roughly two-thirds suffering service disruptions.

Read this on Slashdot

