AI crawler wars threaten the web

AI crawler wars threaten to make the web more closed for everyone

AI crawler wars threaten to make the web more closed for everyone There’s an accelerating cat-and-mouse game between web publishers and AI crawlers, and we all stand to lose.

We often take the internet for granted. It’s an ocean of information at our fingertips—and it simply works. But this system relies on swarms of “crawlers”—bots that roam the web, visit millions of websites every day, and report what they see. This is how Google powers its search engines, how Amazon sets competitive prices, and how Kayak aggregates travel listings. Beyond the world of commerce, crawlers are essential for monitoring web security, enabling accessibility tools, and preserving historical archives. Academics, journalists, and civil societies also rely on them to conduct crucial investigative research.

Crawlers are endemic. Now representing half of all internet traffic, they will soon outpace human traffic. This unseen subway of the web ferries information from site to site, day and night. And as of late, they serve one more purpose: Companies such as OpenAI use web-crawled data to train their artificial intelligence systems, like ChatGPT.

» MIT Technology Review →

Written by: thechelsuk
Published: 8 April 2025, with 191 words, 0 replies.