Devs say AI crawlers dominate traffic, forcing blocks on entire countries

Full Article
Devs say AI crawlers dominate traffic, forcing blocks on entire countries

Software developer Xe Iaso faced severe disruptions due to aggressive AI crawler traffic from Amazon, leading to instability in their Git repository service. Despite implementing standard defenses, such as adjusting robots.txt and blocking known user-agents, the crawlers continued to evade detection. This situation reflects a growing crisis in the open source community, where AI bots are causing significant bandwidth costs and service instability.

The impact of AI crawlers is profound, with some open source projects reporting up to 97% of their traffic from these bots. In response, Iaso developed a proof-of-work challenge system called 'Anubis' to filter out bot traffic, which has shown effectiveness but also introduced delays for legitimate users. The situation has led to broader discussions about the predatory behavior of AI companies and the need for more responsible data collection practices.

• AI crawlers account for up to 97% of traffic in some open source projects.

• Iaso's 'Anubis' system filters out bot traffic but delays legitimate users.

Key AI Terms Mentioned in this Article

AI Crawler

AI crawlers are automated bots that scrape data from websites, often overwhelming server resources.

Proof-of-Work Challenge

A proof-of-work challenge requires users to solve computational puzzles to access content, deterring bot traffic.

Bandwidth Costs

Bandwidth costs refer to the expenses incurred from data transfer, significantly impacted by excessive bot traffic.

Companies Mentioned in this Article

Amazon

Amazon's AI crawlers have been identified as a major source of disruptive traffic to open source projects.

OpenAI

OpenAI's crawlers contribute to significant web traffic, complicating resource management for developers.

Anthropic

Anthropic's AI bots are part of the broader issue of excessive data scraping affecting open source infrastructure.

Alibaba

Crawlers from Alibaba have caused temporary outages in open source project infrastructures, highlighting the problem.

Get Email Alerts for AI News

By creating an email alert, you agree to AIleap's Terms of Service and Privacy Policy. You can pause or unsubscribe from email alerts at any time.

Latest Articles

Alphabet's AI drug discovery platform Isomorphic Labs raises $600M from Thrive
TechCrunch 6month

Isomorphic Labs, the AI drug discovery platform that was spun out of Google's DeepMind in 2021, has raised external capital for the first time. The $600

AI In Education - Up-level Your Teaching With AI By Cloning Yourself
Forbes 6month

How to level up your teaching with AI. Discover how to use clones and GPTs in your classroom—personalized AI teaching is the future.

Trump's Third Term - How AI Can Help To Overthrow The US Government
Forbes 6month

Trump's Third Term? AI already knows how this can be done. A study shows how OpenAI, Grok, DeepSeek & Google outline ways to dismantle U.S. democracy.

Sam Altman Says OpenAI Will Release an 'Open Weight' AI Model This Summer
Wired 6month

Sam Altman today revealed that OpenAI will release an open weight artificial intelligence model in the coming months. "We are excited to release a powerful new open-weight language model with reasoning in the coming months," Altman wrote on X.

Popular Topics