News

AI Bots Cause 50% Bandwidth Surge on Wikipedia

AI Bots Cause 50% Bandwidth Surge on Wikipedia

April 04, 2025
AI bots Wikipedia bandwidth surge Wikimedia Foundation data scraping WE5 initiative
AI bots scraping data for model training have led to a 50% surge in Wikipedia's bandwidth since January 2024, straining Wikimedia's servers and increasing technical and financial costs.

AI Bots Cause 50% Bandwidth Surge on Wikipedia

Video: AI is CRASHING Wikipedia! 🤯 50% Bandwidth Spike! - YouTube

AI bots have significantly impacted Wikipedia's bandwidth, leading to a 50% surge since January 2024. This surge is primarily due to automated bots scraping data for AI model training, which has strained Wikimedia's servers and increased bandwidth usage for downloading multimedia content.

Wikimedia Foundation, which hosts Wikipedia and Wikimedia Commons, has reported that these bots are vacuuming up terabytes of data through direct crawling, APIs, and bulk downloads. This non-human traffic has imposed steep technical and financial costs on the foundation, often without proper attribution that supports Wikimedia's volunteer ecosystem.

One notable incident occurred when former US President Jimmy Carter passed away in December 2024. While his Wikipedia page drew millions of views, the simultaneous streaming of a 1.5-hour video from Wikimedia Commons doubled the normal network traffic, temporarily maxing out several Internet connections. This event highlighted the underlying issue of baseline bandwidth being consumed by bots scraping media at scale.

To mitigate these challenges, Wikimedia engineers have been forced to reroute traffic and implement various technical solutions. However, the foundation continues to face difficulties as many AI-focused crawlers ignore robots.txt directives, spoof browser user agents, and rotate through residential IP addresses to evade detection.

Wikimedia is now focusing on systemic approaches under the initiative WE5: Responsible Use of Infrastructure, aiming to establish sustainable boundaries while preserving openness. The foundation emphasizes that while its content is freely licensed, the infrastructure is not, and better coordination between AI developers and resource providers is essential to resolve these issues.

For more detailed information, you can visit the TechTimes article or the Ars Technica article.

Sources

AI bots strain Wikimedia as bandwidth surges 50% - Ars Technica AI bots strain Wikimedia as bandwidth surges 50%. Automated AI bots seeking training data threaten Wikipedia project stability, foundation says.
Wikipedia Sees A 50 Percent Bandwidth Increase Due To AI Bot ... The Wikimedia Foundation says that Wikipedia has experienced a 50% surge in bandwidth, most of which are AI Bot crawlers.
Wikimedia Complains About AI Bots Scraping as It Strains Servers ... Overall, Wikimedia claimed that since January 2024, its bandwidth for downloading content surged by 50%. AI bots that are scraping from their ...