Why we're rethinking cache for the AI era
The explosion of AI-bot traffic, representing over 10 billion requests per week, has opened up new challenges and opportunities for cache design. We look at some of the ways AI bot traffic differs from humans, how this impacts CDN cache, and some early ideas for how Cloudflare is designing systems to improve the AI and human experience.

The rapid growth of AI-bot traffic, accounting for over 10 billion requests per week, has brought new challenges and opportunities to the world of cache design. As AI systems become increasingly reliant on web data to enhance their knowledge and capabilities, the way content delivery networks (CDNs) handle this traffic is being reevaluated. In this article, we explore the differences between AI bot traffic and human behavior, the impact on CDN cache, and early ideas for improving both AI and human experiences through innovative cache designs.
Cloudflare's data reveals that 32% of network traffic originates from automated sources, including search engine crawlers, uptime checkers, ad networks, and more recently, AI assistants. These AI bots access the web to gather relevant data for their knowledge bases, using retrieval-augmented generation (RAG) to generate responses. Unlike typical human behavior, AI agents, crawlers, and scrapers exhibit distinct patterns. For instance, they often issue high-volume requests in parallel, access rarely visited or loosely related content across a site, and perform sequential, complete scans of websites. An AI assistant generating a response might fetch images, documentation, and knowledge articles from dozens of unrelated sources.
While Cloudflare provides tools to control and limit automated access, many websites may want to serve AI traffic. For example, developers might ensure their documentation is up-to-date in foundational AI models, e-commerce sites may want product descriptions to appear in LLM search results, and publishers may seek payment for their content through mechanisms like pay-per-crawl. However, website operators face a dilemma: optimize for AI crawlers or human traffic. Current cache architectures force a choice between these two, as both exhibit widely different traffic patterns.
AI traffic poses unique challenges for cache design. Traditional caching strategies, optimized for human browsing patterns, may not efficiently handle the high-volume, parallel requests and diverse content access patterns of AI bots. This can lead to increased server load, reduced performance, and higher costs for both the content provider and the CDN. Additionally, the need to serve AI traffic while maintaining human-centric experiences complicates cache management.
To address these challenges, researchers at ETH Zurich and Cloudflare have begun exploring new cache designs tailored to the AI era. One approach is to develop adaptive caching systems that dynamically adjust to the traffic patterns of different users or bots. For instance, a system might prioritize caching frequently accessed content for humans while allowing AI bots to bypass the cache for less popular or niche content.
Another idea is to implement a tiered cache architecture, with separate layers optimized for different types of traffic. A high-performance layer could handle human requests, while a more flexible layer accommodates AI bot needs. This approach would allow operators to balance resource allocation and performance across both user groups.
Furthermore, researchers are investigating the potential for AI-driven cache optimization. Machine learning models could analyze traffic patterns and predict future access, enabling more efficient cache management. For example, an AI model might identify that a particular AI bot is likely to access a set of content frequently and pre-cache that data to reduce latency and improve performance.
Collaboration between CDNs, content providers, and AI developers is crucial for addressing these challenges. As AI traffic continues to grow, the need for innovative cache designs that support both human and AI users becomes increasingly important. By rethinking cache architectures and adapting to the unique demands of AI bots, the web can better serve the evolving needs of both users and AI systems.
In conclusion, the explosion of AI-bot traffic has forced a reevaluation of cache design to accommodate the distinct patterns of these systems. While traditional caching strategies may struggle to meet the demands of AI bots, new approaches such as adaptive caching, tiered architectures, and AI-driven optimization offer promising solutions. As the AI era progresses, continued collaboration between stakeholders will be essential to ensure that the web remains efficient, accessible, and beneficial for both humans and AI-driven applications.










