Home TechnologyPowering the agents: Workers AI now runs large mod...
Technology⭐ Featured

Powering the agents: Workers AI now runs large models, starting with Kimi K2.5

Kimi K2.5 is now on Workers AI, helping you power agents entirely on Cloudflare’s Developer Platform. Learn how we optimized our inference stack and reduced inference costs for internal agent use cases.

6 April 2026 at 07:26 pm
1 views
Powering the agents: Workers AI now runs large models, starting with Kimi K2.5

Cloudflare has taken a significant step forward in its mission to make its Developer Platform the best environment for building and deploying AI agents. By launching Workers AI, the company is now offering frontier-scale open-source models directly on its AI inference platform, starting with Moonshot AI's Kimi K2.5. This move enables developers to run the entire agent lifecycle on a unified platform, optimizing both performance and cost.

For years, Cloudflare has been building the foundational primitives necessary for creating robust agents. These include Durable Objects for state persistence, Workflows for long-running tasks, and Dynamic Workers or Sandbox containers for secure execution. The Agents SDK, designed to simplify agent development, sits atop these primitives. However, these tools only provided the execution environment; the AI model powering the agent remained a separate consideration.

Workers AI now addresses this gap by running large models that are capable of powering agents with high reasoning capabilities and large context windows. The company's focus on optimizing the inference stack has resulted in significant cost reductions for internal agent use cases. This efficiency allows developers to leverage powerful AI models without compromising on performance or budget.

Cloudflare's journey with Kimi K2.5 began as an experiment, but it quickly evolved into a critical component for internal development tools. Engineers at Cloudflare have adopted Kimi as a daily driver for agentic coding tasks within the OpenCode environment. Additionally, the model has been integrated into the company's automated code review pipeline, with a public demonstration available through the Bonk agent on Cloudflare's GitHub repositories.

In production, Kimi K2.5 has proven to be a fast and efficient alternative to larger proprietary models, without sacrificing quality. By bringing this frontier-scale model directly into the Cloudflare Developer Platform, the company is making it possible to run the entire agent lifecycle on a single, unified platform. This integration streamlines development, deployment, and maintenance, allowing developers to focus on building intelligent agents without worrying about the underlying infrastructure.

The launch of Workers AI marks a significant milestone for Cloudflare's vision of creating a comprehensive ecosystem for AI agents. By offering frontier-scale models on its AI inference platform, the company is positioning itself as a leader in the field, providing developers with the tools and infrastructure they need to build and deploy advanced AI applications seamlessly.

In conclusion, Cloudflare's integration of Kimi K2.5 into Workers AI represents a major leap forward in its mission to make its Developer Platform the go-to environment for building and deploying AI agents. With a focus on optimizing performance and reducing costs, the company is enabling developers to leverage powerful AI models without compromising on efficiency. As Cloudflare continues to expand its offerings, it is poised to become a central hub for AI development, offering a unified platform that simplifies the entire agent lifecycle.

📰 Related News
Ekaya Banaras Founder Palak Shah’s ₹40 Lakh Billboard Mistake Became a Masterclass in Startup Marketing
Ekaya Banaras Founder Palak Shah’s ₹40 Lakh Billboard Mistake Became a Masterclass in Startup Marketing
Ekaya Banaras founder Palak Shah recently opened up about one of the most expensive mistakes she made while building her luxury textile brand. During the early years of the company, Shah rented a premium billboard near Delhi’s DLF Emporio to increase brand visibility. However, after forgetting to cancel the campaign, the hoarding reportedly continued running for months — resulting in losses of nearly ₹40 lakh. The incident has now become a viral example of how small operational oversights can turn into costly business lessons for startups and entrepreneurs.
28 May
Betting On AI: Jensen Huang And NVIDIA’s Rise To The Top
Betting On AI: Jensen Huang And NVIDIA’s Rise To The Top
Before AI was inevitable, it was a gamble—and Jensen Huang went all in.
14 Apr
Red Hat OpenShift sandboxed containers 1.12 and Red Hat build of Trustee 1.1 bring confidential computing to bare metal and AI workloads
Red Hat OpenShift sandboxed containers 1.12 and Red Hat build of Trustee 1.1 bring confidential computing to bare metal and AI workloads
Red Hat is excited to announce the release of Red Hat OpenShift sandboxed containers 1.12 and Red Hat build of Trustee 1.1, marking a major leap forward in our confidential computing journey. These releases graduate confidential containers on bare metal from …
14 Apr
Large AI firms hoovering maximum funding, not enough for smaller startups: Y Combinator’s Ankit Gupta
Large AI firms hoovering maximum funding, not enough for smaller startups: Y Combinator’s Ankit Gupta
YC Startup School: India’s talent pool across colleges and universities are key for building next-gen startups, which is what YC is looking to tap into. It wants to target entrepreneurs building for global markets, focussed on fintech, consumer, B2B, and ecom…
14 Apr
TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
TSMC-RESULTS/ (PREVIEW, PIX):PREVIEW-TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
14 Apr
TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
Any profit result ‌above T$505.7 billion would mark the company's highest-ever quarterly net income ​and its ninth consecutive quarter of profit growth
14 Apr
TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
On Thursday, ​TSMC is expected to report a net profit of $17.1 billion for the quarter, according to an LSEG SmartEstimate compiled from 19 analysts. The war in the Middle East threatens to disrupt the supply of production materials for semiconductors such as…
14 Apr
If we can’t kick the habit, how do we manage AI’s energy needs?
If we can’t kick the habit, how do we manage AI’s energy needs?
One can only hope that OpenAI’s Sam Altman was joking when he sought to justify the immense energy consumption of artificial intelligence
14 Apr
What caused Nvidia Blackwell GPU prices to spike? #tech
What caused Nvidia Blackwell GPU prices to spike? #tech
Blackwell GPU hourly “rent” surges on agentic AI demand A compute pricing index tracking hourly costs for Nvidia Blackwell GPUs shows a sharp climb: hourly rental hit $4.08 , up 48% from $2.75 just two months earlier. The reported driver is rising demand tied…
14 Apr
Anthropic Releases Claude Mythos Preview with Cybersecurity Capabilities but Withholds Public Access
Anthropic Releases Claude Mythos Preview with Cybersecurity Capabilities but Withholds Public Access
Anthropic has introduced Claude Mythos Preview, its most advanced AI model, improving significantly in reasoning, coding, and cybersecurity. Unlike previous releases, it will not be publicly available. Access is limited to a consortium of tech companies throu…
14 Apr