Home TechnologyKernelEvolve: How Meta’s Ranking Engineer Agent Op...
Technology⭐ Featured

KernelEvolve: How Meta’s Ranking Engineer Agent Optimizes AI Infrastructure

This is the second post in the Ranking Engineer Agent blog series exploring the autonomous AI capabilities accelerating Meta’s Ads Ranking innovation. The previous post introduced Ranking Engineer Agent’s ML exploration capability, which autonomously designs, executes, and analyzes ranking model experiments. This post covers how to optimize the low-level infrastructure that makes those models run [...] Read More... The post KernelEvolve: How Meta’s Ranking Engineer Agent Optimizes AI Infrastructure appeared first on Engineering at Meta .

6 April 2026 at 06:45 pm
1 views
KernelEvolve: How Meta’s Ranking Engineer Agent Optimizes AI Infrastructure

Meta's Ranking Engineer Agent is pushing the boundaries of AI innovation, and a key component of this effort is the optimization of its underlying infrastructure. In this article, we delve into KernelEvolve, an agentic kernel authoring system that enables the efficient execution of AI models at scale.

Meta operates a vast fleet of heterogeneous hardware, including NVIDIA GPUs, AMD GPUs, Meta's custom MTIA silicon chips, and CPUs. Each of these hardware types requires specialized software to translate high-level model operations into efficient, chip-specific instructions known as optimized kernels. Traditionally, the process of authoring and optimizing kernels has been a time-consuming and labor-intensive task, requiring human experts to hand-tune kernels for each new chip generation and ML model architecture.

However, with the increasing number of models and the diversity of hardware types and generations, this manual approach has become unsustainable. To address this challenge, Meta developed KernelEvolve, an autonomous agent that optimizes performance for AI models. KernelEvolve significantly accelerates the development process by compressing weeks of expert engineering time—including profiling, optimizing, and cross-hardware debugging—into just hours of automated search and evaluation. This automation not only saves time but also frees up human engineers to focus on other critical tasks.

Moreover, KernelEvolve delivers substantial performance improvements. For instance, it achieved over 60% inference throughput improvement for the Andromeda Ads model on NVIDIA GPUs and over 25% training throughput improvement for an ads model on Meta's custom MTIA chips. These enhancements are crucial for maintaining the efficiency and scalability of Meta's AI infrastructure, which powers a wide range of services and applications.

Beyond the specific use case of Meta's Ranking Engineer Agent, KernelEvolve is a general-purpose solution applicable to a variety of AI models and hardware configurations. By automating the kernel optimization process, KernelEvolve ensures that the full potential of diverse hardware is harnessed, enabling faster development cycles and better performance across the entire AI ecosystem.

In conclusion, KernelEvolve represents a significant advancement in Meta's ongoing quest to optimize its AI infrastructure. By leveraging autonomous agents like KernelEvolve, Meta is able to efficiently scale its AI capabilities, ensuring that its models run at peak performance on a diverse range of hardware platforms. This innovation not only benefits Meta's Ads Ranking innovation but also sets a precedent for the broader AI community, demonstrating the potential of automated, agentic systems to drive performance and efficiency in complex, large-scale AI environments.

📰 Related News
Ekaya Banaras Founder Palak Shah’s ₹40 Lakh Billboard Mistake Became a Masterclass in Startup Marketing
Ekaya Banaras Founder Palak Shah’s ₹40 Lakh Billboard Mistake Became a Masterclass in Startup Marketing
Ekaya Banaras founder Palak Shah recently opened up about one of the most expensive mistakes she made while building her luxury textile brand. During the early years of the company, Shah rented a premium billboard near Delhi’s DLF Emporio to increase brand visibility. However, after forgetting to cancel the campaign, the hoarding reportedly continued running for months — resulting in losses of nearly ₹40 lakh. The incident has now become a viral example of how small operational oversights can turn into costly business lessons for startups and entrepreneurs.
28 May
Betting On AI: Jensen Huang And NVIDIA’s Rise To The Top
Betting On AI: Jensen Huang And NVIDIA’s Rise To The Top
Before AI was inevitable, it was a gamble—and Jensen Huang went all in.
14 Apr
Red Hat OpenShift sandboxed containers 1.12 and Red Hat build of Trustee 1.1 bring confidential computing to bare metal and AI workloads
Red Hat OpenShift sandboxed containers 1.12 and Red Hat build of Trustee 1.1 bring confidential computing to bare metal and AI workloads
Red Hat is excited to announce the release of Red Hat OpenShift sandboxed containers 1.12 and Red Hat build of Trustee 1.1, marking a major leap forward in our confidential computing journey. These releases graduate confidential containers on bare metal from …
14 Apr
Large AI firms hoovering maximum funding, not enough for smaller startups: Y Combinator’s Ankit Gupta
Large AI firms hoovering maximum funding, not enough for smaller startups: Y Combinator’s Ankit Gupta
YC Startup School: India’s talent pool across colleges and universities are key for building next-gen startups, which is what YC is looking to tap into. It wants to target entrepreneurs building for global markets, focussed on fintech, consumer, B2B, and ecom…
14 Apr
TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
TSMC-RESULTS/ (PREVIEW, PIX):PREVIEW-TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
14 Apr
TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
Any profit result ‌above T$505.7 billion would mark the company's highest-ever quarterly net income ​and its ninth consecutive quarter of profit growth
14 Apr
TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
On Thursday, ​TSMC is expected to report a net profit of $17.1 billion for the quarter, according to an LSEG SmartEstimate compiled from 19 analysts. The war in the Middle East threatens to disrupt the supply of production materials for semiconductors such as…
14 Apr
If we can’t kick the habit, how do we manage AI’s energy needs?
If we can’t kick the habit, how do we manage AI’s energy needs?
One can only hope that OpenAI’s Sam Altman was joking when he sought to justify the immense energy consumption of artificial intelligence
14 Apr
What caused Nvidia Blackwell GPU prices to spike? #tech
What caused Nvidia Blackwell GPU prices to spike? #tech
Blackwell GPU hourly “rent” surges on agentic AI demand A compute pricing index tracking hourly costs for Nvidia Blackwell GPUs shows a sharp climb: hourly rental hit $4.08 , up 48% from $2.75 just two months earlier. The reported driver is rising demand tied…
14 Apr
Anthropic Releases Claude Mythos Preview with Cybersecurity Capabilities but Withholds Public Access
Anthropic Releases Claude Mythos Preview with Cybersecurity Capabilities but Withholds Public Access
Anthropic has introduced Claude Mythos Preview, its most advanced AI model, improving significantly in reasoning, coding, and cybersecurity. Unlike previous releases, it will not be publicly available. Access is limited to a consortium of tech companies throu…
14 Apr