Home TechnologyAIOps is so powerful, vendors are building tools t...
Technology⭐ Featured

AIOps is so powerful, vendors are building tools to clean up after agents break your infrastructure

Cohesity, ServiceNow and Datadog team on recoverability suite Three more vendors have decided that the world needs tools to roll back mistakes made by AI, after Cohesity teamed with ServiceNow and Datadog on a recoverability service that will hunt down all the files and data corrupted by bad AI actors and restore systems to a ā€œtrusted state.ā€ā€¦

7 April 2026 at 04:46 am
1 views
AIOps is so powerful, vendors are building tools to clean up after agents break your infrastructure

In the rapidly evolving world of artificial intelligence-driven operations (AIOps), companies are increasingly relying on AI to manage and optimize their infrastructure. However, as the adoption of AI grows, so does the risk of unintended consequences. To address this, several vendors are now developing tools designed to mitigate the damage caused by AI-driven agents that may inadvertently corrupt data or disrupt systems.

Cohesity, ServiceNow, and Datadog have recently announced a collaborative effort to create a recoverability suite, which aims to identify and restore files and data that have been compromised by AI-related mistakes. This initiative comes as a response to the growing concern among organizations about the potential risks associated with AI-driven operations.

The recoverability suite works by continuously monitoring the infrastructure for any anomalies or corruption caused by AI agents. Once detected, the tool initiates a process to isolate and restore the affected components to a "trusted state," ensuring minimal downtime and data loss. This capability is particularly important in environments where AI is heavily integrated into operational workflows, as it can prevent costly disruptions and maintain the reliability of critical systems.

In addition to Cohesity, ServiceNow, and Datadog, three other vendors have also recognized the need for such tools. These companies are developing their own solutions to address the challenges posed by AI-driven operations. The emergence of these products highlights the growing concern around the potential risks of AI in infrastructure management and the need for robust recovery mechanisms.

The development of these tools is a direct result of real-world incidents where AI-driven agents have caused unintended harm. For example, an AI-powered automation tool might inadvertently modify sensitive data or misconfigure a system, leading to performance degradation or data loss. In such cases, the ability to quickly identify and recover from these issues is crucial for maintaining operational continuity.

The recoverability suite is not just a reactive solution; it also plays a proactive role in ensuring the integrity of the infrastructure. By continuously monitoring and analyzing system behavior, the tool can detect potential issues before they escalate, allowing administrators to take preventive measures. This proactive approach helps organizations avoid costly downtimes and ensures that their AI-driven operations remain reliable and secure.

The collaboration between Cohesity, ServiceNow, and Datadog is a significant step towards addressing the challenges posed by AIOps. As more organizations adopt AI-driven solutions, the need for robust recovery mechanisms will only grow. The development of tools like the recoverability suite is essential for mitigating the risks associated with AI in infrastructure management and ensuring the smooth operation of complex systems.

In conclusion, the increasing adoption of AI-driven operations has led to a heightened awareness of the potential risks and the need for effective recovery solutions. The collaborative efforts of vendors like Cohesity, ServiceNow, Datadog, and others are addressing these concerns by developing tools that can identify and mitigate the damage caused by AI-related mistakes. As organizations continue to integrate AI into their operations, the ability to quickly recover from unintended consequences will be crucial for maintaining the reliability and security of their infrastructure.

šŸ“° Related News
Ekaya Banaras Founder Palak Shah’s ₹40 Lakh Billboard Mistake Became a Masterclass in Startup Marketing
Ekaya Banaras Founder Palak Shah’s ₹40 Lakh Billboard Mistake Became a Masterclass in Startup Marketing
Ekaya Banaras founder Palak Shah recently opened up about one of the most expensive mistakes she made while building her luxury textile brand. During the early years of the company, Shah rented a premium billboard near Delhi’s DLF Emporio to increase brand visibility. However, after forgetting to cancel the campaign, the hoarding reportedly continued running for months — resulting in losses of nearly ₹40 lakh. The incident has now become a viral example of how small operational oversights can turn into costly business lessons for startups and entrepreneurs.
28 May
Betting On AI: Jensen Huang And NVIDIA’s Rise To The Top
Betting On AI: Jensen Huang And NVIDIA’s Rise To The Top
Before AI was inevitable, it was a gamble—and Jensen Huang went all in.
14 Apr
Red Hat OpenShift sandboxed containers 1.12 and Red Hat build of Trustee 1.1 bring confidential computing to bare metal and AI workloads
Red Hat OpenShift sandboxed containers 1.12 and Red Hat build of Trustee 1.1 bring confidential computing to bare metal and AI workloads
Red Hat is excited to announce the release of Red Hat OpenShift sandboxed containers 1.12 and Red Hat build of Trustee 1.1, marking a major leap forward in our confidential computing journey. These releases graduate confidential containers on bare metal from …
14 Apr
Large AI firms hoovering maximum funding, not enough for smaller startups: Y Combinator’s Ankit Gupta
Large AI firms hoovering maximum funding, not enough for smaller startups: Y Combinator’s Ankit Gupta
YC Startup School: India’s talent pool across colleges and universities are key for building next-gen startups, which is what YC is looking to tap into. It wants to target entrepreneurs building for global markets, focussed on fintech, consumer, B2B, and ecom…
14 Apr
TSMC likely to book fourth straight quarter of record profit onĀ insatiable AI demand
TSMC likely to book fourth straight quarter of record profit onĀ insatiable AI demand
TSMC-RESULTS/ (PREVIEW, PIX):PREVIEW-TSMC likely to book fourth straight quarter of record profit onĀ insatiable AI demand
14 Apr
TSMC likely to book fourth straight quarter of record profit onĀ insatiable AI demand
TSMC likely to book fourth straight quarter of record profit onĀ insatiable AI demand
Any profit result ā€Œabove T$505.7 billion would mark the company's highest-ever quarterly net income ​and its ninth consecutive quarter of profit growth
14 Apr
TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
On Thursday, ​TSMC is expected to report a net profit of $17.1 billion for the quarter, according to an LSEG SmartEstimate compiled from 19 analysts. The war in the Middle East threatens to disrupt the supply of production materials for semiconductors such as…
14 Apr
If we can’t kick the habit, how do we manage AI’s energy needs?
If we can’t kick the habit, how do we manage AI’s energy needs?
One can only hope that OpenAI’s Sam Altman was joking when he sought to justify the immense energy consumption of artificial intelligence
14 Apr
What caused Nvidia Blackwell GPU prices to spike? #tech
What caused Nvidia Blackwell GPU prices to spike? #tech
Blackwell GPU hourly ā€œrentā€ surges on agentic AI demand A compute pricing index tracking hourly costs for Nvidia Blackwell GPUs shows a sharp climb: hourly rental hit $4.08 , up 48% from $2.75 just two months earlier. The reported driver is rising demand tied…
14 Apr
Anthropic Releases Claude Mythos Preview with Cybersecurity Capabilities but Withholds Public Access
Anthropic Releases Claude Mythos Preview with Cybersecurity Capabilities but Withholds Public Access
Anthropic has introduced Claude Mythos Preview, its most advanced AI model, improving significantly in reasoning, coding, and cybersecurity. Unlike previous releases, it will not be publicly available. Access is limited to a consortium of tech companies throu…
14 Apr