AIOps is so powerful, vendors are building tools to clean up after agents break your infrastructure
Cohesity, ServiceNow and Datadog team on recoverability suite
Three more vendors have decided that the world needs tools to roll back mistakes made by AI, after Cohesity teamed with ServiceNow and Datadog on a recoverability service that will hunt down all the files and data corrupted by bad AI actors and restore systems to a “trusted state.”…

Artificial intelligence for IT operations (AIOps) promises more automation and efficiency in running infrastructure. However, as AI agents take on more of that infrastructure management, the risk that one of them changes or corrupts something it should not have also grows. In response, several technology vendors are developing tools designed to undo the damage caused by AI-driven agents.
Cohesity, ServiceNow, and Datadog have recently announced a collaborative effort to create a recoverability suite, which aims to identify and rectify the impact of AI-induced errors. This initiative comes as a direct response to the increasing reliance on AI for managing complex systems, where a single misstep can lead to significant disruptions.
The recoverability service developed by these vendors operates by scanning the entire infrastructure to locate files and data that have been corrupted or altered by AI agents. Once these problematic elements are identified, the system initiates a process to restore the affected components to a "trusted state," ensuring that the infrastructure functions as intended.
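The scan-and-restore idea described above can be illustrated with a minimal Python sketch. It is not the vendors' implementation, and none of these names come from their products: `build_manifest`, `find_drift`, and `restore` are hypothetical helpers, and the sketch assumes a "trusted state" is simply a snapshot directory whose file hashes were recorded before any agent ran.

```python
import hashlib
import shutil
from pathlib import Path


def file_digest(path: Path) -> str:
    """Return the SHA-256 hex digest of a file's contents."""
    h = hashlib.sha256()
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(65536), b""):
            h.update(chunk)
    return h.hexdigest()


def build_manifest(root: Path) -> dict[str, str]:
    """Map each file's path (relative to root) to its digest."""
    return {
        p.relative_to(root).as_posix(): file_digest(p)
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }


def find_drift(trusted: dict[str, str], live_root: Path) -> list[str]:
    """List trusted files that are missing or altered under live_root."""
    live = build_manifest(live_root)
    return [rel for rel, digest in trusted.items() if live.get(rel) != digest]


def restore(drifted: list[str], snapshot_root: Path, live_root: Path) -> None:
    """Copy each drifted file back from the trusted snapshot (assumed layout)."""
    for rel in drifted:
        target = live_root / rel
        target.parent.mkdir(parents=True, exist_ok=True)
        shutil.copy2(snapshot_root / rel, target)
```

A caller would record `build_manifest(snapshot_root)` once, then run `find_drift` followed by `restore` after an incident. This sketch only catches files that were altered or deleted; files an agent newly created are not in the trusted manifest and would need a separate check.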
This development is not an isolated effort. Cohesity, ServiceNow, and Datadog are three more vendors to decide that the world needs tools to roll back mistakes made by AI, which underscores the growing recognition that AIOps environments need robust recovery mechanisms. As organizations continue to adopt AI-driven solutions, unintended consequences become a critical concern, prompting vendors to prioritize recovery tools.
The collaboration among Cohesity, ServiceNow, and Datadog highlights the importance of safeguards against AI-induced disruptions. By identifying and rectifying errors, these tools aim to maintain the integrity of critical systems, so that the benefits of AIOps are realized without compromising the stability of the infrastructure.
The recoverability suite is designed to work seamlessly within existing AIOps environments, minimizing the need for manual intervention. This automation not only enhances the efficiency of the recovery process but also reduces the risk of human error, which can further exacerbate the situation.
The decision of multiple vendors to invest in such tools signifies a shift in the industry's approach to AI-driven operations. While the potential of AIOps remains undeniable, the focus is now on ensuring that the systems are resilient and capable of recovering from errors. This, in turn, fosters greater confidence among organizations considering the adoption of AI-driven solutions.
In conclusion, the emergence of recoverability tools from multiple vendors reflects a growing awareness of the risks associated with AI-driven operations. By enabling organizations to identify and rectify AI-induced errors, these tools help maintain the stability and reliability of critical infrastructure. As AIOps continues to evolve, the ability to recover from mistakes will become an essential component of effective AI management strategies.