Home InternationalUpgrading the Moderation API with our new multimod...
International⭐ Featured

Upgrading the Moderation API with our new multimodal moderation model

We’re introducing a new model built on GPT-4o that is more accurate at detecting harmful text and images, enabling developers to build more robust moderation systems.

6 April 2026 at 12:01 pm
1 views
Upgrading the Moderation API with our new multimodal moderation model

In recent years, the challenge of moderating online content has grown increasingly complex as platforms face an influx of user-generated material. To address this, we are excited to announce the launch of our new multimodal moderation model, built on the cutting-edge GPT-4o architecture. This innovative development promises to significantly enhance the accuracy of detecting harmful text and images, empowering developers to create more robust moderation systems.

The new model leverages the advanced capabilities of GPT-4o, which is designed to process and understand both textual and visual data. This multimodal approach allows the system to analyze content from multiple perspectives, improving its ability to identify and flag inappropriate or harmful content. By integrating this model into existing platforms, developers can ensure that their moderation systems are better equipped to handle the diverse and evolving nature of online content.

One of the key advantages of the new model is its improved accuracy in detecting harmful text. Previous moderation systems often struggled with identifying nuanced or context-dependent harmful content, such as sarcasm or subtle threats. The multimodal model's ability to understand context and intent enhances its capacity to recognize and respond to such instances. This not only protects users from exposure to harmful content but also reduces the burden on human moderators, allowing them to focus on more complex cases.

In addition to text, the new model also excels at detecting harmful images. With the rise of visual content on social media and other platforms, the ability to identify and remove inappropriate or harmful images has become crucial. The multimodal approach enables the system to analyze images in conjunction with their surrounding text, providing a more comprehensive understanding of the content's context. This cross-modal analysis helps to prevent misclassification of benign images that might be mistakenly flagged as harmful due to their visual nature alone.

The development of this new moderation model is part of our ongoing commitment to improving the safety and quality of online environments. By providing developers with a more accurate and effective tool, we hope to see a significant reduction in the prevalence of harmful content across various platforms. This, in turn, can foster healthier online communities where users feel safe and respected.

Furthermore, the multimodal model's flexibility allows it to be easily integrated into existing moderation pipelines. Developers can seamlessly incorporate the new system into their applications, leveraging its advanced capabilities without overhauling their current infrastructure. This ease of integration ensures that the benefits of the improved moderation model can be quickly realized across a wide range of platforms and services.

In conclusion, the introduction of our new multimodal moderation model built on GPT-4o represents a significant step forward in the fight against harmful content online. By combining advanced text and image analysis, the system offers developers a powerful tool to build more robust moderation systems. As online communities continue to grow and evolve, this innovation is poised to make a meaningful impact on the safety and quality of digital interactions.

Source: OpenAI News
📰 Related News
Ollama 0.2.6 Released with Native Gemma 4 Support and Enhanced Performance
Ollama 0.2.6 Released with Native Gemma 4 Support and Enhanced Performance
Ollama 0.2.6 is now live, featuring native support for Google's Gemma 4 models and improved local inference performance for Windows, macOS, and Linux.
14 Apr
Weekly news roundup: Shortages spread to MLCCs; SK Hynix reportedly in talks with Microsoft and Google
Weekly news roundup: Shortages spread to MLCCs; SK Hynix reportedly in talks with Microsoft and Google
Below are the most-read DIGITIMES Asia stories from the week of April 6-April 13, 2026:
14 Apr
cutile-stencil 0.2.0
cutile-stencil 0.2.0
An xDSL-based stencil compiler that generates optimized GPU kernels via NVIDIA cuTile
14 Apr
merlin-llm added to PyPI
merlin-llm added to PyPI
Merlin — a fast local LLM for agentic coding on Apple Silicon
14 Apr
Fluent Cut - Craft and compose videos programmatically in PHP with an elegant fluent API
Fluent Cut - Craft and compose videos programmatically in PHP with an elegant fluent API
Craft and compose videos programmatically in PHP with an elegant fluent API - b7s/fluentcut
14 Apr
Crypto Investor at Center of Trump Corruption Allegations Now Sees Himself as ‘Victim’
Crypto Investor at Center of Trump Corruption Allegations Now Sees Himself as ‘Victim’
Justin Sun has accused Trump-affiliated World Liberty Financial of misconduct and a general lack of transparency.
14 Apr
nvidia-nat-weave 1.7.0a20260413
nvidia-nat-weave 1.7.0a20260413
Subpackage for Weave integration in NeMo Agent Toolkit
14 Apr
nvidia-nat-s3 1.7.0a20260413
nvidia-nat-s3 1.7.0a20260413
Subpackage for S3-compatible integration in NeMo Agent Toolkit
14 Apr
Social Security Trust Fund to Run Dry in 2032: Just 6 Years From Now
Social Security Trust Fund to Run Dry in 2032: Just 6 Years From Now
Six years. That is how much time separates retirees from a Social Security system that, by its own projections, runs out of money. If you are 56 years old...
14 Apr
cane-gpu-perf added to PyPI
cane-gpu-perf added to PyPI
GPU inference benchmarking with opinionated diagnostics
13 Apr