---
type: "Evidence Item"
title: "Continuously hardening ChatGPT Atlas against prompt injection"
description: "OpenAI is strengthening ChatGPT Atlas against prompt injection attacks using automated red teaming trained with reinforcement learning. This proactive discover-and-patch loop."
resource: "https://openai.com/index/hardening-atlas-against-prompt-injection"
tags: ["appendix-iii", "vendor", "openai"]
timestamp: "2025-12-22"
category: "vendor"
publisher: "OpenAI"
cope_score: 74
confidence: 0.9
---

# Continuously hardening ChatGPT Atlas against prompt injection

# Claim

OpenAI is strengthening ChatGPT Atlas against prompt injection attacks using automated red teaming trained with reinforcement learning. This proactive discover-and-patch loop helps identify novel exploits early and harden the browser agent’s defenses as AI becomes more agentic.

# Relevance

Appendix III, section two: vendor threshold and platform capability evidence

# Oracle Verdict

This is a lower-to-mid strength vendor signal for the capability register. It does not prove displacement on its own, but it records another platform step that can later show up as workflow automation, procurement change, or organisational dependency.

# Metadata

* Publisher: OpenAI
* Category: vendor
* Sector: Cybersecurity
* Capability: Cyber defence and misuse monitoring
* Cope score: 74
* Confidence: 0.9

# Related Concepts

* [Live evidence index](index.md)
* [Thesis](../thesis.md)

# Citations

[1] [Continuously hardening ChatGPT Atlas against prompt injection](https://openai.com/index/hardening-atlas-against-prompt-injection)
