---
type: "Evidence Item"
title: "Introducing Aardvark: OpenAI’s agentic security researcher"
description: "OpenAI introduces Aardvark, an AI-powered security researcher that autonomously finds, validates, and helps fix software vulnerabilities at scale. The system is in private."
resource: "https://openai.com/index/introducing-aardvark"
tags: ["appendix-iii", "benchmark", "openai"]
timestamp: "2025-10-30"
category: "benchmark"
publisher: "OpenAI"
cope_score: 86
confidence: 0.9
---

# Introducing Aardvark: OpenAI’s agentic security researcher

# Claim

OpenAI introduces Aardvark, an AI-powered security researcher that autonomously finds, validates, and helps fix software vulnerabilities at scale. The system is in private beta—sign up to join early testing.

# Relevance

Appendix III, section one: model and benchmark capability evidence

# Oracle Verdict

OpenAI is describing a frontier or production capability that pushes directly on the thesis. The important signal is not the marketing language; it is the widening set of tasks now being routed through model-driven execution rather than ordinary software or headcount.

# Metadata

* Publisher: OpenAI
* Category: benchmark
* Sector: Software engineering
* Capability: Cyber defence and misuse monitoring
* Cope score: 86
* Confidence: 0.9

# Related Concepts

* [Live evidence index](index.md)
* [Thesis](../thesis.md)

# Citations

[1] [Introducing Aardvark: OpenAI’s agentic security researcher](https://openai.com/index/introducing-aardvark)
