---
type: "Evidence Item"
title: "Advancing science and math with GPT-5.2"
description: "GPT-5.2 is OpenAI’s strongest model yet for math and science, setting new state-of-the-art results on benchmarks like GPQA Diamond and FrontierMath. This post shows how those."
resource: "https://openai.com/index/gpt-5-2-for-science-and-math"
tags: ["appendix-iii", "benchmark", "openai"]
timestamp: "2025-12-11"
category: "benchmark"
publisher: "OpenAI"
cope_score: 88
confidence: 0.9
---

# Advancing science and math with GPT-5.2

# Claim

GPT-5.2 is OpenAI’s strongest model yet for math and science, setting new state-of-the-art results on benchmarks like GPQA Diamond and FrontierMath. This post shows how those gains translate into real research progress, including solving an open theoretical problem and generating reliable mathematical proofs.

# Relevance

Appendix III, section one: model and benchmark capability evidence

# Oracle Verdict

OpenAI is describing a frontier or production capability that pushes directly on the thesis. The important signal is not the marketing language; it is the widening set of tasks now being routed through model-driven execution rather than ordinary software or headcount.

# Metadata

* Publisher: OpenAI
* Category: benchmark
* Sector: Scientific research
* Capability: Frontier model release and benchmark movement
* Cope score: 88
* Confidence: 0.9

# Related Concepts

* [Live evidence index](index.md)
* [Thesis](../thesis.md)

# Citations

[1] [Advancing science and math with GPT-5.2](https://openai.com/index/gpt-5-2-for-science-and-math)
