OpenAI aims to silence concerns it is falling behind in the AI race with release of new model GPT-5.2

OpenAI announced GPT-5.2, a new model it says materially outperforms prior versions and rivals from Anthropic and Google on professional and coding tasks — claiming 70.9% on its GDPval benchmark (versus 38.8% for GPT-5, 59.6% for Claude Opus 4.5 and 53.3% for Gemini 3 Pro) and 55.6% on SWE‑Bench Pro (roughly 5 percentage points above GPT‑5.1 and >12 points above Gemini 3 Pro). The company, which says the model had been in development for months and tested by alpha customers including Harvey, Notion, Box, Shopify and Zoom, highlights improvements in tool use, coding, mathematical reasoning and safety measures, while declining to disclose detailed training methods beyond broad pretraining enhancements; executives also said a recent internal “Code Red” refocused resources on ChatGPT. The rollout arrives amid intensifying competition from Google and Anthropic and coincides with a new lawsuit alleging ChatGPT’s role in a Connecticut murder‑suicide, underscoring simultaneous commercial upside and legal/reputational risk as OpenAI seeks to reclaim enterprise share in coding and knowledge‑work use cases.

Analysis

OpenAI released GPT-5.2 less than a month after GPT-5.1, asserting material performance gains across professional and coding tasks. On its GDPval benchmark GPT-5.2 met or exceeded human expert performance 70.9% of the time versus 38.8% for GPT-5, 59.6% for Anthropic's Claude Opus 4.5 and 53.3% for Google's Gemini 3 Pro; on SWE‑Bench Pro GPT‑5.2 scored 55.6%, roughly five percentage points above GPT‑5.1 and more than 12 points above Gemini 3 Pro. OpenAI reports the model was in the hands of alpha customers including Harvey, Notion, Box, Shopify and Zoom and highlights stronger tool use, coding and mathematical reasoning—capabilities tied to enterprise deployments and monetization. Executives framed the launch amid an internal "Code Red" prioritization of ChatGPT and acknowledged broad improvements in pretraining while declining to disclose specific training techniques. The commercial case is juxtaposed with legal and reputational risk: the rollout coincides with a Connecticut murder‑suicide lawsuit and other suits alleging ChatGPT contributed to suicides even as OpenAI emphasizes improved safety completions. Sentiment signals are mildly positive but cautious; investors should prioritize observable adoption metrics, third‑party validation and litigation developments before re‑rating AI‑exposed equities.

AllMind

AllMind

OpenAI aims to silence concerns it is falling behind in the AI race with release of new model GPT-5.2

Analysis

AllMind AI Terminal

Market Sentiment

Key Decisions for Investors