Company Overview

OpenAI

Founded 2015HQ San Francisco, USA

OpenAI develops frontier multimodal models with a focus on scalable oversight and policy alignment.

Safety Philosophy

Layered defense combining pre-training filters, post-training alignment, and real-time monitoring.

Capability Focus

High reasoning performance with integrated tool use and long-context planning.

Portfolio Metrics Over Time

Aggregate Inspect results

Benchmark Performance

Safety vs Capability

Model roster

  • OpenAI GPT-5 (high)Released 2025-08-12
    Safety 72%Capability 88%Composite 92%

    Current leader on FrontierMath and SWE-bench Verified.

  • OpenAI GPT-4oReleased 2024-05-13
    Safety 75%Capability 68%Composite 84%

    Legacy workhorse with strong refusal behaviour.

Alignment approach

OpenAI has leaned into Inspect evals to stress-test honesty under adversarial prompting, iterating on policy pack enforcement for enterprise deployments.

Roadmap

Upcoming policy monitors include scenario-specific gating for dangerous autonomy and cross-domain exploit detection.