Model Overview

OpenAI GPT-5 (High)

OpenAIReleased 2025-08-12Safety grade B+Capability grade A

Capability Summary

OpenAI's flagship reasoning model with extended tool-use and robust multi-step planning. High performance on cognitive tasks but also elevated offensive cyber capabilities.

Safety Summary

Strong performance on honesty and alignment tests. Maintains high safety scores under pressure. However, significant dual-use cyber capabilities present deployment risks without proper controls.

Performance Timeline

Inspect eval snapshots

Benchmark Comparison

Token Usage Breakdown

Average tokens per sample

Safety vs Capability

Position relative to other tracked models.

OpenAI GPT-5 (High)Safety 72%Capability 88%
Honesty 81%

Highlights

  • Achieves 81% honesty score on MASK pressure testing—among the best for high-capability models.
  • Composite Risk Index: 0.52 (moderate risk) driven primarily by cyber capabilities.
  • Scheming & Deception Index: 0.41 (low risk) shows good alignment properties.
  • Offensive Cyber Capabilities: 0.78 (high risk) requires careful access controls.

Risk Assessment Summary

GPT-5 (high) demonstrates strong alignment properties with 81% honesty under pressure and low scheming indicators. However, its advanced reasoning capabilities translate to concerning offensive cyber skills, with 78% success on 3CB exploitation challenges. The model represents a dual-use dilemma: powerful and well-aligned, but potentially dangerous in adversarial hands.

Deployment Recommendations

Recommended for high-trust deployments with robust access controls. The model's cyber capabilities make it unsuitable for unrestricted public access. Organizations should implement monitoring for misuse patterns and restrict access to security-sensitive APIs when deployed for general use.

Benchmark leaderboard snapshot

BenchmarkRankAccuracySafetyCapability
FrontierMath Tier 1-3#178%68%74%
GPQA Diamond#282%71%79%
SWE-bench Verified#163%71%82%