AnthropicNovember 24, 2025· Multimodal

Claude Opus 4.5

Name: Claude Opus 4.5
Price: 5 USD
Author: Anthropic

First model to break 80% on SWE-bench Verified

Singularity Index

77.7

RANK 14 / 71

Reasoning

30%

Coding

25%

Agentic

25%

Multimodal

10%

General

10%

BENCHMARKS

Benchmark	Score	Rank
HumanEval Coding ability - generating correct Python functions	96.4%	#3 / 50
MATH Competition-level mathematics problems	100%	#3 / 50
ARC-C Grade-school science questions requiring reasoning	98.6%	#3 / 40
SWE-bench Real-world GitHub issue resolution	80.9%	#3 / 40
MMLU Tests knowledge across 57 subjects from STEM to humanities	92.8%	#5 / 54
HellaSwag Common sense reasoning about everyday situations	96.8%	#7 / 36
Terminal Agentic terminal coding tasks requiring multi-step execution	59.8%	#9 / 48
ARC-AGI Novel reasoning tasks requiring fluid intelligence	37.6%	#12 / 23
MMMUvals.ai College-level multimodal reasoning across 30+ disciplines	82.9%	#13 / 39
MMLU-Provals.ai Harder 10-option successor to MMLU; more reasoning-focused	87.3%	#14 / 38
LiveCodeBenchvals.ai Contamination-free competitive programming (filtered by cutoff date)	83.7%	#20 / 40
Arena Elo Human preference ranking via blind comparisons	1445	#24 / 52
GPQA PhD-level science questions even experts struggle with	87%	#27 / 64

Input

per 1M tokens

Output

$25

per 1M tokens

Context

200K

tokens

Speed

77.7

tokens / sec