AnthropicAugust 5, 2025· Multimodal

Claude Opus 4.1

Name: Claude Opus 4.1
Price: 15 USD
Author: Anthropic

Improved agentic reliability with better tool use and planning

Singularity Index

68.8

RANK 35 / 71

Reasoning

30%

Coding

25%

Agentic

25%

Multimodal

10%

General

10%

BENCHMARKS

Benchmark	Score	Rank
MMLU-Provals.ai Harder 10-option successor to MMLU; more reasoning-focused	87.9%	#10 / 38
ARC-C Grade-school science questions requiring reasoning	97.6%	#13 / 40
HumanEval Coding ability - generating correct Python functions	94.5%	#14 / 50
HellaSwag Common sense reasoning about everyday situations	92.8%	#19 / 36
SWE-bench Real-world GitHub issue resolution	74.5%	#19 / 40
MMLU Tests knowledge across 57 subjects from STEM to humanities	89.5%	#22 / 54
MMMUvals.ai College-level multimodal reasoning across 30+ disciplines	77.5%	#22 / 39
MATH Competition-level mathematics problems	88.4%	#26 / 50
Arena Elo Human preference ranking via blind comparisons	1372	#32 / 52
TerminalArtificial Analysis Agentic terminal coding tasks requiring multi-step execution	34.3%	#32 / 48
LiveCodeBenchvals.ai Contamination-free competitive programming (filtered by cutoff date)	66.5%	#34 / 40
GPQA PhD-level science questions even experts struggle with	80.9%	#39 / 64

Input

$15

per 1M tokens

Output

$75

per 1M tokens

Context

200K

tokens

Speed

38.2

tokens / sec