#1 on Arena (thinking mode), 1M context, strong agentic coding
| Benchmark | Description | Score | Rank |
|---|---|---|---|
| MMLU | Tests knowledge across 57 subjects, from STEM to humanities | 93.1% | #3 / 53 |
| ARC-C | Grade-school science questions requiring reasoning | 98.5% | #4 / 40 |
| HumanEval | Coding ability: generating correct Python functions | 96.1% | #5 / 49 |
| MATH | Competition-level mathematics problems | 98.4% | #5 / 49 |
| HellaSwag | Common-sense reasoning about everyday situations | 97.2% | #5 / 36 |
| Arena Elo | Human preference ranking via blind comparisons | 1483 | #5 / 41 |
| GPQA | PhD-level science questions that even experts struggle with | 88% | #15 / 54 |
| SWE-bench | Real-world GitHub issue resolution | 74.6% | #16 / 38 |
| LiveCodeBench (vals.ai) | Contamination-free competitive programming (filtered by cutoff date) | 80.6% | #16 / 31 |
| MMLU-Pro (vals.ai) | Harder 10-option successor to MMLU; more reasoning-focused | 84.2% | #18 / 30 |
| MMMU (vals.ai) | College-level multimodal reasoning across 30+ disciplines | 72.7% | #26 / 33 |
| Terminal (Artificial Analysis) | Agentic terminal coding tasks requiring multi-step execution | 24.2% | #27 / 37 |
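
The Arena Elo row differs from the others: it is a rating fit from blind pairwise human votes, not a percentage. As a rough illustration of how such ratings move, here is a minimal sketch of the classic Elo update; the K-factor and the concrete ratings are assumptions for the example, and Chatbot Arena's published methodology has used Bradley-Terry model fitting rather than this simple online update.

```python
# Illustrative sketch only: how blind head-to-head votes can be turned
# into an Elo-style rating like the Arena Elo score above.
# K=4 and the starting ratings are assumptions, not Arena's actual config.

def expected_score(r_a: float, r_b: float) -> float:
    """Probability that model A beats model B under the Elo model."""
    return 1.0 / (1.0 + 10 ** ((r_b - r_a) / 400))

def elo_update(r_a: float, r_b: float, a_won: bool, k: float = 4.0):
    """Update both ratings after one blind pairwise comparison."""
    e_a = expected_score(r_a, r_b)
    s_a = 1.0 if a_won else 0.0
    return r_a + k * (s_a - e_a), r_b + k * ((1.0 - s_a) - (1.0 - e_a))

# Example: a 1483-rated model beats a 1400-rated one.
r_a, r_b = elo_update(1483, 1400, a_won=True)
print(round(r_a, 1), round(r_b, 1))  # ~1484.5 1398.5
```

Running the example shifts each rating by only about 1.5 points: a win that was already expected moves the numbers very little, which is why gaps of tens of Elo points on the leaderboard reflect a consistent preference across many votes.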