xAI · February 17, 2025 · Multimodal

Grok 3

Name: Grok 3
Price: 3 USD
Author: xAI

Matched frontier labs on reasoning; trained on a cluster of 200K H100 GPUs

Singularity Index: 63.6 (rank 46 / 71)

Category	Weight
Reasoning	30%
Coding	25%
Agentic	25%
Multimodal	10% (no score listed)
General	10%
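
The category percentages above sum to 100%, which suggests the index is a weighted composite of per-category scores. The page does not state the formula, so the following is only a minimal sketch under stated assumptions: that the index is a weighted mean, and that a category with no score (such as Multimodal here) has its weight redistributed proportionally across the others. The function name `composite_index` and the example scores are hypothetical and do not come from the card.

```python
from typing import Mapping, Optional

# Category weights as listed on the card (they sum to 100%).
WEIGHTS = {
    "Reasoning": 0.30,
    "Coding": 0.25,
    "Agentic": 0.25,
    "Multimodal": 0.10,
    "General": 0.10,
}


def composite_index(
    scores: Mapping[str, Optional[float]],
    weights: Mapping[str, float] = WEIGHTS,
) -> float:
    """Weighted mean over the categories that have a score.

    Assumption (not confirmed by the source): the weight of any category
    whose score is missing (None) is redistributed proportionally over
    the remaining categories.
    """
    present = {name: s for name, s in scores.items() if s is not None}
    total_weight = sum(weights[name] for name in present)
    if total_weight == 0:
        raise ValueError("no category scores available")
    weighted_sum = sum(weights[name] * s for name, s in present.items())
    return weighted_sum / total_weight


# Hypothetical category scores, for illustration only (not from the card).
example_scores = {
    "Reasoning": 80.0,
    "Coding": 60.0,
    "Agentic": 40.0,
    "Multimodal": None,  # no score listed
    "General": 70.0,
}
example_index = composite_index(example_scores)
```

Dividing by the sum of the weights actually used keeps the result on the same 0 to 100 scale when a category is missing. A site could equally treat a missing category as zero, which would lower the index, so the redistribution here is a design choice rather than a known rule.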

BENCHMARKS

Benchmark	Description	Score	Rank
MMLU	Tests knowledge across 57 subjects from STEM to humanities	92.7%	#6 / 54
ARC-C	Grade-school science questions requiring reasoning	97.5%	#15 / 40
HumanEval	Coding ability: generating correct Python functions	93.5%	#20 / 50
MATH	Competition-level mathematics problems	93.3%	#21 / 50
SWE-bench	Real-world GitHub issue resolution	63.8%	#29 / 40
Arena Elo	Human preference ranking via blind comparisons	1402	#30 / 52
LiveCodeBench (vals.ai)	Contamination-free competitive programming (filtered by cutoff date)	76.2%	#30 / 40
MMLU-Pro (vals.ai)	Harder 10-option successor to MMLU; more reasoning-focused	81.4%	#31 / 38
GPQA	PhD-level science questions even experts struggle with	84.6%	#32 / 64
Terminal (Artificial Analysis)	Agentic terminal coding tasks requiring multi-step execution	17.4%	#40 / 48

Input: $3 per 1M tokens
Output: $15 per 1M tokens
