GoogleFebruary 19, 2026· Multimodal

Gemini 3.1 Pro

Name: Gemini 3.1 Pro
Price: 2 USD
Author: Google

Highest GPQA Diamond score ever at 94.3%, doubled ARC-AGI-2 to 77.1%

Singularity Index

86.0

RANK 4 / 71

Reasoning

30%

Coding

25%

Agentic

25%

Multimodal

10%

General

10%

BENCHMARKS

Benchmark	Score	Rank
GPQA PhD-level science questions even experts struggle with	94.3%	#1 / 64
MMLU-Provals.ai Harder 10-option successor to MMLU; more reasoning-focused	91%	#1 / 38
LiveCodeBenchvals.ai Contamination-free competitive programming (filtered by cutoff date)	88.5%	#1 / 40
ARC-AGI Novel reasoning tasks requiring fluid intelligence	77.1%	#3 / 23
MMMUvals.ai College-level multimodal reasoning across 30+ disciplines	88.2%	#3 / 39
SWE-bench Real-world GitHub issue resolution	80.6%	#5 / 40
Terminal Agentic terminal coding tasks requiring multi-step execution	68.5%	#6 / 48
MMLU Tests knowledge across 57 subjects from STEM to humanities	92.6%	#7 / 54
Arena Elo Human preference ranking via blind comparisons	1520	#7 / 52
HumanEval Coding ability - generating correct Python functions	94.2%	#15 / 50
MATH Competition-level mathematics problems	95.1%	#17 / 50

Input

per 1M tokens

Output

$12

per 1M tokens

Context

tokens

Speed

128.4

tokens / sec