GoogleFebruary 15, 2024· Multimodal

Gemini 1.5 Pro

Name: Gemini 1.5 Pro
Author: Google

First 1M token context window, processed entire codebases in one pass

Singularity Index

40.1

RANK 52 / 56

Reasoning

30%

Coding

25%

Agentic

—

25%

Multimodal

10%

General

10%

BENCHMARKS

Benchmark	Score	Rank
HellaSwag Common sense reasoning about everyday situations	92.5%	#20 / 36
ARC-C Grade-school science questions requiring reasoning	94.4%	#31 / 40
HumanEval Coding ability - generating correct Python functions	84.1%	#40 / 50
MATH Competition-level mathematics problems	67.7%	#40 / 50
MMLU Tests knowledge across 57 subjects from STEM to humanities	85.9%	#43 / 54
Arena Elo Human preference ranking via blind comparisons	1260	#44 / 51
MMMUArtificial Analysis College-level multimodal reasoning across 30+ disciplines	55%	#44 / 46
hleArtificial Analysis	4.9%	#54 / 61
GPQA PhD-level science questions even experts struggle with	46.2%	#69 / 73