OpenAIDecember 17, 2024

o1

Name: o1
Price: 15 USD
Author: OpenAI

First reasoning model, uses chain-of-thought at inference time to solve hard problems

Singularity Index

56.6

RANK 58 / 71

Reasoning

30%

Coding

25%

Agentic

25%

Multimodal

—

10%

General

10%

BENCHMARKS

Benchmark	Score	Rank
ARC-C Grade-school science questions requiring reasoning	97.8%	#11 / 40
MMLU Tests knowledge across 57 subjects from STEM to humanities	91.8%	#12 / 54
MATH Competition-level mathematics problems	96.4%	#13 / 50
ARC-AGIARC Prize Novel reasoning tasks requiring fluid intelligence	0.8%	#23 / 23
HumanEval Coding ability - generating correct Python functions	92.4%	#26 / 50
Arena Elo Human preference ranking via blind comparisons	1350	#37 / 52
SWE-bench Real-world GitHub issue resolution	48.9%	#38 / 40
GPQA PhD-level science questions even experts struggle with	78%	#41 / 64

Input

$15

per 1M tokens

Output

$60

per 1M tokens

Context

200K

tokens