Launched ChatGPT, fastest consumer product to 100M users in history

| Benchmark | Description | Score | Rank |
|---|---|---|---|
| HellaSwag | Common sense reasoning about everyday situations | 85.5% | #33 / 36 |
| ARC-C | Grade-school science questions requiring reasoning | 85.2% | #37 / 40 |
| HumanEval | Coding ability: generating correct Python functions | 48.1% | #46 / 49 |
| MATH | Competition-level mathematics problems | 35.2% | #47 / 49 |
| MMLU | Tests knowledge across 57 subjects from STEM to humanities | 70% | #49 / 53 |