Back to Benchmarks

Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) vs GPT-4o (Aug '24)

Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort)

Anthropic

vs

GPT-4o (Aug '24)

OpenAI

Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) wins 6 of 11

Index Scores

51.7
+33.1
Intelligence
18.6
50.9
+34.3
Coding
16.6
0.0
Math
0.0

Benchmarks

0.0
MMLU Pro
0.0
87.5
+35.4
GPQA
52.1
30.0
+27.1
HLE
2.9
0.0
LiveCode
+31.7
31.7
46.8
+13.7
SciCode
33.1
0.0
MATH-500
+79.5
79.5
0.0
AIME
+11.7
11.7

Speed

42
+28
TPS
14
Shared
Winning delta

crafted by bart stefanski