Back to Benchmarks

Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) vs GPT-4.1

Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort)

Anthropic

vs

GPT-4.1

OpenAI

GPT-4.1 wins 6 of 11

Index Scores

51.7
+25.4
Intelligence
26.3
50.9
+29.1
Coding
21.8
0.0
Math
+34.7
34.7

Benchmarks

0.0
MMLU Pro
+80.6
80.6
87.5
+20.9
GPQA
66.6
30.0
+25.4
HLE
4.6
0.0
LiveCode
+45.7
45.7
46.8
+8.7
SciCode
38.1
0.0
MATH-500
+91.3
91.3
0.0
AIME
+43.7
43.7

Speed

42
TPS
+2
44
Shared
Winning delta

crafted by bart stefanski