Claude Opus 4.6 (Adaptive Reasoning, Max Effort) vs GPT-4.1
Claude Opus 4.6 (Adaptive Reasoning, Max Effort) (Anthropic) vs GPT-4.1 (OpenAI)
GPT-4.1 wins 6 of 11
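Index Scores

| Index | Claude Opus 4.6 (Adaptive Reasoning, Max Effort) | GPT-4.1 | Winning delta |
|---|---|---|---|
| Intelligence | 53.0 | 26.3 | +26.7 (Claude Opus 4.6) |
| Coding | 48.1 | 21.8 | +26.3 (Claude Opus 4.6) |
| Math | 0.0 | 34.7 | +34.7 (GPT-4.1) |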
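Benchmarks

| Benchmark | Claude Opus 4.6 (Adaptive Reasoning, Max Effort) | GPT-4.1 | Winning delta |
|---|---|---|---|
| MMLU Pro | 0.0 | 80.6 | +80.6 (GPT-4.1) |
| GPQA | 89.6 | 66.6 | +23.0 (Claude Opus 4.6) |
| HLE | 36.7 | 4.6 | +32.1 (Claude Opus 4.6) |
| LiveCode | 0.0 | 45.7 | +45.7 (GPT-4.1) |
| SciCode | 51.9 | 38.1 | +13.8 (Claude Opus 4.6) |
| MATH-500 | 0.0 | 91.3 | +91.3 (GPT-4.1) |
| AIME | 0.0 | 43.7 | +43.7 (GPT-4.1) |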
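Speed

| Metric | Claude Opus 4.6 (Adaptive Reasoning, Max Effort) | GPT-4.1 | Winning delta |
|---|---|---|---|
| TPS | 36 | 44 | +8 (GPT-4.1) |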
Legend: Shared, Winning delta (the winner's lead over the other model)
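The winning deltas and the "6 of 11" tally can be reproduced from the listed scores. The following is a minimal sketch, not the page's own method: it assumes that a higher value wins on every row (including TPS) and that a score of 0.0 is taken at face value, although it may simply mark a result that was not reported. The names `ROWS`, `winning_delta`, and `tally` are hypothetical and introduced only for illustration.

```python
# (metric, Claude Opus 4.6 score, GPT-4.1 score), copied from the comparison above.
ROWS = [
    ("Intelligence", 53.0, 26.3),
    ("Coding", 48.1, 21.8),
    ("Math", 0.0, 34.7),
    ("MMLU Pro", 0.0, 80.6),
    ("GPQA", 89.6, 66.6),
    ("HLE", 36.7, 4.6),
    ("LiveCode", 0.0, 45.7),
    ("SciCode", 51.9, 38.1),
    ("MATH-500", 0.0, 91.3),
    ("AIME", 0.0, 43.7),
    ("Speed (TPS)", 36.0, 44.0),
]


def winning_delta(claude, gpt):
    """Return (winner, delta), assuming the higher value wins.

    The delta is rounded to one decimal to match the page's precision.
    """
    if claude == gpt:
        return ("tie", 0.0)
    winner = "Claude Opus 4.6" if claude > gpt else "GPT-4.1"
    return (winner, round(abs(claude - gpt), 1))


def tally(rows):
    """Count how many rows each model wins."""
    counts = {}
    for _metric, claude, gpt in rows:
        winner, _delta = winning_delta(claude, gpt)
        counts[winner] = counts.get(winner, 0) + 1
    return counts


if __name__ == "__main__":
    for metric, claude, gpt in ROWS:
        winner, delta = winning_delta(claude, gpt)
        print(f"{metric}: {winner} +{delta}")
    print(tally(ROWS))
```

Under these assumptions the tally gives GPT-4.1 six rows and Claude Opus 4.6 five. Five of the six GPT-4.1 wins are rows where the Claude score is listed as 0.0, so the headline count depends heavily on how those entries are interpreted.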
Crafted by Bart Stefanski