Claude Opus 4.6 (Adaptive Reasoning, Max Effort) vs GPT-4.1

Claude Opus 4.6 (Adaptive Reasoning, Max Effort): Anthropic
GPT-4.1: OpenAI

GPT-4.1 wins 6 of the 11 comparisons (3 index scores, 7 benchmarks, and speed)

Index Scores

Index         Claude Opus 4.6   GPT-4.1   Winning delta
Intelligence  53.0              26.3      +26.7 (Claude Opus 4.6)
Coding        48.1              21.8      +26.3 (Claude Opus 4.6)
Math          0.0               34.7      +34.7 (GPT-4.1)

Benchmarks

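Benchmark     Claude Opus 4.6   GPT-4.1   Winning delta
MMLU Pro      0.0               80.6      +80.6 (GPT-4.1)
GPQA          89.6              66.6      +23.0 (Claude Opus 4.6)
HLE           36.7              4.6       +32.1 (Claude Opus 4.6)
LiveCode      0.0               45.7      +45.7 (GPT-4.1)
SciCode       51.9              38.1      +13.8 (Claude Opus 4.6)
MATH-500      0.0               91.3      +91.3 (GPT-4.1)
AIME          0.0               43.7      +43.7 (GPT-4.1)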

Speed

Metric        Claude Opus 4.6   GPT-4.1   Winning delta
TPS           36                44        +8 (GPT-4.1)
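
The win count and winning deltas can be rechecked from the scores listed above. The following is a minimal sketch, not the site's own method: the `SCORES` dictionary and the `tally` function are hypothetical names, the values are copied by hand from this page, and it assumes that a higher number wins in every row (including TPS) and that there are no ties.

```python
# Scores copied from this page as (Claude Opus 4.6, GPT-4.1) pairs.
# Assumption: higher is better for every row, including TPS.
SCORES = {
    "Intelligence": (53.0, 26.3),
    "Coding": (48.1, 21.8),
    "Math": (0.0, 34.7),
    "MMLU Pro": (0.0, 80.6),
    "GPQA": (89.6, 66.6),
    "HLE": (36.7, 4.6),
    "LiveCode": (0.0, 45.7),
    "SciCode": (51.9, 38.1),
    "MATH-500": (0.0, 91.3),
    "AIME": (0.0, 43.7),
    "TPS": (36, 44),
}


def tally(scores):
    """Return (wins per model, {row: (winner, winning delta)})."""
    wins = {"Claude Opus 4.6": 0, "GPT-4.1": 0}
    deltas = {}
    for name, (claude, gpt) in scores.items():
        # No ties occur in the data above, so a strict comparison is enough.
        winner = "Claude Opus 4.6" if claude > gpt else "GPT-4.1"
        wins[winner] += 1
        # Round to one decimal to avoid floating-point noise in the difference.
        deltas[name] = (winner, round(abs(claude - gpt), 1))
    return wins, deltas


wins, deltas = tally(SCORES)
print(wins)
for name, (winner, delta) in deltas.items():
    print(f"{name}: {winner} by {delta}")
```

Under those assumptions the tally matches the headline: GPT-4.1 takes 6 rows and Claude Opus 4.6 takes 5.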

crafted by bart stefanski