Back to Benchmarks
Claude Opus 4.7 (Adaptive Reasoning, Max Effort) vs GPT-4.1
Claude Opus 4.7 (Adaptive Reasoning, Max Effort)
Anthropic
vs
GPT-4.1
OpenAI
Claude Opus 4.7 (Adaptive Reasoning, Max Effort) wins 6 of 11
Index Scores
57.3
+31.0
Intelligence
26.3
52.5
+30.7
Coding
21.8
0.0
Math
+34.7
34.7
Benchmarks
0.0
MMLU Pro
+80.6
80.6
91.4
+24.8
GPQA
66.6
39.6
+35.0
HLE
4.6
0.0
LiveCode
+45.7
45.7
54.5
+16.4
SciCode
38.1
0.0
MATH-500
+91.3
91.3
0.0
AIME
+43.7
43.7
Speed
52
+8
TPS
44
Shared
Winning delta
crafted by
bart stefanski