Back to Benchmarks

Claude Opus 4.6 (Adaptive Reasoning, Max Effort) vs GPT-4.1 mini

Claude Opus 4.6 (Adaptive Reasoning, Max Effort)

Anthropic

vs

GPT-4.1 mini

OpenAI

GPT-4.1 mini wins 6 of 11

Index Scores

53.0
+30.1
Intelligence
22.9
48.1
+29.6
Coding
18.5
0.0
Math
+46.3
46.3

Benchmarks

0.0
MMLU Pro
+78.1
78.1
89.6
+23.2
GPQA
66.4
36.7
+32.1
HLE
4.6
0.0
LiveCode
+48.3
48.3
51.9
+11.5
SciCode
40.4
0.0
MATH-500
+92.5
92.5
0.0
AIME
+43.0
43.0

Speed

36
TPS
+15
51
Shared
Winning delta

crafted by bart stefanski