Back to Benchmarks

Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) vs GPT-4.1 mini

Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort)

Anthropic

vs

GPT-4.1 mini

OpenAI

GPT-4.1 mini wins 6 of 11

Index Scores

51.7
+28.8
Intelligence
22.9
50.9
+32.4
Coding
18.5
0.0
Math
+46.3
46.3

Benchmarks

0.0
MMLU Pro
+78.1
78.1
87.5
+21.1
GPQA
66.4
30.0
+25.4
HLE
4.6
0.0
LiveCode
+48.3
48.3
46.8
+6.4
SciCode
40.4
0.0
MATH-500
+92.5
92.5
0.0
AIME
+43.0
43.0

Speed

42
TPS
+9
51
Shared
Winning delta

crafted by bart stefanski