
Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) vs DeepSeek R1 Distill Qwen 32B

Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort): Anthropic
DeepSeek R1 Distill Qwen 32B: DeepSeek via NextBit

Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) wins 6 of 11

Index Scores

Index          Claude Sonnet 4.6   DeepSeek R1 Distill Qwen 32B   Winning delta
Intelligence   51.7                17.2                           +34.5 (Claude)
Coding         50.9                0.0                            +50.9 (Claude)
Math           0.0                 63.0                           +63.0 (DeepSeek)

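Benchmarks

Benchmark   Claude Sonnet 4.6   DeepSeek R1 Distill Qwen 32B   Winning delta
MMLU Pro    0.0                 73.9                           +73.9 (DeepSeek)
GPQA        87.5                61.5                           +26.0 (Claude)
HLE         30.0                5.5                            +24.5 (Claude)
LiveCode    0.0                 27.0                           +27.0 (DeepSeek)
SciCode     46.8                37.6                           +9.2 (Claude)
MATH-500    0.0                 94.1                           +94.1 (DeepSeek)
AIME        0.0                 68.7                           +68.7 (DeepSeek)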

Speed

Metric   Claude Sonnet 4.6   DeepSeek R1 Distill Qwen 32B   Winning delta
TPS      42                  24                             +18 (Claude)
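
The "wins 6 of 11" headline and the winning deltas can be reproduced from the listed scores. The sketch below is illustrative only: the names SCORES, winning_deltas, and tally are hypothetical, the tuple order (Claude first, DeepSeek second) is a convention of this sketch, and it assumes higher is better for every metric, including TPS. The 0.0 entries are taken at face value; the page does not say whether they are true zeros or unreported scores.

```python
# Scores transcribed from the comparison on this page.
# Tuple order (an assumption of this sketch):
# (Claude Sonnet 4.6, DeepSeek R1 Distill Qwen 32B).
SCORES = {
    "Intelligence": (51.7, 17.2),
    "Coding": (50.9, 0.0),
    "Math": (0.0, 63.0),
    "MMLU Pro": (0.0, 73.9),
    "GPQA": (87.5, 61.5),
    "HLE": (30.0, 5.5),
    "LiveCode": (0.0, 27.0),
    "SciCode": (46.8, 37.6),
    "MATH-500": (0.0, 94.1),
    "AIME": (0.0, 68.7),
    "TPS": (42.0, 24.0),
}


def winning_deltas(scores):
    """Map each metric to (winner, delta); winner is 'claude', 'deepseek', or 'tie'.

    Assumes higher is better for every metric.
    """
    result = {}
    for metric, (claude, deepseek) in scores.items():
        # Round to one decimal to match the page and hide float noise.
        delta = round(abs(claude - deepseek), 1)
        if claude > deepseek:
            winner = "claude"
        elif deepseek > claude:
            winner = "deepseek"
        else:
            winner = "tie"
        result[metric] = (winner, delta)
    return result


def tally(deltas):
    """Count how many metrics each side wins."""
    counts = {"claude": 0, "deepseek": 0, "tie": 0}
    for winner, _ in deltas.values():
        counts[winner] += 1
    return counts


if __name__ == "__main__":
    deltas = winning_deltas(SCORES)
    for metric, (winner, delta) in deltas.items():
        print(f"{metric:<13}{winner:<10}+{delta}")
    print(tally(deltas))
```

Under these assumptions the tally gives Claude 6 wins and DeepSeek 5, matching the headline. Treating 0.0 as missing data instead would change both the count and the deltas.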

Crafted by Bart Stefanski