Back to Benchmarks

Claude Opus 4.6 (Adaptive Reasoning, Max Effort) vs DeepSeek R1 Distill Llama 70B

Claude Opus 4.6 (Adaptive Reasoning, Max Effort)

Anthropic

vs

DeepSeek R1 Distill Llama 70B

DeepSeekvia DeepInfra

DeepSeek R1 Distill Llama 70B(DeepInfra) wins 6 of 11

Index Scores

53.0
+37.0
Intelligence
16.0
48.1
+36.7
Coding
11.4
0.0
Math
+53.7
53.7

Benchmarks

0.0
MMLU Pro
+79.5
79.5
89.6
+49.4
GPQA
40.2
36.7
+30.6
HLE
6.1
0.0
LiveCode
+26.6
26.6
51.9
+20.7
SciCode
31.2
0.0
MATH-500
+93.5
93.5
0.0
AIME
+67.0
67.0

Speed

36
TPS
+6
42
Shared
Winning delta

crafted by bart stefanski