Back to Benchmarks

Claude Opus 4.7 (Adaptive Reasoning, Max Effort) vs DeepSeek R1 Distill Llama 70B

Claude Opus 4.7 (Adaptive Reasoning, Max Effort)

Anthropic

vs

DeepSeek R1 Distill Llama 70B

DeepSeekvia DeepInfra

Claude Opus 4.7 (Adaptive Reasoning, Max Effort) wins 6 of 11

Index Scores

57.3
+41.3
Intelligence
16.0
52.5
+41.1
Coding
11.4
0.0
Math
+53.7
53.7

Benchmarks

0.0
MMLU Pro
+79.5
79.5
91.4
+51.2
GPQA
40.2
39.6
+33.5
HLE
6.1
0.0
LiveCode
+26.6
26.6
54.5
+23.3
SciCode
31.2
0.0
MATH-500
+93.5
93.5
0.0
AIME
+67.0
67.0

Speed

52
+10
TPS
42
Shared
Winning delta

crafted by bart stefanski