Claude Opus 4.6 (Adaptive Reasoning, Max Effort) vs DeepSeek R1 Distill Llama 70B | grube.ai

Back to Benchmarks

Claude Opus 4.6 (Adaptive Reasoning, Max Effort)

Anthropic

vs

DeepSeek R1 Distill Llama 70B

DeepSeekvia DeepInfra

DeepSeek R1 Distill Llama 70B(DeepInfra) wins 6 of 11

Index Scores

53.0

+37.0

Intelligence

16.0

48.1

+36.7

Coding

11.4

0.0

Math

+53.7

53.7

Benchmarks

0.0

MMLU Pro

+79.5

79.5

89.6

+49.4

GPQA

40.2

36.7

+30.6

HLE

6.1

0.0

LiveCode

+26.6

26.6

51.9

+20.7

SciCode

31.2

0.0

MATH-500

+93.5

93.5

0.0

AIME

+67.0

67.0

Speed

36

TPS

+6

42

Shared

Winning delta

crafted by bart stefanski