Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) vs DeepSeek R1 Distill Qwen 32B | grube.ai

Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort)

Anthropic

vs

DeepSeek R1 Distill Qwen 32B

DeepSeek via NextBit

Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) wins 6 of the 11 metrics compared

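Index Scores

| Index | Claude Sonnet 4.6 | DeepSeek R1 Distill Qwen 32B | Winning delta |
| --- | --- | --- | --- |
| Intelligence | 51.7 | 17.2 | +34.5 (Claude Sonnet 4.6) |
| Coding | 50.9 | 0.0 | +50.9 (Claude Sonnet 4.6) |
| Math | 0.0 | 63.0 | +63.0 (DeepSeek R1 Distill Qwen 32B) |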

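Benchmarks

| Benchmark | Claude Sonnet 4.6 | DeepSeek R1 Distill Qwen 32B | Winning delta |
| --- | --- | --- | --- |
| MMLU Pro | 0.0 | 73.9 | +73.9 (DeepSeek R1 Distill Qwen 32B) |
| GPQA | 87.5 | 61.5 | +26.0 (Claude Sonnet 4.6) |
| HLE | 30.0 | 5.5 | +24.5 (Claude Sonnet 4.6) |
| LiveCode | 0.0 | 27.0 | +27.0 (DeepSeek R1 Distill Qwen 32B) |
| SciCode | 46.8 | 37.6 | +9.2 (Claude Sonnet 4.6) |
| MATH-500 | 0.0 | 94.1 | +94.1 (DeepSeek R1 Distill Qwen 32B) |
| AIME | 0.0 | 68.7 | +68.7 (DeepSeek R1 Distill Qwen 32B) |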

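Speed

| Metric | Claude Sonnet 4.6 | DeepSeek R1 Distill Qwen 32B | Winning delta |
| --- | --- | --- | --- |
| TPS (tokens per second) | 42 | 24 | +18 (Claude Sonnet 4.6) |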
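The "wins 6 of 11" tally and the winning deltas can be rechecked from the scores listed above. The following is a minimal sketch, not the site's actual method: it assumes that higher is better for every metric (including TPS) and it takes the 0.0 entries at face value, although the page does not say whether they are true zeros or missing results. The names `SCORES`, `winning_delta`, and `tally` are hypothetical helpers introduced for illustration.

```python
# Scores transcribed from the comparison above, as
# (Claude Sonnet 4.6, DeepSeek R1 Distill Qwen 32B).
# Assumption: higher is better for every metric, and 0.0 is used as shown.
SCORES = {
    "Intelligence": (51.7, 17.2),
    "Coding": (50.9, 0.0),
    "Math": (0.0, 63.0),
    "MMLU Pro": (0.0, 73.9),
    "GPQA": (87.5, 61.5),
    "HLE": (30.0, 5.5),
    "LiveCode": (0.0, 27.0),
    "SciCode": (46.8, 37.6),
    "MATH-500": (0.0, 94.1),
    "AIME": (0.0, 68.7),
    "TPS": (42.0, 24.0),
}


def winning_delta(left, right):
    """Return (winner, margin), where winner is 'left', 'right', or 'tie'."""
    # Round to one decimal so float noise (e.g. 46.8 - 37.6) does not leak out.
    margin = round(abs(left - right), 1)
    if left > right:
        return "left", margin
    if right > left:
        return "right", margin
    return "tie", 0.0


def tally(scores):
    """Count how many metrics each side wins."""
    wins = {"left": 0, "right": 0, "tie": 0}
    for left, right in scores.values():
        winner, _ = winning_delta(left, right)
        wins[winner] += 1
    return wins


results = {name: winning_delta(*pair) for name, pair in SCORES.items()}
wins = tally(SCORES)

for name, (winner, margin) in results.items():
    print(f"{name}: {winner} by +{margin}")
print(wins)
```

Under these assumptions the left-hand model (Claude Sonnet 4.6) takes 6 metrics and the right-hand model takes 5, which matches the headline. If the 0.0 entries are in fact missing results, those rows would be better excluded than counted as wins.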

crafted by bart stefanski