Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) vs Llama 3.3 Instruct 70B

Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort): Anthropic
Llama 3.3 Instruct 70B: Meta via Groq

Llama 3.3 Instruct 70B (Groq) wins 6 of the 11 metrics compared.

Index Scores

Index          Claude Sonnet 4.6   Llama 3.3 Instruct 70B   Winning delta
Intelligence   51.7                14.5                     Claude +37.2
Coding         50.9                10.7                     Claude +40.2
Math           0.0                 7.7                      Llama +7.7

Benchmarks

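Benchmark      Claude Sonnet 4.6   Llama 3.3 Instruct 70B   Winning delta
MMLU Pro       0.0                 71.3                     Llama +71.3
GPQA           87.5                49.8                     Claude +37.7
HLE            30.0                4.0                      Claude +26.0
LiveCode       0.0                 28.8                     Llama +28.8
SciCode        46.8                26.0                     Claude +20.8
MATH-500       0.0                 77.3                     Llama +77.3
AIME           0.0                 30.0                     Llama +30.0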

Speed

Metric         Claude Sonnet 4.6   Llama 3.3 Instruct 70B   Winning delta
TPS            42                  144                      Llama +102
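The win count and winning deltas reported in this comparison can be reproduced with a few lines of arithmetic. The sketch below is illustrative only: the names `SCORES` and `tally` are hypothetical, the numbers are copied from the page, higher is assumed to be better for every metric (including TPS), and the 0.0 entries are taken at face value even though they may reflect results that were not reported rather than true zeros.

```python
# Scores copied from this page as (metric, claude_score, llama_score).
# Assumption: higher is better for every metric, including TPS.
SCORES = [
    ("Intelligence", 51.7, 14.5),
    ("Coding", 50.9, 10.7),
    ("Math", 0.0, 7.7),
    ("MMLU Pro", 0.0, 71.3),
    ("GPQA", 87.5, 49.8),
    ("HLE", 30.0, 4.0),
    ("LiveCode", 0.0, 28.8),
    ("SciCode", 46.8, 26.0),
    ("MATH-500", 0.0, 77.3),
    ("AIME", 0.0, 30.0),
    ("TPS", 42.0, 144.0),
]


def tally(scores):
    """Return (wins per model, {metric: (winner, winning delta)})."""
    wins = {"claude": 0, "llama": 0}
    deltas = {}
    for metric, claude, llama in scores:
        if claude == llama:
            continue  # a tie awards no win and has no winning delta
        winner = "claude" if claude > llama else "llama"
        wins[winner] += 1
        # Round to one decimal to match the precision shown on the page.
        deltas[metric] = (winner, round(abs(claude - llama), 1))
    return wins, deltas


wins, deltas = tally(SCORES)
for metric, (winner, delta) in deltas.items():
    print(f"{metric}: {winner} +{delta}")
print(wins)
```

Under these assumptions the tally matches the headline figure: Llama takes 6 of the 11 metrics and Claude takes 5. Treating the 0.0 entries as missing data instead would change that count.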

crafted by bart stefanski