Back to Benchmarks

Qwen3.5 397B A17B (Reasoning) vs Llama 3.3 Instruct 70B

Qwen3.5 397B A17B (Reasoning)

Alibaba

vs

Llama 3.3 Instruct 70B

Metavia Groq

Llama 3.3 Instruct 70B(Groq) wins 6 of 11

Index Scores

45.0
+30.5
Intelligence
14.5
41.3
+30.6
Coding
10.7
0.0
Math
+7.7
7.7

Benchmarks

0.0
MMLU Pro
+71.3
71.3
89.3
+39.5
GPQA
49.8
27.3
+23.3
HLE
4.0
0.0
LiveCode
+28.8
28.8
42.0
+16.0
SciCode
26.0
0.0
MATH-500
+77.3
77.3
0.0
AIME
+30.0
30.0

Speed

53
TPS
+91
144
Shared
Winning delta

crafted by bart stefanski