Back to Benchmarks
Qwen3.5 397B A17B (Reasoning) vs Llama 3.3 Instruct 70B
Qwen3.5 397B A17B (Reasoning)
Alibaba
vs
Llama 3.3 Instruct 70B
Meta
via Groq
Llama 3.3 Instruct 70B
(Groq)
wins 6 of 11
Index Scores
45.0
+30.5
Intelligence
14.5
41.3
+30.6
Coding
10.7
0.0
Math
+7.7
7.7
Benchmarks
0.0
MMLU Pro
+71.3
71.3
89.3
+39.5
GPQA
49.8
27.3
+23.3
HLE
4.0
0.0
LiveCode
+28.8
28.8
42.0
+16.0
SciCode
26.0
0.0
MATH-500
+77.3
77.3
0.0
AIME
+30.0
30.0
Speed
53
TPS
+91
144
Shared
Winning delta
crafted by
bart stefanski