Back to Benchmarks
GLM-5.1 (Reasoning) vs Llama 3.3 Instruct 70B
GLM-5.1 (Reasoning)
Z AI
via Z.AI
vs
Llama 3.3 Instruct 70B
Meta
via Groq
Llama 3.3 Instruct 70B
(Groq)
wins 6 of 11
Index Scores
51.4
+36.9
Intelligence
14.5
43.4
+32.7
Coding
10.7
0.0
Math
+7.7
7.7
Benchmarks
0.0
MMLU Pro
+71.3
71.3
86.8
+37.0
GPQA
49.8
28.0
+24.0
HLE
4.0
0.0
LiveCode
+28.8
28.8
43.8
+17.8
SciCode
26.0
0.0
MATH-500
+77.3
77.3
0.0
AIME
+30.0
30.0
Speed
21
TPS
+123
144
Shared
Winning delta
crafted by
bart stefanski