Back to Benchmarks
Claude 4.5 Haiku (Reasoning) vs Llama 3.3 Instruct 70B
Claude 4.5 Haiku (Reasoning)
Anthropic
vs
Llama 3.3 Instruct 70B
Meta
via Groq
Claude 4.5 Haiku (Reasoning) wins 8 of 11
Index Scores
37.1
+22.6
Intelligence
14.5
32.6
+21.9
Coding
10.7
83.7
+76.0
Math
7.7
Benchmarks
76.0
+4.7
MMLU Pro
71.3
67.2
+17.4
GPQA
49.8
9.7
+5.7
HLE
4.0
61.5
+32.7
LiveCode
28.8
43.3
+17.3
SciCode
26.0
0.0
MATH-500
+77.3
77.3
0.0
AIME
+30.0
30.0
Speed
66
TPS
+78
144
Shared
Winning delta
crafted by
bart stefanski