Back to Benchmarks

Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) vs Llama 4 Scout

Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort)

Anthropic

vs

Llama 4 Scout

Metavia Groq

Llama 4 Scout(Groq) wins 6 of 11

Index Scores

51.7
+38.2
Intelligence
13.5
50.9
+44.2
Coding
6.7
0.0
Math
+14.0
14.0

Benchmarks

0.0
MMLU Pro
+75.2
75.2
87.5
+28.8
GPQA
58.7
30.0
+25.7
HLE
4.3
0.0
LiveCode
+29.9
29.9
46.8
+29.8
SciCode
17.0
0.0
MATH-500
+84.4
84.4
0.0
AIME
+28.3
28.3

Speed

42
TPS
+58
100
Shared
Winning delta

crafted by bart stefanski