Back to Benchmarks

gpt-oss-120B (high) vs Llama 4 Scout

gpt-oss-120B (high)

OpenAIvia Google

vs

Llama 4 Scout

Metavia Groq

gpt-oss-120B (high)(Google) wins 9 of 11

Index Scores

33.3
+19.8
Intelligence
13.5
28.6
+21.9
Coding
6.7
93.4
+79.4
Math
14.0

Benchmarks

80.8
+5.6
MMLU Pro
75.2
78.2
+19.5
GPQA
58.7
18.5
+14.2
HLE
4.3
87.8
+57.9
LiveCode
29.9
38.9
+21.9
SciCode
17.0
0.0
MATH-500
+84.4
84.4
0.0
AIME
+28.3
28.3

Speed

218
+118
TPS
100
Shared
Winning delta

crafted by bart stefanski