Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) vs gpt-oss-120B (high)

Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort): Anthropic
gpt-oss-120B (high): OpenAI via Google

Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) wins 5 of 11

Index Scores

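Index          Claude Sonnet 4.6   gpt-oss-120B (high)   Winning delta
Intelligence   51.7                33.3                  +18.4 (Claude Sonnet 4.6)
Coding         50.9                28.6                  +22.3 (Claude Sonnet 4.6)
Math           0.0                 93.4                  +93.4 (gpt-oss-120B)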

Benchmarks

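Benchmark   Claude Sonnet 4.6   gpt-oss-120B (high)   Winning delta
MMLU Pro    0.0                 80.8                  +80.8 (gpt-oss-120B)
GPQA        87.5                78.2                  +9.3 (Claude Sonnet 4.6)
HLE         30.0                18.5                  +11.5 (Claude Sonnet 4.6)
LiveCode    0.0                 87.8                  +87.8 (gpt-oss-120B)
SciCode     46.8                38.9                  +7.9 (Claude Sonnet 4.6)
MATH-500    0.0                 0.0                   none
AIME        0.0                 0.0                   none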

Speed

Metric   Claude Sonnet 4.6   gpt-oss-120B (high)   Winning delta
TPS      42                  218                   +176 (gpt-oss-120B)
Legend: shared score, winning delta
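The "wins 5 of 11" headline and the winning deltas can be reproduced from the listed scores. The following is a minimal sketch, not the page's own method: the `SCORES` list and the `summarize` helper are hypothetical names introduced here, a "win" is assumed to mean a strictly higher score, and the 0.0 entries are taken at face value even though they may stand for missing results, which the page does not state.

```python
# Scores transcribed from the comparison: (metric, claude_sonnet, gpt_oss).
# Assumption: 0.0 values are used as listed; they may mean "no result".
SCORES = [
    ("Intelligence", 51.7, 33.3),
    ("Coding", 50.9, 28.6),
    ("Math", 0.0, 93.4),
    ("MMLU Pro", 0.0, 80.8),
    ("GPQA", 87.5, 78.2),
    ("HLE", 30.0, 18.5),
    ("LiveCode", 0.0, 87.8),
    ("SciCode", 46.8, 38.9),
    ("MATH-500", 0.0, 0.0),
    ("AIME", 0.0, 0.0),
    ("Speed (TPS)", 42, 218),
]


def summarize(rows):
    """Count wins per model and compute the winning delta for each metric.

    Assumption: a win is a strictly higher score; equal scores are ties.
    """
    wins_claude = wins_gpt = ties = 0
    deltas = {}
    for name, claude, gpt in rows:
        # Round to one decimal to avoid floating-point noise in the difference.
        deltas[name] = round(abs(claude - gpt), 1)
        if claude > gpt:
            wins_claude += 1
        elif gpt > claude:
            wins_gpt += 1
        else:
            ties += 1
    return wins_claude, wins_gpt, ties, deltas


if __name__ == "__main__":
    wins_claude, wins_gpt, ties, deltas = summarize(SCORES)
    print(f"Claude Sonnet 4.6 wins {wins_claude} of {len(SCORES)}")
    print(f"gpt-oss-120B wins {wins_gpt}, ties {ties}")
    for name, delta in deltas.items():
        print(f"{name}: winning delta {delta}")
```

Under these assumptions the count matches the headline: 5 wins for Claude Sonnet 4.6 across the 11 rows (3 index scores, 7 benchmarks, 1 speed metric), with the two 0.0 vs 0.0 rows counted as ties.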

crafted by bart stefanski