Back to Benchmarks
Qwen3.5 397B A17B (Reasoning) vs GPT-4.1
Qwen3.5 397B A17B (Reasoning)
Alibaba
vs
GPT-4.1
OpenAI
Qwen3.5 397B A17B (Reasoning) wins 6 of 11
Index Scores
45.0
+18.7
Intelligence
26.3
41.3
+19.5
Coding
21.8
0.0
Math
+34.7
34.7
Benchmarks
0.0
MMLU Pro
+80.6
80.6
89.3
+22.7
GPQA
66.6
27.3
+22.7
HLE
4.6
0.0
LiveCode
+45.7
45.7
42.0
+3.9
SciCode
38.1
0.0
MATH-500
+91.3
91.3
0.0
AIME
+43.7
43.7
Speed
53
+9
TPS
44
Shared
Winning delta
crafted by
bart stefanski