gpt-oss-120b (high) vs Qwen3.5 397B A17B (Non-reasoning)

OpenAI vs Alibaba — side-by-side benchmark comparison

Metric                                    gpt-oss-120b (high)   Qwen3.5 397B A17B (Non-reasoning)
Intelligence Index                        33.3                  40.1
Coding Index                              28.6                  37.4
Math Index                                93.4                  -
Output speed (tok/s)                      356.8                 53.5
Blended price ($/1M tokens)               $0.26                 $1.35
Time to first token (s)                   0.51                  1.85

Benchmark                                 gpt-oss-120b (high)   Qwen3.5 397B A17B (Non-reasoning)
AIME                                      -                     -
AIME 25                                   93.4%                 -
Artificial Analysis Coding Index          28.60                 37.40
Artificial Analysis Intelligence Index    33.30                 40.10
Artificial Analysis Math Index            93.40                 -
GPQA                                      78.2%                 86.1%
HLE                                       18.5%                 18.8%
IFBench                                   69.0%                 51.6%
LCR                                       50.7%                 58.0%
LiveCodeBench                             87.8%                 -
MATH 500                                  -                     -
MMLU Pro                                  80.8%                 -
SciCode                                   38.9%                 41.1%
tau2                                      65.8%                 83.9%
TerminalBench Hard                        23.5%                 35.6%

Benchmark data from Artificial Analysis.