Llama 3.1 Instruct 405B vs Qwen3 235B A22B (Reasoning)

Meta vs Alibaba: side-by-side benchmark comparison

Metric                                  Llama 3.1 Instruct 405B   Qwen3 235B A22B (Reasoning)
Intelligence Index                      17.4                      19.8
Coding Index                            14.5                      17.4
Math Index                              3.0                       82.0
Output speed (tok/s)                    37.5                      58.3
Blended price ($/1M tokens)             $3.69                     $2.63
Time to first token (s)                 0.63s                     1.37s
AIME                                    21.3%                     84.0%
AIME 25                                 3.0%                      82.0%
Artificial Analysis Coding Index        14.50                     17.40
Artificial Analysis Intelligence Index  17.40                     19.80
Artificial Analysis Math Index          3.00                      82.00
GPQA                                    51.5%                     70.0%
HLE                                     4.2%                      11.7%
IFBench                                 39.0%                     38.7%
LCR                                     24.3%                     0.0%
LiveCodeBench                           30.5%                     62.2%
MATH 500                                70.3%                     93.0%
MMLU Pro                                73.2%                     82.8%
SciCode                                 29.9%                     39.9%
tau2                                    19.0%                     24.0%
TerminalBench Hard                      6.8%                      6.1%

Benchmark data from Artificial Analysis.