Llama 3.1 Instruct 405B vs Qwen3.5 27B (Non-reasoning)

Meta vs Alibaba — side-by-side benchmark comparison

	Llama 3.1 Instruct 405B	Qwen3.5 27B (Non-reasoning)
Intelligence Index	17.4	37.2
Coding Index	14.5	33.4
Math Index	3.0	—
Output speed (tok/s)	37.5	95.3
Blended price ($/1M)	$3.69	$0.88
Time to first token (s)	0.63s	1.40s
aime	21.3%	—
aime 25	3.0%	—
artificial analysis coding index	14.50	33.40
artificial analysis intelligence index	17.40	37.20
artificial analysis math index	3.00	—
gpqa	51.5%	84.2%
hle	4.2%	13.2%
ifbench	39.0%	46.9%
lcr	24.3%	55.7%
livecodebench	30.5%	—
math 500	70.3%	—
mmlu pro	73.2%	—
scicode	29.9%	36.7%
tau2	19.0%	87.1%
terminalbench hard	6.8%	31.8%

Benchmark data from Artificial Analysis.