GLM-5.1 (Reasoning) vs GLM-4.6V (Non-reasoning)

Side-by-side benchmark comparison of two Z AI models

Metric	GLM-5.1 (Reasoning)	GLM-4.6V (Non-reasoning)
Intelligence Index	51.4	17.1
Coding Index	43.4	11.1
Math Index	—	26.3
Output speed (tok/s)	61.2	38.5
Blended price ($/1M tokens)	2.15	0.45
Time to first token (s)	0.86	1.39
AIME	—	—
AIME 2025	—	26.3%
GPQA	86.8%	56.6%
HLE	28.0%	3.7%
IFBench	76.3%	27.9%
LCR	62.3%	12.3%
LiveCodeBench	—	41.1%
MATH-500	—	—
MMLU-Pro	—	75.2%
SciCode	43.8%	27.2%
τ²-Bench	97.7%	30.7%
Terminal-Bench Hard	43.2%	3.0%

Benchmark data from Artificial Analysis.
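
The headline rows are easier to weigh against each other as ratios. The sketch below is one way to do that arithmetic, using only values copied from the table. The dictionary keys, the function names, and the "index points per dollar" figure are illustrative assumptions made here, not metrics published by Artificial Analysis.

```python
# Values copied from the comparison table, as (GLM-5.1, GLM-4.6V) pairs.
# The key names are hypothetical labels chosen for this sketch.
ROWS = {
    "intelligence_index": (51.4, 17.1),
    "coding_index": (43.4, 11.1),
    "output_speed_tok_s": (61.2, 38.5),
    "blended_price_usd_per_1m": (2.15, 0.45),
    "time_to_first_token_s": (0.86, 1.39),
}


def ratio(row):
    """Return the GLM-5.1 value divided by the GLM-4.6V value for one row."""
    first, second = ROWS[row]
    return first / second


def points_per_dollar(row, price_row="blended_price_usd_per_1m"):
    """Divide each model's score by its blended price (an illustrative metric)."""
    scores = ROWS[row]
    prices = ROWS[price_row]
    return tuple(score / price for score, price in zip(scores, prices))


if __name__ == "__main__":
    for name in ROWS:
        print(f"{name}: GLM-5.1 / GLM-4.6V = {ratio(name):.2f}")
    per_dollar = points_per_dollar("intelligence_index")
    print(f"Intelligence Index points per $: {per_dollar[0]:.1f} vs {per_dollar[1]:.1f}")
```

On these numbers, GLM-5.1 scores about 3.0 times higher on the Intelligence Index at about 4.8 times the blended price, so GLM-4.6V comes out ahead on this rough points-per-dollar figure (about 38.0 vs 23.9). Whether that trade-off matters depends on the workload, and the ratio says nothing about the benchmarks where one model has no reported score.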