Exaone 4.0 1.2B (Non-reasoning) vs Qwen3 VL 235B A22B (Reasoning)

LG AI Research vs Alibaba: side-by-side benchmark comparison

Metric	Exaone 4.0 1.2B (Non-reasoning)	Qwen3 VL 235B A22B (Reasoning)
Intelligence Index	8.1	27.6
Coding Index	2.5	20.9
Math Index	24.0	88.3
Output speed (tok/s)	0.0	35.6
Blended price ($/1M tokens)	$0.00	$2.17
Time to first token (s)	0.00s	5.14s
AIME	—	—
AIME 2025	24.0%	88.3%
GPQA	42.4%	77.2%
Humanity's Last Exam (HLE)	5.8%	10.1%
IFBench	25.3%	56.5%
AA-LCR	0.0%	58.7%
LiveCodeBench	29.3%	64.6%
MATH-500	—	—
MMLU-Pro	50.0%	83.6%
SciCode	7.4%	39.9%
τ²-Bench	20.5%	54.1%
Terminal-Bench Hard	0.0%	11.4%

Benchmark data from Artificial Analysis.
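
To read the table as per-benchmark gaps rather than two columns of raw scores, the rows can be parsed and differenced. The following is a minimal Python sketch and not part of the original comparison. It copies a handful of rows from the table above, treats the dash cell as missing data, and reports the difference in percentage points. The short benchmark keys, the column names `exaone` and `qwen3_vl`, and the helper names `parse_score` and `score_gaps` are all hypothetical choices made for illustration.

```python
import csv
import io

# A few rows copied from the table above, tab-separated.
# The keys and column names are hypothetical short identifiers.
TABLE = (
    "benchmark\texaone\tqwen3_vl\n"
    "aime_25\t24.0%\t88.3%\n"
    "gpqa\t42.4%\t77.2%\n"
    "hle\t5.8%\t10.1%\n"
    "livecodebench\t29.3%\t64.6%\n"
    "math_500\t—\t—\n"
    "mmlu_pro\t50.0%\t83.6%\n"
)


def parse_score(cell):
    """Return a float for cells like '24.0%', or None for the missing-data marker."""
    cell = cell.strip()
    if cell in ("", "—"):
        return None
    return float(cell.rstrip("%"))


def score_gaps(tsv_text):
    """Map each benchmark to (qwen3_vl - exaone) in percentage points."""
    gaps = {}
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    for row in reader:
        small = parse_score(row["exaone"])
        large = parse_score(row["qwen3_vl"])
        if small is None or large is None:
            continue  # no reported score for at least one model
        gaps[row["benchmark"]] = round(large - small, 1)
    return gaps


gaps = score_gaps(TABLE)
# Print the largest gaps first.
for name, gap in sorted(gaps.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name}: {gap:+.1f} pts")
```

Rows where either model has no reported score are skipped rather than treated as zero, so a missing entry does not show up as an artificial gap.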