← All comparisons

GPT-4o Realtime (Dec '24) vs Qwen3 4B (Non-reasoning)

OpenAI vs Alibaba — side-by-side benchmark comparison

GPT-4o Realtime (Dec '24)Qwen3 4B (Non-reasoning)
Intelligence Index12.5
Coding Index
Math Index
Output speed (tok/s)0.0103.5
Blended price ($/1M)$0.00$0.19
Time to first token (s)0.00s1.02s
aime21.3%
aime 25
artificial analysis coding index
artificial analysis intelligence index12.50
artificial analysis math index
gpqa39.8%
hle3.7%
ifbench
lcr
livecodebench23.3%
math 50084.3%
mmlu pro58.6%
scicode16.7%
tau2
terminalbench hard

Benchmark data from Artificial Analysis.