← All comparisons

Phi-4 Multimodal Instruct vs DeepSeek R1 0528 Qwen3 8B

Microsoft vs DeepSeek — side-by-side benchmark comparison

Phi-4 Multimodal InstructDeepSeek R1 0528 Qwen3 8B
Intelligence Index10.016.4
Coding Index7.8
Math Index63.7
Output speed (tok/s)16.60.0
Blended price ($/1M)$0.00$0.00
Time to first token (s)1.33s0.00s
aime9.3%65.0%
aime 2563.7%
artificial analysis coding index7.80
artificial analysis intelligence index10.0016.40
artificial analysis math index63.70
gpqa31.5%61.2%
hle4.4%5.6%
ifbench19.9%
lcr13.0%
livecodebench13.1%51.3%
math 50069.3%93.2%
mmlu pro48.5%73.9%
scicode11.0%20.4%
tau20.0%
terminalbench hard1.5%

Benchmark data from Artificial Analysis.