← All comparisons

GPT-5.1 Codex (high) vs Gemma 3 4B Instruct

OpenAI vs Google — side-by-side benchmark comparison

GPT-5.1 Codex (high)Gemma 3 4B Instruct
Intelligence Index43.16.3
Coding Index36.62.9
Math Index95.712.7
Output speed (tok/s)182.10.0
Blended price ($/1M)$3.44$0.05
Time to first token (s)5.42s0.00s
aime6.3%
aime 2595.7%12.7%
artificial analysis coding index36.602.90
artificial analysis intelligence index43.106.30
artificial analysis math index95.7012.70
gpqa86.0%29.1%
hle23.4%5.2%
ifbench70.0%28.3%
lcr67.3%5.7%
livecodebench84.9%11.2%
math 50076.6%
mmlu pro86.0%41.7%
scicode40.2%7.3%
tau283.0%5.0%
terminalbench hard34.8%0.8%

Benchmark data from Artificial Analysis.