
Gemma 4 E4B (Reasoning) vs Claude Opus 4.8 (Adaptive Reasoning, Max Effort)

Google vs Anthropic — side-by-side benchmark comparison

Metric                         Gemma 4 E4B (Reasoning)    Claude Opus 4.8 (Adaptive Reasoning, Max Effort)
Intelligence Index             18.8                       61.4
Coding Index                   13.7                       56.7
Math Index                     -                          -
Output speed (tok/s)           0.0                        66.9
Blended price ($/1M tokens)    $0.00                      $10.94
Time to first token (s)        0.00s                      7.91s
Benchmark                                   Gemma 4 E4B (Reasoning)    Claude Opus 4.8 (Adaptive Reasoning, Max Effort)
aime                                        -                          -
aime 25                                     -                          -
artificial analysis coding index            13.70                      56.70
artificial analysis intelligence index      18.80                      61.40
artificial analysis math index              -                          -
gpqa                                        57.6%                      92.0%
hle                                         3.7%                       45.7%
ifbench                                     44.2%                      62.2%
lcr                                         30.7%                      67.7%
livecodebench                               -                          -
math 500                                    -                          -
mmlu pro                                    -                          -
scicode                                     24.4%                      53.5%
tau2                                        20.8%                      94.4%
terminalbench hard                          8.3%                       58.3%
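
To see at a glance where the two models differ most, the percentage scores listed above can be turned into point gaps. The following is a minimal sketch, not part of the Artificial Analysis data: the `scores` dictionary and the `gaps` helper are illustrative names, and the values are copied by hand from the rows above (benchmarks with no reported value are left out).

```python
# Scores copied by hand from the table above, in percent:
# (Gemma 4 E4B (Reasoning), Claude Opus 4.8 (Adaptive Reasoning, Max Effort)).
# Benchmarks with no reported value are omitted.
scores = {
    "gpqa": (57.6, 92.0),
    "hle": (3.7, 45.7),
    "ifbench": (44.2, 62.2),
    "lcr": (30.7, 67.7),
    "scicode": (24.4, 53.5),
    "tau2": (20.8, 94.4),
    "terminalbench hard": (8.3, 58.3),
}


def gaps(table):
    """Return {benchmark: second score - first score} in percentage points."""
    return {name: round(b - a, 1) for name, (a, b) in table.items()}


point_gaps = gaps(scores)
widest = max(point_gaps, key=point_gaps.get)
narrowest = min(point_gaps, key=point_gaps.get)

# Print the benchmarks from widest to narrowest gap.
for name, gap in sorted(point_gaps.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name:<20}{gap:>6.1f}")
```

With these inputs, the widest gap is on tau2 and the narrowest is on ifbench.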

Benchmark data from Artificial Analysis.