Gemini 2.5 Flash Preview (Sep '25) (Reasoning) vs Claude 4.1 Opus (Non-reasoning)

Google vs Anthropic — side-by-side benchmark comparison

Gemini 2.5 Flash Preview (Sep '25) (Reasoning) / Claude 4.1 Opus (Non-reasoning)
Intelligence Index: 31.1 / 36.0
Coding Index: 24.6
Math Index: 78.3
Output speed (tok/s): 0.0 / 44.7
Blended price ($/1M tokens): $0.00 / $32.81
Time to first token (s): 0.00 / 1.63
aime
aime 25: 78.3%
artificial analysis coding index: 24.60
artificial analysis intelligence index: 31.10 / 36.00
artificial analysis math index: 78.30
gpqa: 79.3%
hle: 12.7%
ifbench: 52.3%
lcr: 64.3%
livecodebench: 71.3%
math 500
mmlu pro: 84.2%
scicode: 40.5%
tau2: 45.6%
terminalbench hard: 16.7%

Benchmark data from Artificial Analysis.