GraySoft
Projects Models Compare Cloud benchmarks FAQ Download guIDE →

Google cloud models

56 models tracked via Artificial Analysis. Compare cloud performance, then find local GGUF versions in the GraySoft model catalog.

ModelIntelligenceSpeed (tok/s)
Gemini 3.1 Pro Preview57.2132.199
Gemini 3.5 Flash (high)55.3216.703
Gemini 3.5 Flash (medium)54.8206.66
Gemini 3 Pro Preview (high)48.40
Gemini 3 Flash Preview (Reasoning)46.4203.899
Gemini 3.5 Flash (minimal)43.3200.523
Gemini 3 Pro Preview (low)41.30
Gemma 4 31B (Reasoning)39.235.122
Gemini 3 Flash Preview (Non-reasoning)35190.685
Gemini 2.5 Pro34.6132.023
Gemini 3.1 Flash-Lite33.5324.745
Gemma 4 31B (Non-reasoning)32.342.276
Gemma 4 26B A4B (Reasoning)31.20
Gemini 2.5 Flash Preview (Sep '25) (Reasoning)31.10
Gemini 2.5 Pro Preview (Mar' 25)30.30
Gemini 2.5 Pro Preview (May' 25)29.50
Gemma 4 12B (Reasoning)29.2160.384
Gemma 4 26B A4B (Non-reasoning)27.156.811
Gemini 2.5 Flash (Reasoning)27227.202
Gemini 2.5 Flash Preview (Sep '25) (Non-reasoning)25.70
Gemini 2.5 Flash Preview (Reasoning)24.30
Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning)21.60
Gemini 2.5 Flash (Non-reasoning)20.6199.686
Gemini 2.0 Flash Thinking Experimental (Jan '25)19.60
Gemma 4 12B (Non-reasoning)19.50
Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning)19.40
Gemma 4 E4B (Reasoning)18.80
Gemini 2.0 Flash (Feb '25)18.50
Gemini 2.0 Pro Experimental (Feb '25)18.10
Gemini 2.5 Flash Preview (Non-reasoning)17.80
Gemini 2.5 Flash-Lite (Reasoning)17.6268.199
Gemini 2.0 Flash (experimental)16.80
Gemini 1.5 Pro (Sep '24)160
Gemma 4 E2B (Reasoning)15.20
Gemma 4 E4B (Non-reasoning)14.80
Gemini 2.0 Flash-Lite (Feb '25)14.70
Gemini 2.0 Flash-Lite (Preview)14.50
Gemini 1.5 Flash (Sep '24)13.80
Gemini 2.5 Flash-Lite (Non-reasoning)12.7229.022
Gemini 2.0 Flash Thinking Experimental (Dec '24)12.30

Run models locally with guIDE

Download guIDE — the AI-native code editor with local LLM inference and 69 built-in tools.

Download guIDE → · Browse 524k+ models · Compare models