Model Comparison

ggml-org/gemma-4-31b-it-ggufvsunsloth/qwen3-8b-gguf

Side-by-side comparison of ggml-org/gemma-4-31b-it-gguf and unsloth/qwen3-8b-gguf: downloads, license, context length, tasks, and benchmarks.

ggml-org/gemma-4-31b-it-gguf

ggml-org · —

# gemma-4-31B-it-GGUF Recommended way to run this model: ``sh llama-server -hf ggml-org/gemma-4-31B-it-GGUF `` Then, access http://localhost:8080

If you are using llama.cpp, Ollama, Open WebUI etc., you can add /think and /no_think to user prompts or system messages to switch the model's thinking mode from turn to turn. The model will follow the most recent instruction in multi-turn conversations. Here is an example of mu…

Side-by-side Specifications

	ggml-org/gemma-4-31b-it-gguf	unsloth/qwen3-8b-gguf
Author	ggml-org	unsloth
Pipeline Task	—	text-generation
Library	—	transformers
Downloads	52,521	50,330
Likes	34	111
License	Unknown	Unknown
Context Length	—	—
Created	2026-04-01	2025-04-28
Last Modified	2026-04-12	2025-06-08
Tags	ggufbase_model:google/gemma-4-31B-itbase_model:quantized:google/gemma-4-31B-itendpoints_compatibleregion:usconversational	transformersggufqwen3text-generationqwenunslothenarxiv:2309.00071base_model:Qwen/Qwen3-8Bbase_model:quantized:Qwen/Qwen3-8B

View full details: ggml-org/gemma-4-31b-it-gguf · unsloth/qwen3-8b-gguf