GraySoft
Model Comparison

ggml-org/gemma-4-31b-it-ggufvsunsloth/qwen3-8b-gguf

Side-by-side comparison of ggml-org/gemma-4-31b-it-gguf and unsloth/qwen3-8b-gguf: downloads, license, context length, tasks, and benchmarks.

ggml-org/gemma-4-31b-it-gguf

ggml-org · —

# gemma-4-31B-it-GGUF Recommended way to run this model: ``sh llama-server -hf ggml-org/gemma-4-31B-it-GGUF `` Then, access http://localhost:8080

unsloth/qwen3-8b-gguf

unsloth · text-generation

If you are using llama.cpp, Ollama, Open WebUI etc., you can add /think and /no_think to user prompts or system messages to switch the model's thinking mode from turn to turn. The model will follow the most recent instruction in multi-turn conversations. Here is an example of mu…

Side-by-side Specifications

ggml-org/gemma-4-31b-it-ggufunsloth/qwen3-8b-gguf
Authorggml-orgunsloth
Pipeline Tasktext-generation
Librarytransformers
Downloads52,52150,330
Likes34111
LicenseUnknownUnknown
Context Length
Created2026-04-012025-04-28
Last Modified2026-04-122025-06-08
Tags
ggufbase_model:google/gemma-4-31B-itbase_model:quantized:google/gemma-4-31B-itendpoints_compatibleregion:usconversational
transformersggufqwen3text-generationqwenunslothenarxiv:2309.00071base_model:Qwen/Qwen3-8Bbase_model:quantized:Qwen/Qwen3-8B

View full details: ggml-org/gemma-4-31b-it-gguf · unsloth/qwen3-8b-gguf