GraySoft
Model Comparison

unsloth/llama-3.2-1b-instruct-ggufvsunsloth/qwen3-4b-gguf

Side-by-side comparison of unsloth/llama-3.2-1b-instruct-gguf and unsloth/qwen3-4b-gguf: downloads, license, context length, tasks, and benchmarks.

unsloth/llama-3.2-1b-instruct-gguf

unsloth · text-generation

16bit, 8bit, 6bit, 5bit, 4bit, 3bit and 2bit uploads avaliable. # Finetune Llama 3.2, Gemma 2, Mistral 2-5x faster with 70% less memory via Unsloth! We have a free Google Colab Tesla T4 notebook for Llama 3.2 (3B) here: https://colab.research.google.com/drive/1T5-zKWM_5OD21QHwXH…

unsloth/qwen3-4b-gguf

unsloth · text-generation

If you are using llama.cpp, Ollama, Open WebUI etc., you can add /think and /no_think to user prompts or system messages to switch the model's thinking mode from turn to turn. The model will follow the most recent instruction in multi-turn conversations. Here is an example of mu…

Side-by-side Specifications

unsloth/llama-3.2-1b-instruct-ggufunsloth/qwen3-4b-gguf
Authorunslothunsloth
Pipeline Tasktext-generationtext-generation
Librarytransformerstransformers
Downloads64,31871,275
Likes58211
LicenseUnknownUnknown
Context Length
Created2024-09-252025-04-28
Last Modified2025-05-092025-06-08
Tags
transformersggufllamatext-generationllama-3metafacebookunslothenbase_model:meta-llama/Llama-3.2-1B-Instruct
transformersggufqwen3text-generationqwenunslothenarxiv:2309.00071base_model:Qwen/Qwen3-4Bbase_model:quantized:Qwen/Qwen3-4B

View full details: unsloth/llama-3.2-1b-instruct-gguf · unsloth/qwen3-4b-gguf