GraySoft
Model Comparison

unsloth/deepseek-r1-distill-llama-8b-ggufvsunsloth/qwen3-1.7b-gguf

Side-by-side comparison of unsloth/deepseek-r1-distill-llama-8b-gguf and unsloth/qwen3-1.7b-gguf: downloads, license, context length, tasks, and benchmarks.

unsloth/deepseek-r1-distill-llama-8b-gguf

unsloth · text-generation

We have a free Google Colab notebook for turning Llama 3.1 (8B) into a reasoning model: https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Llama3.1_(8B)-GRPO.ipynb

unsloth/qwen3-1.7b-gguf

unsloth · text-generation

If you are using llama.cpp, Ollama, Open WebUI etc., you can add /think and /no_think to user prompts or system messages to switch the model's thinking mode from turn to turn. The model will follow the most recent instruction in multi-turn conversations. Here is an example of mu…

Side-by-side Specifications

unsloth/deepseek-r1-distill-llama-8b-ggufunsloth/qwen3-1.7b-gguf
Authorunslothunsloth
Pipeline Tasktext-generationtext-generation
Librarytransformerstransformers
Downloads41,45622,966
Likes29566
LicenseUnknownUnknown
Context Length
Created2025-01-202025-04-28
Last Modified2025-05-102025-06-08
Tags
transformersggufllamatext-generationdeepseekunslothllama-3metaenarxiv:2501.12948
transformersggufqwen3text-generationqwenunslothenbase_model:Qwen/Qwen3-1.7Bbase_model:quantized:Qwen/Qwen3-1.7Blicense:apache-2.0

View full details: unsloth/deepseek-r1-distill-llama-8b-gguf · unsloth/qwen3-1.7b-gguf