GraySoft
Model Comparison

bartowski/deepseek-r1-distill-qwen-7b-ggufvsunsloth/qwen3-8b-gguf

Side-by-side comparison of bartowski/deepseek-r1-distill-qwen-7b-gguf and unsloth/qwen3-8b-gguf: downloads, license, context length, tasks, and benchmarks.

bartowski/deepseek-r1-distill-qwen-7b-gguf

bartowski · text-generation

unsloth/qwen3-8b-gguf

unsloth · text-generation

If you are using llama.cpp, Ollama, Open WebUI etc., you can add /think and /no_think to user prompts or system messages to switch the model's thinking mode from turn to turn. The model will follow the most recent instruction in multi-turn conversations. Here is an example of mu…

Side-by-side Specifications

bartowski/deepseek-r1-distill-qwen-7b-ggufunsloth/qwen3-8b-gguf
Authorbartowskiunsloth
Pipeline Tasktext-generationtext-generation
Librarytransformers
Downloads38,56950,330
Likes115111
LicenseUnknownUnknown
Context Length
Created2025-01-202025-04-28
Last Modified2025-03-072025-06-08
Tags
gguftext-generationbase_model:deepseek-ai/DeepSeek-R1-Distill-Qwen-7Bbase_model:quantized:deepseek-ai/DeepSeek-R1-Distill-Qwen-7Bendpoints_compatibleregion:usimatrixconversational
transformersggufqwen3text-generationqwenunslothenarxiv:2309.00071base_model:Qwen/Qwen3-8Bbase_model:quantized:Qwen/Qwen3-8B

View full details: bartowski/deepseek-r1-distill-qwen-7b-gguf · unsloth/qwen3-8b-gguf