GraySoft
Model Comparison

mradermacher/total04-deepseek-r1-distill-llama-70b-heretic-i1-ggufvsunsloth/qwen3-4b-gguf

Side-by-side comparison of mradermacher/total04-deepseek-r1-distill-llama-70b-heretic-i1-gguf and unsloth/qwen3-4b-gguf: downloads, license, context length, tasks, and benchmarks.

mradermacher/total04-deepseek-r1-distill-llama-70b-heretic-i1-gguf

mradermacher · —

## About weighted/imatrix quants of https://huggingface.co/CCSSNE/Total04-DeepSeek-R1-Distill-Llama-70B-heretic ***For a convenient overview and download list, visit our model page for this model.*** static quants are available at https://huggingface.co/mradermacher/Total04-Deep…

unsloth/qwen3-4b-gguf

unsloth · text-generation

If you are using llama.cpp, Ollama, Open WebUI etc., you can add /think and /no_think to user prompts or system messages to switch the model's thinking mode from turn to turn. The model will follow the most recent instruction in multi-turn conversations. Here is an example of mu…

Side-by-side Specifications

mradermacher/total04-deepseek-r1-distill-llama-70b-heretic-i1-ggufunsloth/qwen3-4b-gguf
Authormradermacherunsloth
Pipeline Tasktext-generation
Librarytransformerstransformers
Downloads21,30271,275
Likes0211
LicenseUnknownUnknown
Context Length
Created2026-04-022025-04-28
Last Modified2026-04-032025-06-08
Tags
transformersggufhereticuncensoreddecensoredabliteratedenbase_model:CCSSNE/Total04-DeepSeek-R1-Distill-Llama-70B-hereticbase_model:quantized:CCSSNE/Total04-DeepSeek-R1-Distill-Llama-70B-hereticlicense:mit
transformersggufqwen3text-generationqwenunslothenarxiv:2309.00071base_model:Qwen/Qwen3-4Bbase_model:quantized:Qwen/Qwen3-4B

View full details: mradermacher/total04-deepseek-r1-distill-llama-70b-heretic-i1-gguf · unsloth/qwen3-4b-gguf