GraySoft
Model Comparison

mradermacher/mn-12b-mag-mell-r1-ggufvsunsloth/deepseek-r1-distill-llama-8b-gguf

Side-by-side comparison of mradermacher/mn-12b-mag-mell-r1-gguf and unsloth/deepseek-r1-distill-llama-8b-gguf: downloads, license, context length, tasks, and benchmarks.

mradermacher/mn-12b-mag-mell-r1-gguf

mradermacher · —

## About static quants of https://huggingface.co/inflatebot/MN-12B-Mag-Mell-R1 weighted/imatrix quants are available at https://huggingface.co/mradermacher/MN-12B-Mag-Mell-R1-i1-GGUF

unsloth/deepseek-r1-distill-llama-8b-gguf

unsloth · text-generation

We have a free Google Colab notebook for turning Llama 3.1 (8B) into a reasoning model: https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Llama3.1_(8B)-GRPO.ipynb

Side-by-side Specifications

mradermacher/mn-12b-mag-mell-r1-ggufunsloth/deepseek-r1-distill-llama-8b-gguf
Authormradermacherunsloth
Pipeline Tasktext-generation
Librarytransformerstransformers
Downloads50,03341,456
Likes47295
LicenseUnknownUnknown
Context Length
Created2024-09-162025-01-20
Last Modified2024-09-172025-05-10
Tags
transformersggufmergekitmergeenbase_model:inflatebot/MN-12B-Mag-Mell-R1base_model:quantized:inflatebot/MN-12B-Mag-Mell-R1endpoints_compatibleregion:usconversational
transformersggufllamatext-generationdeepseekunslothllama-3metaenarxiv:2501.12948

View full details: mradermacher/mn-12b-mag-mell-r1-gguf · unsloth/deepseek-r1-distill-llama-8b-gguf