GraySoft
Model Comparison

sugoitoolkit/sugoi-14b-ultra-gguf vs unsloth/deepseek-r1-distill-llama-8b-gguf

Side-by-side comparison of sugoitoolkit/sugoi-14b-ultra-gguf and unsloth/deepseek-r1-distill-llama-8b-gguf: downloads, license, context length, tasks, and benchmarks.

sugoitoolkit/sugoi-14b-ultra-gguf

sugoitoolkit · translation

Unleashing the full potential of the previous sugoi 14B model, **Sugoi 14B Ultra** delivers near-double translation accuracy compared to its quantized predecessor—achieving a BLEU score of **21.38 vs 13.67**. Its prompt-following skills rival those of Qwen 2.5 Base, especially w…
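As a quick check on the "near-double" wording, the two BLEU figures quoted above can be compared directly. The sketch below uses only those two numbers; the variable names are illustrative and not taken from the model card.

```python
# BLEU scores as quoted in the model description above.
bleu_ultra = 21.38        # Sugoi 14B Ultra
bleu_predecessor = 13.67  # quantized predecessor

# Ratio and relative gain of the new score over the old one.
ratio = bleu_ultra / bleu_predecessor
absolute_gain = bleu_ultra - bleu_predecessor
relative_gain_pct = absolute_gain / bleu_predecessor * 100

print(f"ratio: {ratio:.2f}x")
print(f"absolute gain: {absolute_gain:.2f} BLEU")
print(f"relative gain: {relative_gain_pct:.1f}%")
```

By this arithmetic the quoted scores correspond to a ratio of roughly 1.56x, i.e. a relative gain of about 56%, which is a substantial improvement but short of a literal doubling.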

unsloth/deepseek-r1-distill-llama-8b-gguf

unsloth · text-generation

We have a free Google Colab notebook for turning Llama 3.1 (8B) into a reasoning model: https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Llama3.1_(8B)-GRPO.ipynb

Side-by-side Specifications

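| | sugoitoolkit/sugoi-14b-ultra-gguf | unsloth/deepseek-r1-distill-llama-8b-gguf |
|---|---|---|
| Author | sugoitoolkit | unsloth |
| Pipeline Task | translation | text-generation |
| Library | | transformers |
| Downloads | 149,473 | 41,456 |
| Likes | 8295 (two values fused in source; split unclear) | |
| License | Unknown | Unknown |
| Context Length | | |
| Created | 2025-08-19 | 2025-01-20 |
| Last Modified | 2025-08-26 | 2025-05-10 |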
Tags

sugoitoolkit/sugoi-14b-ultra-gguf: gguf, translation, ja, en, base_model:sugoitoolkit/Sugoi-14B-Ultra-HF, base_model:quantized:sugoitoolkit/Sugoi-14B-Ultra-HF, license:apache-2.0, endpoints_compatible, region:us, conversational

unsloth/deepseek-r1-distill-llama-8b-gguf: transformers, gguf, llama, text-generation, deepseek, unsloth, llama-3, meta, en, arxiv:2501.12948
