GraySoft
Model Comparison

prism-ml/bonsai-8b-ggufvsunsloth/deepseek-r1-distill-qwen-1.5b-gguf

Side-by-side comparison of prism-ml/bonsai-8b-gguf and unsloth/deepseek-r1-distill-qwen-1.5b-gguf: downloads, license, context length, tasks, and benchmarks.

prism-ml/bonsai-8b-gguf

prism-ml · text-generation

End-to-end 1-bit language model for llama.cpp (CUDA, Metal, CPU) > **14.1x** smaller than FP16 | **6.2x** faster on RTX 4090 | **4-5x** lower energy/token

unsloth/deepseek-r1-distill-qwen-1.5b-gguf

unsloth · text-generation

We have a free Google Colab Tesla T4 notebook for Llama 3.1 (8B) here: https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Llama3.1_(8B)-Alpaca.ipynb

Side-by-side Specifications

prism-ml/bonsai-8b-ggufunsloth/deepseek-r1-distill-qwen-1.5b-gguf
Authorprism-mlunsloth
Pipeline Tasktext-generationtext-generation
Libraryllama.cpptransformers
Downloads83,30990,618
Likes618133
LicenseUnknownUnknown
Context Length
Created2026-03-182025-01-20
Last Modified2026-04-162025-04-19
Tags
llama.cppgguf1-bitllama-cppcudametalon-deviceprismmlbonsaitext-generation
transformersggufqwen2text-generationdeepseekqwenunslothenarxiv:2501.12948base_model:deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

View full details: prism-ml/bonsai-8b-gguf · unsloth/deepseek-r1-distill-qwen-1.5b-gguf