GraySoft
Model Comparison

prism-ml/bonsai-8b-gguf vs unsloth/qwen3-vl-30b-a3b-instruct-gguf

Side-by-side comparison of prism-ml/bonsai-8b-gguf and unsloth/qwen3-vl-30b-a3b-instruct-gguf: downloads, license, context length, tasks, and benchmarks.

prism-ml/bonsai-8b-gguf

prism-ml · text-generation

End-to-end 1-bit language model for llama.cpp (CUDA, Metal, CPU). 14.1x smaller than FP16 | 6.2x faster on RTX 4090 | 4-5x lower energy/token
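To put the "14.1x smaller than FP16" figure in concrete terms, here is a rough back-of-the-envelope sketch. It assumes the "8b" in the repository name means about 8 billion parameters, which the listing does not state, and it counts weight storage only. The function and variable names are illustrative.

```python
def estimated_size_gb(num_params: float, bits_per_weight: float) -> float:
    """Approximate weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_weight / 8 / 1e9


# Assumption: "8b" in the repository name means roughly 8e9 parameters.
NUM_PARAMS = 8e9
# Taken from the model description: "14.1x smaller than FP16".
COMPRESSION_RATIO = 14.1

fp16_gb = estimated_size_gb(NUM_PARAMS, 16)   # 16 bits per weight in FP16
compressed_gb = fp16_gb / COMPRESSION_RATIO   # size implied by the stated ratio
effective_bits = 16 / COMPRESSION_RATIO       # implied average bits per weight

print(f"FP16 weights (estimate):       {fp16_gb:.1f} GB")
print(f"Implied compressed size:       {compressed_gb:.2f} GB")
print(f"Implied average bits/weight:   {effective_bits:.2f}")
```

Under these assumptions the stated ratio works out to slightly more than 1 bit per weight on average. The actual file size on disk may differ.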

unsloth/qwen3-vl-30b-a3b-instruct-gguf

unsloth · image-text-to-text

Meet Qwen3-VL — the most powerful vision-language model in the Qwen series to date. This generation delivers comprehensive upgrades across the board: superior text understanding & generation, deeper visual perception & reasoning, extended context length, enhanced spatial and vid…

Side-by-side Specifications

| | prism-ml/bonsai-8b-gguf | unsloth/qwen3-vl-30b-a3b-instruct-gguf |
|---|---|---|
| Author | prism-ml | unsloth |
| Pipeline Task | text-generation | image-text-to-text |
| Library | llama.cpp | |
| Downloads | 83,309 | 30,057 |
| Likes | 618 | 93 |
| License | Unknown | Unknown |
| Context Length | | |
| Created | 2026-03-18 | 2025-10-30 |
| Last Modified | 2026-04-16 | 2026-01-01 |
Tags

prism-ml/bonsai-8b-gguf: llama.cpp, gguf, 1-bit, llama-cpp, cuda, metal, on-device, prismml, bonsai, text-generation

unsloth/qwen3-vl-30b-a3b-instruct-gguf: gguf, unsloth, qwen, qwen3_vl_moe, image-text-to-text, arxiv:2505.09388, arxiv:2502.13923, arxiv:2409.12191, arxiv:2308.12966, base_model:Qwen/Qwen3-VL-30B-A3B-Instruct
