GraySoft
Model Comparison

ggml-org/embeddinggemma-300m-qat-q8_0-ggufvsluffythefox/qwen3.5-9b-claude-4.6-opus-uncensored-distilled-gguf

Side-by-side comparison of ggml-org/embeddinggemma-300m-qat-q8_0-gguf and luffythefox/qwen3.5-9b-claude-4.6-opus-uncensored-distilled-gguf: downloads, license, context length, tasks, and benchmarks.

ggml-org/embeddinggemma-300m-qat-q8_0-gguf

ggml-org · feature-extraction

# embeddinggemma-300m-qat-q8_0 GGUF Recommended way to run this model: ``sh llama-server -hf ggml-org/embeddinggemma-300m-qat-q8_0-GGUF --embeddings ` Then the endpoint can be accessed at http://localhost:8080/embedding, for example using curl: `console curl --request POST \ --u…

luffythefox/qwen3.5-9b-claude-4.6-opus-uncensored-distilled-gguf

LuffyTheFox · text-generation

Thinking is disabled by default in this model via modified chat template file baked in gguf. If you want to enable thinking set variable: {%- set enable_thinking = False %} to True in chat template. I extracted uncensored tensors made by HauhauCS via this script: https://pastebi…

Side-by-side Specifications

ggml-org/embeddinggemma-300m-qat-q8_0-ggufluffythefox/qwen3.5-9b-claude-4.6-opus-uncensored-distilled-gguf
Authorggml-orgLuffyTheFox
Pipeline Taskfeature-extractiontext-generation
Librarysentence-transformers
Downloads47,80679,621
Likes15100
LicenseUnknownUnknown
Context Length
Created2025-09-042026-03-15
Last Modified2025-09-152026-03-18
Tags
sentence-transformersggufsentence-similarityfeature-extractionbase_model:google/embeddinggemma-300m-qat-q8_0-unquantizedbase_model:quantized:google/embeddinggemma-300m-qat-q8_0-unquantizedlicense:gemmaendpoints_compatibleregion:us
ggufqwen3_5unslothqwenqwen3.5reasoningchain-of-thoughtlorauncensorednot-for-all-audiences

View full details: ggml-org/embeddinggemma-300m-qat-q8_0-gguf · luffythefox/qwen3.5-9b-claude-4.6-opus-uncensored-distilled-gguf